{
    "version": "https://jsonfeed.org/version/1",
    "title": "Omar Kamali",
    "home_page_url": "https://omarkamali.com/",
    "feed_url": "https://omarkamali.com/json",
    "description": "I write about technical concepts in human terms.",
    "icon": "https://omarkamali.com/assets/img/ok-social.png",
    "author": {
        "name": "Omar Kamali",
        "url": "https://omarkamali.com/"
    },
    "items": [
        {
            "id": "beyond-tokenization-four-taxes-path-forward",
            "content_html": "The compounding tax stack low-resource languages carry, why vision encoders might hold the key, and the open research questions.",
            "url": "https://omarkamali.com/blog/beyond-tokenization-four-taxes-path-forward",
            "title": "Beyond Tokenization: The Four Taxes and the Path Forward",
            "summary": "The compounding tax stack low-resource languages carry, why vision encoders might hold the key, and the open research questions.",
            "image": "https://os.omneity.xyz/api/v1/property/80c3a5fe-d850-4498-8a22-54222b0aab2c/cms_uploads/0368ab6b2e0665cdaa5d3acabbc02d00.png",
            "date_modified": "2026-03-23T22:14:56.706Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "hidden-tax-llm-pays-bad-tokenization",
            "content_html": "How bad tokenization forces language models to waste capacity on reconstruction instead of reasoning.",
            "url": "https://omarkamali.com/blog/hidden-tax-llm-pays-bad-tokenization",
            "title": "The Hidden Tax Your LLM Pays for Bad Tokenization",
            "summary": "How bad tokenization forces language models to waste capacity on reconstruction instead of reasoning.",
            "image": "https://os.omneity.xyz/api/v1/property/80c3a5fe-d850-4498-8a22-54222b0aab2c/cms_uploads/2fc93fa1b07bebd8eec2ffb433cf7a9c.png",
            "date_modified": "2026-03-23T22:14:56.566Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "tokenization",
            "content_html": "Why tokenization is the hidden bottleneck blocking truly multilingual AI — lessons from building Sawalni and Wikilangs.",
            "url": "https://omarkamali.com/blog/tokenization",
            "title": "Tokenization is Killing Our Multilingual LLM Dream",
            "summary": "Why tokenization is the hidden bottleneck blocking truly multilingual AI — lessons from building Sawalni and Wikilangs.",
            "image": "https://os.omneity.xyz/api/v1/property/80c3a5fe-d850-4498-8a22-54222b0aab2c/cms_uploads/56d467083cd755ffed8006206791724a.png",
            "date_modified": "2026-03-23T22:14:56.418Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "wikipedia-monthly-pipeline",
            "content_html": "It all started with a DM from a friend, member and contributor to the Moroccan Wikipedia community.\r\n\"Are you using the current version of Wikipedia? The official dataset is severely outdated. We added so many cool articles nowhere on huggingface\" He was right. I was running a 2023 snapshot in 2025.",
            "url": "https://omarkamali.com/blog/wikipedia-monthly-pipeline",
            "title": "Why I stopped trusting the official Wikipedia dataset, and what I did about it",
            "summary": "It all started with a DM from a friend, member and contributor to the Moroccan Wikipedia community.\r\n\"Are you using the current version of Wikipedia? The official dataset is severely outdated. We added so many cool articles nowhere on huggingface\" He was right. I was running a 2023 snapshot in 2025.",
            "image": "https://os.omneity.xyz/api/v1/property/80c3a5fe-d850-4498-8a22-54222b0aab2c/cms_uploads/24fca8a5d24d3154946ed257f0d06213.png",
            "date_modified": "2026-03-07T20:58:14.790Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "worldle-for-the-world-wikilangs-launch",
            "content_html": "I built a word game for more than 300 languages, each drawing on its own Wikipedia as the source. Here's the thing nobody tells you: building a simple word game for most of these languages meant building things that didn't exist.",
            "url": "https://omarkamali.com/blog/worldle-for-the-world-wikilangs-launch",
            "title": "A Wordle for the Worldle",
            "summary": "I built a word game for more than 300 languages, each drawing on its own Wikipedia as the source. Here's the thing nobody tells you: building a simple word game for most of these languages meant building things that didn't exist.",
            "image": "https://os.omneity.xyz/api/v1/property/80c3a5fe-d850-4498-8a22-54222b0aab2c/cms_uploads/74e7a687d9f7665aef2934a869a35d87.png",
            "date_modified": "2026-03-04T01:09:53.142Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "picomon-amd-nvidia-apple-silicon-gpu-monitoring",
            "content_html": "Earlier this month, I whipped up a Python script with an LLM that parsed amd-smi output. It was ugly. It worked. I called it picomon.",
            "url": "https://omarkamali.com/blog/picomon-amd-nvidia-apple-silicon-gpu-monitoring",
            "title": "Picomon 0.2.0: From AMD Crash Fix to GPU Monitoring That Doesn’t Suck",
            "summary": "Earlier this month, I whipped up a Python script with an LLM that parsed amd-smi output. It was ugly. It worked. I called it picomon.",
            "image": "https://iwwnvqzchtveeuzoiigl.supabase.co/storage/v1/object/public/posts/picomon-amd-nvidia-apple-silicon-gpu-monitoring-043ea97c.png",
            "date_modified": "2025-12-27T18:14:25.908Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "wikipedia-monthly-fresh-clean-dumps-nlp-ai-research",
            "content_html": "Announcing Wikipedia Monthly, an always fresh dataset to support research for low-resource languages",
            "url": "https://omarkamali.com/blog/wikipedia-monthly-fresh-clean-dumps-nlp-ai-research",
            "title": "Introducing Wikipedia Monthly: Fresh, Clean Wikipedia Dumps for NLP & AI Research",
            "summary": "Announcing Wikipedia Monthly, an always fresh dataset to support research for low-resource languages",
            "image": "https://iwwnvqzchtveeuzoiigl.supabase.co/storage/v1/object/public/posts/wikipedia-monthly-fresh-clean-dumps-nlp-ai-research-bebedf44.png",
            "date_modified": "2025-07-19T22:36:45.738Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "how-to-get-perfectly-structured-data-from-llms",
            "content_html": "If you've ever struggled to get consistent JSON output from large language models, I have a simple and clever solution for you.",
            "url": "https://omarkamali.com/blog/how-to-get-perfectly-structured-data-from-llms",
            "title": "Getting Perfectly Structured Data from LLMs",
            "summary": "If you've ever struggled to get consistent JSON output from large language models, I have a simple and clever solution for you.",
            "image": "https://iwwnvqzchtveeuzoiigl.supabase.co/storage/v1/object/public/posts/89b5baef.jpg",
            "date_modified": "2025-02-23T22:44:56.022Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "2024-a-year-of-growth-innovation-community",
            "content_html": "As we leave 2024 behind, I found myself reflecting over the holidays on a transformative year that reshaped my grasp of technology's role in human connection.",
            "url": "https://omarkamali.com/blog/2024-a-year-of-growth-innovation-community",
            "title": "2024: A Year of Growth, Innovation, and Community",
            "summary": "As we leave 2024 behind, I found myself reflecting over the holidays on a transformative year that reshaped my grasp of technology's role in human connection.",
            "image": "https://iwwnvqzchtveeuzoiigl.supabase.co/storage/v1/object/public/posts/2024-a-year-of-growth-innovation-community-2c7486df.jpg",
            "date_modified": "2025-01-04T01:50:31.474Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "datapluck",
            "content_html": "Exporting & importing Hugging Face datasets to spreadsheets and various file formats.",
            "url": "https://omarkamali.com/blog/datapluck",
            "title": "Datapluck: Portability Tool for Huggingface Datasets",
            "summary": "Exporting & importing Hugging Face datasets to spreadsheets and various file formats.",
            "image": "https://iwwnvqzchtveeuzoiigl.supabase.co/storage/v1/object/public/posts/datapluck-0df948f9.jpg",
            "date_modified": "2024-09-05T02:03:25.632Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "openai-rag",
            "content_html": "OpenAI's data integrations in Assistant and GPTs is causing ripples in the AI world. Beyond the excitement, let's look at OpenAI's strategy critically, the tension in its ecosystem, questionable comparisons to Apple, and the impending threat of commodification that OpenAI itself may face.",
            "url": "https://omarkamali.com/blog/openai-rag",
            "title": "OpenAI RAG: A Step Forward or a Step Too Far?",
            "summary": "OpenAI's data integrations in Assistant and GPTs is causing ripples in the AI world. Beyond the excitement, let's look at OpenAI's strategy critically, the tension in its ecosystem, questionable comparisons to Apple, and the impending threat of commodification that OpenAI itself may face.",
            "image": "https://iwwnvqzchtveeuzoiigl.supabase.co/storage/v1/object/public/posts/openai-rag-b58ad7ff.png",
            "date_modified": "2023-11-10T23:31:16.821Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "web-scraping-data-for-everyone",
            "content_html": "We know what Data is, but where does it come from? With Web Scraping you can collect data from any website. This article will get you started in this world.",
            "url": "https://omarkamali.com/blog/web-scraping-data-for-everyone",
            "title": "Web Scraping: Data For Everyone",
            "summary": "We know what Data is, but where does it come from? With Web Scraping you can collect data from any website. This article will get you started in this world.",
            "image": "https://iwwnvqzchtveeuzoiigl.supabase.co/storage/v1/object/public/posts/web-scraping-data-for-everyone-b9af80c0.png",
            "date_modified": "2023-10-30T00:56:54.621Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "what-is-data-a-brief-guide",
            "content_html": "An introduction to ease you into the world of data, what it is, what is it useful for and privacy concerns, as a preamble to the Data Series.",
            "url": "https://omarkamali.com/blog/what-is-data-a-brief-guide",
            "title": "What is Data: A Brief Guide",
            "summary": "An introduction to ease you into the world of data, what it is, what is it useful for and privacy concerns, as a preamble to the Data Series.",
            "image": "https://iwwnvqzchtveeuzoiigl.supabase.co/storage/v1/object/public/posts/what-is-data-675c5320.png",
            "date_modified": "2023-10-25T15:29:24.938Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "shaping-the-future-with-sawalni-dawn-of-moroccan-ai",
            "content_html": "I've been asked multiple times, \"Why are you creating a Moroccan AI?\" Today I want to share the story behind Sawalni, the first AI in history to speak our beautiful Moroccan Darija, with all of you.",
            "url": "https://omarkamali.com/blog/shaping-the-future-with-sawalni-dawn-of-moroccan-ai",
            "title": "Shaping the Future with Sawalni: The Dawn of Moroccan AI",
            "summary": "I've been asked multiple times, \"Why are you creating a Moroccan AI?\" Today I want to share the story behind Sawalni, the first AI in history to speak our beautiful Moroccan Darija, with all of you.",
            "image": "https://iwwnvqzchtveeuzoiigl.supabase.co/storage/v1/object/public/posts/shaping-the-future-with-sawalni-dawn-of-moroccan-ai-06c45bc6.png",
            "date_modified": "2023-10-17T18:48:24.897Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "tokens-the-secret-language-of-ai",
            "content_html": "Have you ever wondered how AI systems make sense of the vast amount of information they encounter? Let's look at AI tokens and why you should care.",
            "url": "https://omarkamali.com/blog/tokens-the-secret-language-of-ai",
            "title": "Tokens, the secret language of AI",
            "summary": "Have you ever wondered how AI systems make sense of the vast amount of information they encounter? Let's look at AI tokens and why you should care.",
            "image": "https://iwwnvqzchtveeuzoiigl.supabase.co/storage/v1/object/public/posts/tokens-the-secret-language-of-ai-c9edb1b0.jpg",
            "date_modified": "2023-06-26T02:06:39.137Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "dns-for-non-techies-domains-website-performance-and-content-restrictions",
            "content_html": "New websites are launched every day and others stop existing overnight. Servers and IP addresses get repurposed. How does DNS keep it all together?",
            "url": "https://omarkamali.com/blog/dns-for-non-techies-domains-website-performance-and-content-restrictions",
            "title": "DNS for non-Techies: Domains, Website Performance, and Content restrictions",
            "summary": "New websites are launched every day and others stop existing overnight. Servers and IP addresses get repurposed. How does DNS keep it all together?",
            "image": "https://iwwnvqzchtveeuzoiigl.supabase.co/storage/v1/object/public/posts/dns-for-non-techies-domains-website-performance-and-content-restrictions-4a31e005.jpg",
            "date_modified": "2023-06-14T20:25:11.480Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "a-newsletter-using-gmail-google-sheets",
            "content_html": "I've recently wanted to add a newsletter to my blog. I decided to build it instead of using a marketing tool.",
            "url": "https://omarkamali.com/blog/a-newsletter-using-gmail-google-sheets",
            "title": "How I built my newsletter using Gmail and Google Sheets",
            "summary": "I've recently wanted to add a newsletter to my blog. I decided to build it instead of using a marketing tool.",
            "image": "https://iwwnvqzchtveeuzoiigl.supabase.co/storage/v1/object/public/posts/a-newsletter-using-gmail-google-sheets-aebb7fd4.jpg",
            "date_modified": "2023-03-12T00:06:08.035Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "it-starts-with-a-message-internet-dns-networks-for-non-techies-part-1",
            "content_html": "There are over 22 billion machines connected to the internet. How do these machines communicate with each other to fulfil their purpose?",
            "url": "https://omarkamali.com/blog/it-starts-with-a-message-internet-dns-networks-for-non-techies-part-1",
            "title": "It starts with a message: Internet, DNS & Networks for non-Techies",
            "summary": "There are over 22 billion machines connected to the internet. How do these machines communicate with each other to fulfil their purpose?",
            "image": "https://iwwnvqzchtveeuzoiigl.supabase.co/storage/v1/object/public/posts/dns-internet-routing-a-postal-system-for-machines-a43d1b97.avif",
            "date_modified": "2023-03-10T23:38:50.936Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "what-happens-when-you-open-a-tiktok",
            "content_html": "Do you ever stop and think about the countless invisible components working behind the scenes to bring an app like TikTok to life on your phone?",
            "url": "https://omarkamali.com/blog/what-happens-when-you-open-a-tiktok",
            "title": "What happens when you open a TikTok?",
            "summary": "Do you ever stop and think about the countless invisible components working behind the scenes to bring an app like TikTok to life on your phone?",
            "image": "https://iwwnvqzchtveeuzoiigl.supabase.co/storage/v1/object/public/posts/what-happens-when-you-open-a-tiktok-7ef4df13.png",
            "date_modified": "2023-03-09T20:40:37.000Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "essential-guide-to-digital-infrastructure-how-the-internet-works",
            "content_html": "",
            "url": "https://omarkamali.com/blog/essential-guide-to-digital-infrastructure-how-the-internet-works",
            "title": "How the Internet Works: The Essential Guide to Digital Infrastructure",
            "image": "https://iwwnvqzchtveeuzoiigl.supabase.co/storage/v1/object/public/posts/essential-guide-to-digital-infrastructure-how-the-internet-works-caa00886.png",
            "date_modified": "2023-03-06T20:29:59.000Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "the-market-decides-your-project-scope",
            "content_html": "",
            "url": "https://omarkamali.com/blog/the-market-decides-your-project-scope",
            "title": "The Market Decides Your Project Scope",
            "image": "https://iwwnvqzchtveeuzoiigl.supabase.co/storage/v1/object/public/posts/the-market-decides-your-project-scope-18856089.png",
            "date_modified": "2019-02-09T01:00:19.000Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "should-you-automate-this-task",
            "content_html": "",
            "url": "https://omarkamali.com/blog/should-you-automate-this-task",
            "title": "Should you automate this task?",
            "image": "https://iwwnvqzchtveeuzoiigl.supabase.co/storage/v1/object/public/posts/should-you-automate-this-task-444fc0ee.png",
            "date_modified": "2019-02-05T01:27:44.000Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        },
        {
            "id": "first-post",
            "content_html": "",
            "url": "https://omarkamali.com/blog/first-post",
            "title": "First Post",
            "image": "https://iwwnvqzchtveeuzoiigl.supabase.co/storage/v1/object/public/posts/first-post-ea146daf.png",
            "date_modified": "2019-02-03T01:06:07.000Z",
            "author": {
                "name": "Omar Kamali",
                "url": "https://omarkamali.com/"
            }
        }
    ]
}