Omar Kamali
Multilingual AI & Custom LLMs
I help teams build AI for languages and markets where off-the-shelf models fail. By day, I lead GenAI engineering & LLM training at Blue Yonder. On my own time, I run Omneity Labs, researching multilingual NLP and low-resource languages.
I built the first LLM for Moroccan Darija. I've published peer-reviewed research on multilingual phonetics. I maintain OSS tools, datasets and models for 340+ languages.
Recent writing
Picomon 0.2.0: From AMD Crash Fix to GPU Monitoring That Doesnโt Suck
Earlier this month, I whipped up a Python script with an LLM that parsed amd-smi output. It was ugly. It worked. I called it picomon.
Introducing Wikipedia Monthly: Fresh, Clean Wikipedia Dumps for NLP & AI Research
Announcing Wikipedia Monthly, an always fresh dataset to support research for low-resource languages
Getting Perfectly Structured Data from LLMs
If you've ever struggled to get consistent JSON output from large language models, I have a simple and clever solution for you.
2024: A Year of Growth, Innovation, and Community
As we leave 2024 behind, I found myself reflecting over the holidays on a transformative year that reshaped my grasp of technology's role in human connection.
Datapluck: Portability Tool for Huggingface Datasets
Exporting & importing Hugging Face datasets to spreadsheets and various file formats.
Get in touch
Interested in working together or just want to say hello? Email me at [email protected] or find me on Twitter and LinkedIn.