REAL-WORLD IMPACT ✦SOFTWARE ENGINEERING ✦COMPUTATIONAL LINGUISTICS ✦AI/ML RESEARCH ✦FULL-STACK DEVELOPMENT ✦LANGUAGE & TECHNOLOGY ✦REAL-WORLD IMPACT ✦SOFTWARE ENGINEERING ✦COMPUTATIONAL LINGUISTICS ✦AI/ML RESEARCH ✦FULL-STACK DEVELOPMENT ✦LANGUAGE & TECHNOLOGY ✦
Next Up
AI Product Builder Intern
Summer 2026
Experience
EDU Africa
Cape Town, South Africa
Software Engineering Intern
May – Jun 2025
Working with a non-profit in Cape Town, I built an interactive mapping application giving residents of underprivileged neighborhoods direct access to local government contacts and a channel to report critical housing issues — from evictions to power outages and water shortages. Every pin on that map represents a family that now has a voice. Built on a MERN stack with a custom REST API, the platform serves 100+ monthly active users across 30+ informal settlements in the Western Cape.
MERN StackReactLeafletREST APINode.js
UF GatorAI Club
Gainesville, FL
Machine Learning Engineer
Sep 2025 – Present
Built an AI Teaching Assistant that deploys course-specific chatbot instances for 50+ students, with Gemini 2.0 orchestration and custom guardrails that cut hallucination rates by 30% while maintaining academic integrity. Engineered a RAG pipeline with ChromaDB achieving sub-500ms semantic retrieval across 1,000+ academic documents.
Next.jsFastAPIGemini 2.0RAGChromaDB
Research
Aug 2025 – Present · University of Florida · with Dr. Zoey Liu
Computational Linguistics Lab
Language shapes how we see the world — yet most of today’s AI speaks only a handful of them fluently. Working with Dr. Zoey Liu, I investigate how data partitioning strategies on LLM training data impact model generalization across the world’s linguistic diversity, with a particular focus on low-resource languages that are systematically underrepresented in modern AI.
My current work quantifies a fundamental trade-off: how much does annotation quality matter when data is scarce? By systematically injecting controlled annotation noise into training sets and benchmarking OLMo-2 across 2,000 languages on UF’s HiPerGator supercomputer, I’m building an empirical map of where AI breaks down — and how to fix it.
✦ ✦ ✦
Quality vs. Quantity
Modeling the trade-off between dataset scale and annotation fidelity — a question with outsized implications for languages where data is precious.
2,000 Languages
Benchmarking across a typologically diverse language set to understand how multilingual scale affects cross-linguistic transfer beyond high-resource clusters.
Morphological Segmentation
Investigating cross-lingual partitioning of morphologically segmented data across language families to improve zero-shot performance for understudied tongues.
Lingua is the most powerful LMS ever created for foreign language education. Teachers spend hours upon hours creating and grading assignments - a burdensome task when you have 100+ students. Lingua reduces this to just seconds, allowing teachers to create high-quality reading, writing, listening, and speaking assignments tailored to their curriculum with just a few clicks, auto-graded by AI against their own rubrics and instructions, with personalized feedback for each student.
ReactNext.jsSupabaseGemini Live & TTSTailwindPostgreSQLRBACREST API
Students converse face-to-face with distinct AI characters based on in-class topics, enabling contextually rich, authentic language practice.
Reading & Listening Assignments
Teachers create authentic passages, audios, and comprehension questions tailored to their curriculum in just seconds. Students complete them in real-time with instant scoring feedback.
Writing Assignments
Prompt-based writing tasks, submitted through the platform text editor or handwritten images, are auto-graded by AI against teacher-defined rubrics, and marked up with detailed personalized feedback.
Oral Examinations
Students participate in presentations, conversations and discussions with the AI that are auto-graded with comprehensive feedback.
Nov – Dec 2025
AI Career Coach
Resume Optimizer & Interview Simulator
End-to-end AI career prep tool: Gemini-powered ATS-compliant resume generation, mock interview engine with performance persistence, and automated weekly industry skill & salary trend updates via Inngest workflows.
Next.jsNeonDBPrismaInngestGemini Flash
Oct – Nov 2025
Home Price Estimator
Data Structures · Full-Stack
Full-stack web app delivering neighborhood housing price estimates at 98% accuracy. Implements Red-Black Tree and B-Tree structures to query 100,000+ records in O(log n) time — a C++ backend connected to a React frontend via Next.js.
ReactNext.jsC++httplibRed-Black Tree
Technical Arsenal
Languages
Python
C/C++
JavaScript
HTML/CSS
MATLAB
Frameworks
React
Next.js
Node.js
FastAPI
Tailwind CSS
AI/ML
Gemini Flash, Live, TTS
RAG / ChromaDB
OLMo-2
Hugging Face
Databases
PostgreSQL
MongoDB
Supabase
NeonDB
Tools
Docker
Git
Linux
Prisma
Inngest
HiPerGator
Let's Build Something
Real problems. Real solutions.
I'm looking for opportunities where I can keep doing what I love — building technology that has a genuine impact on real people's lives.