Open to Opportunities · Summer 2026

Kailash Shankar

BUILDING
TECHNOLOGY
THAT MATTERS


University of Florida · CS + Linguistics · GPA 4.0

30+
Settlements Mapped
2,000
Languages Benchmarked
50+
Students Served
4.00
GPA
scroll to explore
REAL-WORLD IMPACTSOFTWARE ENGINEERINGCOMPUTATIONAL LINGUISTICSAI/ML RESEARCHFULL-STACK DEVELOPMENTLANGUAGE & TECHNOLOGYREAL-WORLD IMPACTSOFTWARE ENGINEERINGCOMPUTATIONAL LINGUISTICSAI/ML RESEARCHFULL-STACK DEVELOPMENTLANGUAGE & TECHNOLOGY
🌍

EDU Africa

Cape Town, South Africa
Software Engineering Intern
May – Jun 2025
30+ settlements connected · 100+ MAU
▼ read more
🤖

UF GatorAI Club

Gainesville, FL
Machine Learning Engineer
Sep 2025 – Present
95% accuracy · 30% ↓ hallucinations · <500ms retrieval
▼ read more
Aug 2025 – Present

Computational Linguistics Lab

University of Florida · Advisor: Dr. Zoey Liu

2,000
LANGUAGES BENCHMARKED

Language shapes how we see the world — yet most of today's AI speaks only a handful of them fluently. Working with Dr. Zoey Liu, I investigate how data partitioning strategies on LLM training data impact model generalization across the world's linguistic diversity, with a particular focus on low-resource languages that are systematically underrepresented in modern AI.

My current work quantifies a fundamental trade-off: how much does annotation quality matter when data is scarce? By systematically injecting controlled annotation noise into training sets and benchmarking OLMo-2 across 2,000 languages on UF's HiPerGator supercomputer, I'm building an empirical map of where AI breaks down — and how to fix it.

Zero-Shot TransferCross-Lingual NLPData PartitioningOLMo-2 (1B)HiPerGator HPCLow-Resource LanguagesAnnotation Noise
⚖️

Quality vs. Quantity

Systematically modeling the trade-off between dataset scale and annotation fidelity — a fundamental question with outsized implications for languages where data is precious.

🌐

2,000 Languages Tested

Benchmarking across a typologically diverse language set to understand how massive multilingual scale affects cross-linguistic transfer beyond high-resource language clusters.

🔬

Morphological Segmentation

Investigating cross-lingual partitioning of morphologically segmented data across language families to improve zero-shot performance for understudied tongues.

Jan 2025 – Present · Featured

LINGUA

AI Language Learning Platform

Students get just one hour of language class a day — nowhere near enough to build real-world fluency. Lingua changes that. An AI-powered platform where high schoolers practice authentic conversations with distinct AI characters, receive real-time feedback on grammar and vocabulary, and teachers get holistic insights into what their entire class is struggling with.

ReactNext.jsSupabaseGemini FlashTailwindPostgreSQLRBACREST API
View on GitHub →
🎭
AI Conversation Characters

Each character has a distinct personality and life story, enabling contextually rich, authentic language practice.

📋
Teacher Assignment Studio

Teachers define topic, theme, depth, key vocabulary, and grammar tenses — then let students take it from there.

📊
Real-Time Performance Insights

Multi-dimensional AI feedback after every conversation — what you did well, what to improve, tracked over time.

🔒
Secure Role-Based Access

Supabase Auth + Row Level Security cleanly separates student and teacher experiences with zero data bleed.

🚀Nov – Dec 2025

AI Career Coach

Resume Optimizer & Interview Simulator

End-to-end AI career prep tool: Gemini-powered ATS-compliant resume generation, mock interview engine with performance persistence, and automated weekly industry skill & salary trend updates via Inngest workflows.

Next.jsNeonDBPrismaInngestGemini Flash
🏠Oct – Nov 2025

Home Price Estimator

Data Structures · Full-Stack

Full-stack web app delivering neighborhood housing price estimates at 98% accuracy. Implements Red-Black Tree and B-Tree structures to query 100,000+ records in O(log n) time — a C++ backend connected to a React frontend via Next.js.

ReactNext.jsC++httplibRed-Black Tree
Languages
Python
C/C++
JavaScript
HTML/CSS
MATLAB
Frameworks
React
Next.js
Node.js
FastAPI
Tailwind CSS
AI/ML
Gemini 2.0
RAG / ChromaDB
OLMo-2
Hugging Face
Databases
PostgreSQL
MongoDB
Supabase
NeonDB
Tools
Docker
Git
Linux
Prisma
Inngest
HiPerGator

REAL PROBLEMS.
REAL SOLUTIONS.

I'm looking for opportunities where I can keep doing what I love — building technology that has a genuine impact on real people's lives. Whether that's a full-time role, a research collaboration, or an internship for Summer 2026, let's talk.

kailashshankar@ufl.eduLinkedIn ↗GitHub ↗

© 2026 KAILASH SHANKAR · GAINESVILLE, FL · UF CLASS OF 2027