Building the Next Generation of Agentic AI

Welcome! I am Udit Jain, an Agentic AI Engineer on the AI Core Team at Samsung Research, focused on the frontier of large language model post-training and autonomous agent systems.

What I Work On

My current work lives at the intersection of post-training research and production-scale AI:

LLM Post-Training: RLHF, DPO, and SFT pipelines for instruction-following and alignment
Reinforcement Learning for LLMs: Reward modeling, policy optimization, and RL-based fine-tuning
Knowledge Distillation: Transferring capabilities from frontier models to efficient, deployable targets
Agentic Evaluation: Building multi-turn agent benchmarks and evaluation pipelines for complex reasoning tasks
Training Infrastructure: Distributed training with NeMo, large-scale inference with vLLM

Previously, I led computer vision R&D for Samsung’s next-generation smart appliances, shipped on-device AI models to millions of users, and built health data pipelines over Samsung Health’s 200M+ user dataset.

Please explore my Projects, read about my Professional Journey, or view my Resume & Skills.
