Apple
Apr 2025 — Present- Building World-Class Question Answering for Siri.
- Training and optimizing GenAI models using advanced post-training methods (DPO, Online RL).
- Designing verifiable reward pipelines with rubrics and improving LLM alignment using better Reward Models.
- Developing scalable evaluation and training frameworks for high-quality generative AI experiences.

