Himanshu Gupta

I am an Applied Scientist at Amazon (Stores Foundation AI team), where I work on mid-training, post-training optimization, synthetic data creation and benchmarking of large foundation models. I have hands-on experience with Megatron-LM, verl (SFT, DPO, GRPO), vLLM, SGLang and TRL. I also had the opportunity to develop a multilingual foundation model from scratch at Krutrim, which focused on 10 Indic languages (Media Coverage, Technical Report). My thesis was on Sample Efficiency of Instruction Tuned Models under the guidance of Dr. Swaroop Mishra and Prof. Chitta Baral.

Email / Google Scholar / X / LinkedIn

Research

My main research contributions cover various aspects of Instruction Tuning (Sample Efficiency, Aspect Based Sentiment Analysis, Long Sequence Medical Tasks, Financial NER, Variable Name Recovery with Decompiled Output, Event Detection), Efficient Pretraining, Efficient LLMs-as-a-Judge, Mathematical Benchmarking (Numerical Feasibility, Adversarial Math Word Problems, Multimodal Mathematical Benchmarking, Humanity's Last Exam), and Synthetic Data Generation. My recent research focuses on building large foundation models that excel at instruction following, adherence to detailed guidelines, and high-precision reasoning, with particular emphasis on enhancing mid-training corpora and RL-based post-training alignment.

Selected Papers
Expert Upcycling: Shifting the Compute-Efficient Frontier of Mixture-of-Experts Chaitanya Dwivedi, Bo Huang, Himanshu Gupta, Pratik Jayarao, Neeraj Varshney, Bing Yin Preprint

Code Mixologist: A Practitioner's Guide to Building Code-Mixed LLMs Himanshu Gupta, Pratik Jayarao, Chaitanya Dwivedi, Neeraj Varshney Preprint

Explicit Reasoning Makes Better Judges: A Systematic Study on Accuracy, Efficiency, and Robustness Pratik Jayarao, Himanshu Gupta, Neeraj Varshney, Chaitanya Dwivedi NeurIPS 2025 Workshop on Efficient Reasoning

PolyMATH: A Challenging Multi-modal Mathematical Reasoning Benchmark Himanshu Gupta, Shreyas Verma, Ujjwala Anantheswaran, Kevin Scaria, Mihir Parmar, Swaroop Mishra, Chitta Baral NeurIPS Foundations of Reasoning in Language Models Workshop 2025

Krutrim LLM: Multilingual Foundational Model for over a Billion People Aditya Kallappa....Arveti Manjunath, Himanshu Gupta... Chandra Khatri Tech Report

Humanity's Last Exam Scale AI team, ....Chris Harjadi, Himanshu Gupta, Stephen Malina.... Nature 2025

Cutting Through the Noise: Boosting LLM Performance on Math Word Problems Ujjwala Anantheswaran, Himanshu Gupta, Kevin Scaria, Shreyas Verma, Chitta Baral, Swaroop Mishra Reasoning and Planning for LLMs @ ICLR 2025

TarGEN: Targeted Data Generation with Large Language Models Himanshu Gupta, Kevin Scaria, Ujjwala Anantheswaran, Shreyas Verma, Mihir Parmar, Saurabh Arjun Sawant, Chitta Baral, Swaroop Mishra COLM 2024

EDM3: Event Detection as Multi-task Text Generation Ujjwala Anantheswaran, Himanshu Gupta, Mihir Parmar, Kuntal Kumar Pal, Chitta Baral *SEM NAACL 2024

InstructABSA: Instruction Learning for Aspect Based Sentiment Analysis Kevin Scaria, Himanshu Gupta, Siddharth Goyal, Saurabh Arjun Sawant, Swaroop Mishra, Chitta Baral NAACL 2024

“Len or index or count, anything but v1”: Predicting Variable Names in Decompilation Output with Transfer Learning Kuntal Kumar Pal, Ati Priya Bajaj, Pratyay Banerjee, Audrey Dutcher, Mutsumi Nakamura, Zion Leonahenahe Basque, Himanshu Gupta, Saurabh Arjun Sawant, Ujjwala Anantheswaran, Yan Shoshitaishvili, Adam Doupé, Chitta Baral, Ruoyu Wang IEEE S&P 2023

A Unified Evaluation Framework for Novelty Detection and Accommodation in NLP with an Instantiation in Authorship Attribution Neeraj Varshney, Himanshu Gupta, Eric Robertson, Bing Liu, Chitta Baral ACL 2023

"John is 50 years old, can his son be 65?" Evaluating NLP Models' Understanding of Feasibility Himanshu Gupta, Neeraj Varshney, Swaroop Mishra, Kuntal Kumar Pal, Saurabh Arjun Sawant, Kevin Scaria, Siddharth Goyal, Chitta Baral EACL 2023

Context-NER : Contextual Phrase Generation at Scale Himanshu Gupta, Shreyas Verma, Santosh Mashetty, Swaroop Mishra NeurIPS ENLSP Workshop 2022

Please check my Google Scholar page for all the papers.

Patents
Automated question-answer generation system for documents Himanshu Gupta, Raaed Ahmed Syed, Tarun Kumar, Tamanna Agrawal, Himanshu Sharad Bhatt US Patent 12,333,246 B1 (2025)

System and method for performing product analytics for machine learning platforms Himanshu Gupta, Gourav Kumar Sharma, Krishnaprasad Narayanan, Rahul Ghosh US Patent 12,229,788 B1 (2025)

Transaction and ownership information document extraction Tarun Kumar, Himanshu Gupta, Himanshu Sharad Bhatt, Rahul Ghosh, Nikhil K. Jain, Vinodh Kumar Rajagopalan Velayudham US Patent Application 2023/0113578 A1

Education

01.2022 - 12.2023 Master's (with Thesis) in Computer Science from Arizona State University
08.2015 - 07.2019 B.E. with Hons. in Electrical and Electronics Engineering, BITS Pilani, India

Experience

12.2023 - present Applied Scientist at Amazon (Stores Foundation AI Team)
08.2023 - 11.2023 Founding Scientist at Krutrim
05.2023 - 07.2023 Internship at Amazon Alexa
01.2022 - 05.2023 Graduate Research Assistant at CogInt Labs, ASU with Dr. Chitta Baral
07.2019 - 12.2021 AI Researcher at American Express AI Labs. Supervised by Dr. Himanshu Sharad Bhatt
01.2019 - 06.2019 Internship at American Express
01.2018 - 12.2018 Research Intern at Covenant University with Dr. Sanjay Misra
01.2018 - 12.2018 Undergraduate Research Assistant at BITS Pilani with Dr. NL Bhanu Murthy

Invited Talks and Panels

2025 Panelist, Experts in the Loop — panel on building safe, reliable, and multilingual AI, hosted by AI Circle with panelists from Amazon, Meta, Google, NVIDIA, Microsoft, and Appen (Appen recap, host recap).
2025 Invited attendee, HumanSignal × Unusual Ventures dinner — discussion on agentic systems, rubric design, and curated data for AI (recap).

Honors, Awards and Volunteer Opportunities

2026 Served as a Reviewer for 2026 ICLR, NeurIPS.
2025 Served as a Reviewer for 2025 ICLR, COLM.
2024 Served as a Reviewer for 2024 NAACL, ACL, ACL ARR (April, June, August, October, December) and SDU@AAAI.
2022 - 2023 Received the Master's Graduate Fellowship for Spring, Summer and Fall 2022 at Arizona State University. Also received the Engineering Graduate Fellowship award for academic performance in Master's study. Project mentor and supervisor for 16 students in CSE 576: Advanced Topics in NLP; responsible for delivering problem statements, setting research goals, and resolving students' coding questions on their projects. The project accounted for 50% of the entire coursework. Involved in writing a $6 million grant to IARPA for Authorship Privacy Research for CogInt Labs.
2019 Secured World Rank 2 among 6000+ teams in the HackHarvard Global 2019 Hackathon on the industry-based education track. Was invited to Harvard University to present the project.
2014 Secured a rank of 901 among 1.4 million students pan-India to receive the KVPY fellowship.