Himanshu Gupta

I am an Applied Scientist at Amazon, where I work on developing foundation models for Amazon Business. My main research contributions cover the sample efficiency of instruction-tuned language models, mathematical robustness, synthetic data generation, and aspect-based sentiment analysis. I completed my Bachelor's at BITS Pilani and my Master's in Computer Science (Thesis Track) at Arizona State University. My thesis, "Sample efficiency of Instruction Tuned Models", was supervised by Prof. Chitta Baral.

I recently interned at Krutrim, where I helped create a multilingual 7B foundation model focused on Indic languages and developed a new tokenizer for it. The tokenizer keeps the vocabulary concise at around 100,000 entries while still covering most words. The resulting multilingual large language model (LLM) understands 10 Indic languages (Media Coverage). This work involved gathering pretraining corpora of nearly 2 trillion tokens and training the LLM. To further improve its performance, I worked on generating supervised fine-tuning data. Finally, using vLLM and TensorRT, we deployed the LLM for high-throughput, low-latency serving.

Email  /  CV  /  Google Scholar  /  X  /  LinkedIn

Research

My research interests include Large Language Models, Enhancing Pretraining Corpora, Instruction Tuning, and Direct Preference Optimization. I collaborate with Dr. Swaroop Mishra on topics such as instruction tuning, synthetic dataset creation, and mathematical reasoning. My strength lies in generating new ideas, and I am fortunate to collaborate with a diverse set of awesome researchers.

News

  • 02.2025 Contributed to Humanity's Last Exam (HLE), which is now available!
  • 02.2025 Serving as Reviewer for ICLR 2025
  • 12.2024 Served as Reviewer for ACL ARR (April, June, August, October, December) 2024
  • 11.2024 Reached 200 citations
  • 07.2024 TarGEN accepted at COLM 2024
  • 05.2024 EDM3 accepted at SEM NAACL 2024
  • 03.2024 InstructABSA accepted at NAACL 2024
  • 01.2024 Reached 100 citations
  • 12.2023 Joined Amazon as an Applied Scientist
  • 12.2023 Graduated from Arizona State University with Distinction
  • 08.2023 Received a $1,500 merit scholarship from the School of Computing and AI at ASU
  • 07.2023 Started 40hr co-op as Applied Scientist at Krutrim
  • 06.2023 Paper accepted at ACL 2023
  • 05.2023 Started internship at Amazon Alexa
  • 01.2022 Started Master's in Computer Science at Arizona State University
  • 07.2019 Joined American Express AI Labs as a Research Engineer
  • 06.2019 Graduated from BITS Pilani

Selected Papers
Krutrim LLM: Multilingual Foundational Model for over a Billion People
Aditya Kallappa....Arveti Manjunath, Himanshu Gupta... Chandra Khatri
preprint
 
Humanity's Last Exam
Scale AI team, ....Chris Harjadi, Himanshu Gupta, Stephen Malina....
preprint
 
PolyMATH: A Challenging Multi-modal Mathematical Reasoning Benchmark
Himanshu Gupta, Shreyas Verma, Ujjwala Anantheswaran, Kevin Scaria, Mihir Parmar, Swaroop Mishra, Chitta Baral
preprint
 
Investigating the Robustness of LLMs on Math Word Problems
Ujjwala Anantheswaran, Himanshu Gupta, Kevin Scaria, Shreyas Verma, Chitta Baral, Swaroop Mishra
preprint
 
TarGEN: Targeted Data Generation with Large Language Models
Himanshu Gupta, Kevin Scaria, Ujjwala Anantheswaran, Shreyas Verma, Mihir Parmar, Saurabh Arjun Sawant, Chitta Baral, Swaroop Mishra
COLM 2024
 
EDM3: Event Detection as Multi-task Text Generation
Ujjwala Anantheswaran, Himanshu Gupta, Mihir Parmar, Kuntal Kumar Pal, Chitta Baral
SEM NAACL 2024
 
InstructABSA: Instruction Learning for Aspect Based Sentiment Analysis
Kevin Scaria, Himanshu Gupta, Siddharth Goyal, Saurabh Arjun Sawant, Swaroop Mishra, Chitta Baral
NAACL 2024
 
“Len or index or count, anything but v1”: Predicting Variable Names in Decompilation Output with Transfer Learning
Kuntal Kumar Pal, Ati Priya Bajaj, Pratyay Banerjee, Audrey Dutcher, Mutsumi Nakamura, Zion Leonahenahe Basque, Himanshu Gupta, Saurabh Arjun Sawant, Ujjwala Anantheswaran, Yan Shoshitaishvili, Adam Doupé, Chitta Baral, Ruoyu Wang
IEEE S&P 2023
 
A Unified Evaluation Framework for Novelty Detection and Accommodation in NLP with an Instantiation in Authorship Attribution
Neeraj Varshney, Himanshu Gupta, Eric Robertson, Bing Liu, Chitta Baral
ACL 2023
 
"John is 50 years old, can his son be 65?" Evaluating NLP Models' Understanding of Feasibility
Himanshu Gupta, Neeraj Varshney, Swaroop Mishra, Kuntal Kumar Pal, Saurabh Arjun Sawant, Kevin Scaria, Siddharth Goyal, Chitta Baral
EACL 2023
 
Context-NER : Contextual Phrase Generation at Scale
Himanshu Gupta, Shreyas Verma, Santosh Mashetty, Swaroop Mishra
NeurIPS ENLSP Workshop 2022
 
LongBoX: Evaluating Transformers on Long-Sequence Clinical Tasks
Mihir Parmar, Aakanksha Naik, Himanshu Gupta, Disha Agrawal, Chitta Baral
preprint
 
Instruction Tuned Models are Quick Learners
Himanshu Gupta, Saurabh Arjun Sawant, Swaroop Mishra, Mutsumi Nakamura, Arindam Mitra, Santosh Mashetty, Chitta Baral
preprint
 
Please see my Google Scholar page for the full list of papers.

Education

Experience

  • 12.2023 - present Applied Scientist at Amazon.
  • 08.2023 - 11.2023 40hr co-op as Applied Scientist LLM at Krutrim.
  • 05.2023 - 07.2023 Internship at Amazon Alexa.
  • 01.2022 - 05.2023 Graduate Research Assistant at CogInt Labs, ASU under Dr. Chitta Baral.
  • 01.2019 - 12.2021 AI Researcher at American Express AI Labs, supervised by Dr. Himanshu Shrad Bhatt.
  • 01.2018 - 12.2018 Research Intern at Covenant University under Dr. Sanjay Misra.
  • 01.2018 - 12.2018 Undergraduate Research Assistant at BITS Pilani under Dr. NL Bhanu Murthy.

Honors and Awards

  • 12.2024 Served as a Reviewer for NAACL 2024, ACL 2024, SDU@AAAI 2024, ICLR 2024, ACL ARR (April, June, August, October, December) 2024
  • 08.2024 Received the Engineering Graduate Fellowship award for academic performance in Master's study at ASU
  • 12.2022 Received the Master's Graduate Fellowship for Spring and Fall 2022 at Arizona State University. The fellowship provides a 2/3 fee waiver and a monthly stipend for the semester.
  • 07.2023 Project mentor and supervisor for 16 students in CSE 576: Advanced Topics in NLP. Responsible for delivering problem statements, setting research goals, and resolving students' coding questions on their projects. The project accounted for 50% of the coursework.
  • 02.2023 Contributed to writing a $6 million grant proposal to IARPA for authorship privacy research at CogInt Labs.
  • 12.2021 Filed 3 patents at American Express
  • 03.2019 Secured World Rank 2 among 6000+ teams in the HackHarvard Global 2019 Hackathon on the industry-based education track. Was invited to Harvard University to present the project.
  • 04.2014 Secured a rank of 901 among 1.4 million students across India to receive the KVPY fellowship.