Himanshu Gupta
I am an Applied Scientist at Amazon, where I work on developing foundation models for Amazon
Business. My main research contributions span the sample efficiency of instruction-tuned language
models, mathematical robustness, synthetic data generation, and aspect-based sentiment analysis. I completed my
Bachelor's at BITS Pilani and my Master's in Computer Science (Thesis Track) at Arizona
State University. My thesis, "Sample efficiency of Instruction Tuned Models", was supervised by
Prof. Chitta Baral.
I recently interned at Krutrim, where I created a multilingual 7B foundation model focused on Indic
languages and developed a new tokenizer for it. The tokenizer keeps the vocabulary concise at around
100,000 words while still understanding most words. We also built a multilingual large language
model (LLM) that understands 10 Indic languages (Media Coverage). This involved
gathering pretraining corpora of nearly 2 trillion tokens and training the LLM. To further improve
its performance, I worked on generating supervised fine-tuning data. Finally, with the help of vLLM
and TensorRT, we deployed the LLM for efficient use with high throughput and low latency.
Email /
CV /
Google Scholar /
X /
LinkedIn
|
|
Research
My research interests include Large Language Models, Enhancing Pretraining Corpora, Instruction
Tuning, and Direct Preference Optimization.
I collaborate with Dr. Swaroop Mishra on topics such as
instruction tuning, synthetic dataset creation, and mathematical reasoning.
My strength lies in generating new ideas and I am fortunate to collaborate with a diverse set of
awesome researchers.
|
News
- 02.2025 Contributed to Humanity's Last Exam (HLE), which is now available!
- 02.2025 Serving as a reviewer for ICLR 2025
- 12.2024 Served as a reviewer for ACL ARR (April, June, August, October, December) 2024
- 11.2024 Reached 200 citations
- 07.2024 TarGEN accepted at COLM 2024
- 05.2024 EDM3 accepted at *SEM NAACL 2024
- 03.2024 InstructABSA accepted at NAACL 2024
- 01.2024 Reached 100 citations
- 12.2023 Joined Amazon as an Applied Scientist
- 12.2023 Graduated from Arizona State University with Distinction
- 08.2023 Received a $1,500 merit scholarship from the School of Computing and AI at ASU
- 07.2023 Started a 40hr co-op as an Applied Scientist at Krutrim
- 06.2023 Paper accepted at ACL 2023
- 05.2023 Started internship at Amazon Alexa
- 01.2022 Started Master's in Computer Science at Arizona State University
- 07.2019 Joined American Express AI Labs as a Research Engineer
- 06.2019 Graduated from BITS Pilani
|
Selected Papers
|
Krutrim LLM: Multilingual Foundational Model for over a Billion People
Aditya Kallappa....Arveti Manjunath, Himanshu Gupta... Chandra Khatri
preprint
|
|
Humanity's Last Exam
Scale AI team, ....Chris Harjadi, Himanshu Gupta, Stephen Malina....
preprint
|
|
PolyMATH: A Challenging Multi-modal Mathematical Reasoning Benchmark
Himanshu Gupta, Shreyas Verma, Ujjwala Anantheswaran, Kevin Scaria, Mihir Parmar,
Swaroop Mishra, Chitta Baral
preprint
|
|
Investigating the Robustness of LLMs on Math Word Problems
Ujjwala Anantheswaran, Himanshu Gupta, Kevin Scaria, Shreyas Verma, Chitta Baral,
Swaroop Mishra
preprint
|
|
TarGEN: Targeted Data Generation with Large Language Models
Himanshu Gupta, Kevin Scaria, Ujjwala Anantheswaran, Shreyas Verma, Mihir Parmar,
Saurabh Arjun Sawant, Chitta Baral, Swaroop Mishra
COLM 2024
|
|
EDM3: Event Detection as Multi-task Text Generation
Ujjwala Anantheswaran, Himanshu Gupta, Mihir Parmar, Kuntal Kumar Pal, Chitta Baral
*SEM NAACL 2024
|
|
InstructABSA: Instruction Learning for Aspect Based Sentiment
Analysis
Kevin Scaria, Himanshu Gupta, Siddharth Goyal, Saurabh Arjun Sawant, Swaroop Mishra,
Chitta Baral
NAACL 2024
|
|
“Len or index or count, anything but v1”: Predicting Variable Names in Decompilation
Output with Transfer Learning
Kuntal Kumar Pal, Ati Priya Bajaj, Pratyay Banerjee, Audrey Dutcher, Mutsumi Nakamura, Zion
Leonahenahe Basque, Himanshu Gupta, Saurabh Arjun Sawant, Ujjwala Anantheswaran, Yan
Shoshitaishvili, Adam Doupé, Chitta Baral, Ruoyu Wang
IEEE S&P 2023
|
|
A Unified Evaluation Framework for Novelty Detection and Accommodation in NLP with an
Instantiation in Authorship Attribution
Neeraj Varshney, Himanshu Gupta, Eric Robertson, Bing Liu, Chitta Baral
ACL 2023
|
|
"John is 50 years old, can his son be 65?" Evaluating NLP Models' Understanding of
Feasibility
Himanshu Gupta, Neeraj Varshney, Swaroop Mishra, Kuntal Kumar Pal, Saurabh Arjun
Sawant, Kevin Scaria, Siddharth Goyal, Chitta Baral
EACL 2023
|
|
Context-NER : Contextual Phrase Generation at Scale
Himanshu Gupta, Shreyas Verma, Santosh Mashetty, Swaroop Mishra
NeurIPS ENLSP Workshop 2022
|
|
LongBoX: Evaluating Transformers on Long-Sequence Clinical Tasks
Mihir Parmar, Aakanksha Naik, Himanshu Gupta, Disha Agrawal, Chitta Baral
preprint
|
|
Instruction Tuned Models are Quick Learners
Himanshu Gupta, Saurabh Arjun Sawant, Swaroop Mishra, Mutsumi Nakamura, Arindam Mitra,
Santosh Mashetty, Chitta Baral
preprint
|
|
Please see my
Google Scholar page for the full list of papers.
|
Experience
- 12.2023 - present Applied Scientist at Amazon.
- 08.2023 - 11.2023 40hr co-op as Applied Scientist (LLM) at Krutrim.
- 05.2023 - 07.2023 Internship at Amazon Alexa.
- 01.2022 - 05.2023 Graduate Research Assistant at CogInt Labs, ASU under Dr. Chitta Baral.
- 01.2019 - 12.2021 AI Researcher at American Express AI Labs. Supervised by Dr. Himanshu Sharad Bhatt.
- 01.2018 - 12.2018 Research Intern at Covenant University under Dr. Sanjay
Misra.
- 01.2018 - 12.2018 Undergraduate Research Assistant at BITS Pilani under Dr. NL Bhanu Murthy.
|
Honors and Awards
- 12.2024 Served as a reviewer for NAACL 2024, ACL 2024, SDU@AAAI 2024, ICLR 2024, and ACL ARR (April, June, August, October, December) 2024.
- 08.2024 Received the Engineering Graduate Fellowship award for academic performance in Master's study at ASU.
- 12.2022 Received the Master's Graduate Fellowship for Spring and Fall 2022 at Arizona State University. The fellowship awards a two-thirds fee waiver and a monthly stipend for the semester.
- 07.2023 Project mentor and supervisor for 16 students in CSE 576: Advanced Topics in NLP. Responsible for delivering problem statements, setting research goals, and resolving students' coding questions on their projects. The project counted for 50% of the coursework.
- 02.2023 Helped write a $6 million grant proposal to IARPA on authorship privacy research for CogInt Labs.
- 12.2021 Filed 3 patents at American Express.
- 03.2019 Secured World Rank 2 among 6000+ teams in the HackHarvard Global 2019 hackathon on the industry-based education track. Was invited to Harvard University to present the project.
- 04.2014 Secured a rank of 901 among 1.4 million students pan-India to receive the KVPY fellowship.