Manisha Mukherjee

Manisha Mukherjee

PhD Candidate · School of Computer Science
Carnegie Mellon University
Advisor: Vincent Hellendoorn

I work on AI for software engineering, with a focus on making language models more reliable and useful for code-related tasks.

My research investigates how domain knowledge can be integrated into language models through complementary approaches such as training specialized models on domain-specific data, augmenting generation with retrieval from community knowledge bases, and leveraging execution feedback to improve code quality.

Research

Domain-Specialized Models

Investigating whether smaller models trained extensively on domain-specific data can outperform much larger general-purpose models for certain tasks. SOBert models (125M-762M parameters, trained for under $2,000) outperform much larger general-purpose models on StackOverflow code labeling tasks.

Secure Code Generation

Developing retrieval-augmented generation systems that leverage community security discussions from StackOverflow to identify and fix vulnerabilities in LLM-generated code. SOSecure achieves 72-97% fix rates compared to 38-56% for GPT-4 alone, without introducing new vulnerabilities.

Reinforcement Learning from Execution Feedback

Applying reinforcement learning with compiler and execution feedback to improve code quality, with a focus on decompilation tasks where the goal is producing readable, semantically equivalent code from binary executables using compiler and test feedback as learning signals.

Released Models

Publicly released domain-specialized language models on Hugging Face:

SOBertLarge
762M parameter model trained on StackOverflow.
SOBertBase
125M parameter model trained on StackOverflow.

Selected Publications and Patents

M. Mukherjee and V. J. Hellendoorn
arXiv preprint, 2025
M. Mukherjee and V. J. Hellendoorn
IEEE/ACM International Conference on AI Foundation Models and Software Engineering (FORGE), 2025
M. Mukherjee, S. Kim, X. Chen, D. Luo, T. Yu, and T. Mai
arXiv preprint, 2025
V. J. Hellendoorn, J. Tsay, M. Mukherjee, and M. Hirzel
29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE), 2021
M. Mukherjee, M. Bahrami, and W. P. Chen
US Patent 11,651,014, 2023
M. Mukherjee, J. Edwards, H. Kwon, and T. F. La Porta
IEEE International Conference on Pervasive Computing and Communication Workshops (PerCom), 2015

Experience