cv
Table of contents
Basics
| Name | Benjamin V. Rozonoyer |
| Label | PhD Candidate in Machine Learning and NLP |
| brozonoyer@umass.edu | |
| Url | https://brozonoyer.github.io |
| Summary | PhD candidate at UMass Amherst (IESL) working on discrete diffusion models for text, with broader interests in representation learning, efficient LLM inference, and generative information retrieval. |
Work
- 2025.06 - 2025.09
- 2025.02 - 2025.06
- 2024.05 - 2024.08
- 2023.06 - 2023.08
Scientist Intern
Charles River Analytics Research internship in Cambridge, MA.
- Transformer-based contrastive learning for authorship embeddings (IARPA HIATUS) — advanced CRA from last to 2nd place in evaluation
- Probabilistic programming in Pyro for generation/inference of military activity patterns
- 2022.09 - Present
PhD Student
University of Massachusetts, Amherst Doctoral research in machine learning and NLP, advised by Andrew McCallum.
- Discrete diffusion models for text — non-autoregressive, flexible, and controllable language generation (thesis focus)
- Generative ranking for information retrieval
- Efficient inference with mixture-of-expert LLMs
- Geometric representation learning for large directed graphs
- 2020.06 - 2022.08
Engineer II
Raytheon BBN Technologies Software engineer and researcher in information extraction and NLP.
- Multilingual information extraction (DARPA AIDA)
- Full NLP stack for COVID-19 machine reading system (IARPA COVID Seedling)
- Maintained internal NLP and deep learning libraries; integrated open-source and research models
- 2019.06 - 2019.08
Research Intern (TechStudent)
Raytheon BBN Technologies Multilingual neural relation extraction via machine translation and MGIZA++ token-level annotation projection to train mBERT.
- 2017.08 - 2019.12
Teaching Assistant
Brandeis University TA for five undergraduate and graduate courses.
- Statistical Approaches to NLP (Fall 2019)
- Phonological Theory (Spring 2019)
- Data Structures and Algorithms (Fall 2018)
- Theory of Computation (Spring 2018)
- Discrete Structures (Fall 2017)
- 2017.01 - 2021.09
Research Assistant
Brandeis University — BiRCh Project Morphological analysis, parsing, segmentation, tokenization, transcription, and adjudication for the Bilingual Russian Children corpus.
Education
-
2022 - Present PhD
University of Massachusetts, Amherst Computer Science
- Machine Learning
- Distributed and Operating Systems
- Advanced Algorithms
- Probabilistic Graphical Models
- Reinforcement Learning
-
2020 - 2021 -
2016 - 2020
Awards
- 2021
Aravind K. Joshi Award for Outstanding Achievement in Computational Linguistics
Brandeis University
- 2018
Giumette Academic Achievement Award
Brandeis University
- 2016
Humanities Fellowship
Brandeis University
Certificates
| Accelerating CUDA C++ Applications with Concurrent Streams | ||
| NVIDIA | 2024-10 |
| Getting Started with Accelerated Computing in CUDA C/C++ | ||
| NVIDIA | 2024-09 |
| Fundamentals of Accelerated Computing with CUDA Python | ||
| NVIDIA | 2024-08 |
Skills
| Machine Learning | |
| PyTorch | |
| JAX | |
| Transformers | |
| Diffusion Models | |
| Representation Learning | |
| LLMs | |
| CUDA |
| Natural Language Processing | |
| Information Extraction | |
| Information Retrieval | |
| Computational Linguistics | |
| Multilingual NLP |
| Programming | |
| Python | |
| Julia | |
| Java | |
| SPARQL | |
| CUDA C++ |
Languages
| English | |
| Native |
| Russian | |
| Native |
Oracle Labs
Google Research