lastname@cmu.edu
Gates & Hillman Centers, GHC8133
I am an Assistant Professor at Carnegie Mellon University (CMU) and a research scientist at the Allen Institute for Artificial Intelligence (Ai2). I am the creator and maintainer of bitsandbytes.
I earned my PhD from the University of Washington, advised by Luke Zettlemoyer. My main research goal is to make AI accessible so that everyone can tinker with AI, learn from it, and integrate and use it in their own work. I do this in two ways: (1) by developing the next generation of AI methods and models, in particular agents; and (2) by making these AI methods and models accessible through my research (QLoRA, LLM.int8(), k-bit inference scaling laws, Petals, SWARM) and through software that makes these research innovations easy to use (bitsandbytes).
Research Interests
My main research thesis is that computationally efficient methods will accelerate and enable progress in and understanding of deep learning. In particular, I am interested in:
- Open-source agents
- On-device mixture of experts
- Hierarchical LLM architectures and deployments
- Scientific automation (including healthcare)
- Automation with AI
Publications
For my full list of publications, please see my Google Scholar page.
Awards & Honors
2025 Google ML and Systems Junior Faculty Award
2024 AI2050 Early Career Fellow
2023 Madrona Prize
2023 Google Open Source Award
2023 PyTorch Foundation Award
2023 Martin & Beate Block Award
2021 NeurIPS Best Reviewer Award
2018/2019 Jeff Dean – Heidi Hopper Endowed Regental Fellowship
2016/2017 Google Scholarship
Group
Eulrang Cho
Trang Nguyen
Vladimir Malinovskii
Christine Park
Service
Reviewing:
- ICLR 2018-2023
- NeurIPS 2019-2023
- ICML 2021-2023
- ARR 2022-2023
- JMLR 2020-2021
- IEEE Computational Intelligence Magazine (CIM) 2020
- Knowledge and Information Systems (KAIS) 2018