Nice to meet you! I am an undergraduate senior, research assistant, and engineer at Rice University!
This is my third year doing research under Dr. Anshumali Shrivastava, and so far I have worked on topics including Approximate Nearest Neighbor Search, Large Scale Machine Learning, and Quantization.
I am also a co-founder and AI engineer of xMAD.ai. We aim to break the neural scaling curve tradeoff between compute and accuracy and create high-performance models at a fraction of the compute. Find us through our website and HuggingFace.
Here is a PDF of my resume.
Below are some highlights of what I do.
NeurIPS 2024
NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention
Tianyi Zhang · Jonah Yi · Bowen Yao · Zhaozhuo Xu · Anshumali Shrivastava
NeurIPS 2024
KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization
Tianyi Zhang · Jonah Yi · Zhaozhuo Xu · Anshumali Shrivastava
CAPS: A Practical Partition Index for Filtered Similarity Search
Gaurav Gupta · Jonah Yi · Benjamin Coleman · Chen Luo · Vihan Lakshman · Anshumali Shrivastava
Class 12 of OwlSpark
As part of xMAD.ai, I took part in the OwlSpark Startup and Small Business Accelerator this summer! I connected with industry leaders, investors, and mentors, and gave a final presentation at the Eleventh Annual Bayou Startup Showcase!
Here is xMAD.ai's introductory video!