Resume Resume
Basics
| Name | Yubo Li |
| yubol@andrew.cmu.edu |
Education
Work
-
2025.06 - 2025.08 Applied Scientist Intern
Amazon
Led end-to-end development of a generative recommendation system by pre-training a custom user encoder on 172k interaction sequences and progressively aligning it with LLMs, achieving strong gains over prompting baselines (Hit@3 60.1%, NDCG@3 67.5%) with 93.4% fewer tokens and 64.8% lower cost. Built a production-ready RAG/search pipeline with FAISS HNSW over 60k user embeddings and implemented distributed multi-GPU training with FSDP and BF16 mixed precision to scale training and inference efficiently.
-
2019.10 - 2020.03 Data Scientist
HireBeat
Leveraged machine learning and NLP techniques to develop a data analysis pipeline that optimized content recommendation algorithms, significantly improving personalization accuracy and user engagement.
-
2012.07 - 2014.05 Founder
R-Eat
Entrepreneurship: Founded and grew an online food delivery business by enhancing customer experience through UI optimization and personalized marketing, resulting in a 60% sales increase and a loyal customer base of over 2,000.
Awards
- 2025
TCS Presidential Fellowship
Carnegie Mellon University
- 2024
CMLH Fellowship in Generative AI in Healthcare
Center for Machine Learning and Health, Carnegie Mellon University
- 2023
CMU Heinz College Research Excellence Award
Center for Machine Learning and Health, Carnegie Mellon University
Skills
| Programming Languages | |
| Python | |
| Java | |
| SQL | |
| R |
| Technologies & Frameworks | |
| PyTorch | |
| TensorFlow | |
| scikit-learn | |
| Pandas | |
| NumPy | |
| AWS |
| Large Language Models | |
| Prompting Optimization | |
| RAG | |
| Fine-tuning | |
| LoRA | |
| RLHF | |
| Model Distillation | |
| Evaluation Metrics | |
| Parallel Training | |
| Agentic Frameworks |
| Machine Learning & Deep Learning | |
| Supervised Learning | |
| Unsupervised Learning | |
| Predictive Modeling | |
| Natural Language Processing | |
| Time Series Analysis | |
| Model Interpretability |
Languages
| Mandarin | |
| Native speaker |
| English | |
| Fluent |
Projects
- 2025.01 - Present
Agentic Q&A Systems for Trustworthy Organ Transplantation Guidance
- Q&A System: Developed Q&A systems for organ transplantation patient care by integrating retrieval-augmented generation (RAG) with adaptive frameworks that synthesize patient-specific medical history, clinical guidelines, and dynamic risk factors to deliver tailored guidance and support.
- RAG/Retriever: Engineered a hybrid retrieval pipeline combining both sparse and dense retrievers (DPR, ANCE) to accurately extract and verify medical content from over 100 transplant-specific handbooks.
- LLM Reasoning Framework: Developed an incentivized reasoning mechanism that optimizes LLM performance by rewarding multi-step chain-of-question sequences, enabling the system to reason freely and proactively request critical diagnostic information.
- 2024.05 - Present
LLM Alignment & Multi-Turn Consistency
- Distributed Training: Architected and optimized distributed training infrastructure for LLMs (Mistral, LLaMA, Deepseek) across 4 nodes with 16 NVIDIA GH200 GPUs, implementing tensor parallelism, pipeline parallelism, and ZeRO optimizer techniques to efficiently scale model training and reduce computational overhead.
- Fine-tuning: Performed fine-tuning of state-of-the-art LLMs both locally (on-premises clusters) and via OpenAI's official GPT-4o API, successfully improving model consistency, confidence-aware reasoning, and achieving state-of-the-art performance.
- Evaluation: Proposed a metric to quantitatively assess multi-turn response stability, prioritizing early interaction accuracy and swift recovery from initial errors. Released a comprehensive benchmark to test the model's ability to sustain coherent and contextually accurate responses during dynamic, multi-turn conversations.
- 2022.07 - 2025.7
Chronic Kidney Disease Progression Analysis with DL & XAI
- Deep Learning: Developed Temporal-Feature Cross Attention Mechanism (TFCAM), an attention-based deep learning framework that explicitly captures temporal and feature-level interactions to enhance clinical predictive modeling for Chronic Kidney Disease (CKD) progression.
- XAI: Provided multi-level explainability including temporal insights, feature importance ranking, and cross-temporal feature interactions, enabling clinicians to interpret model predictions transparently and identify actionable clinical insights.
- Enhanced Representation & Delivery: Designed an interactive, user-friendly dashboard GUI accessible to diverse user groups; integrated LLMs to automatically generate clear, insightful summaries of predictions and model interpretations, enhancing usability and clinical decision-making support.