Resume Resume

Basics

Name Yubo Li
Email yubol@andrew.cmu.edu

Education

  • 2022.08 - now

    Pittsburgh, PA

    Ph.D.
    Carnegie Mellon University
    Information Systems
  • 2020.08 - 2021.12

    Pittsburgh, PA

    M.S.
    Carnegie Mellon University
    Information Systems
  • 2015.09 - 2019.06

    La Jolla, CA

    B.S.
    University of California, San Diego
    Applied Mathematics | Business

Work

  • 2025.06 - 2025.08
    Applied Scientist Intern
    Amazon
    Led end-to-end development of a generative recommendation system by pre-training a custom user encoder on 172k interaction sequences and progressively aligning it with LLMs, achieving strong gains over prompting baselines (Hit@3 60.1%, NDCG@3 67.5%) with 93.4% fewer tokens and 64.8% lower cost. Built a production-ready RAG/search pipeline with FAISS HNSW over 60k user embeddings and implemented distributed multi-GPU training with FSDP and BF16 mixed precision to scale training and inference efficiently.
  • 2019.10 - 2020.03
    Data Scientist
    HireBeat
    Leveraged machine learning and NLP techniques to develop a data analysis pipeline that optimized content recommendation algorithms, significantly improving personalization accuracy and user engagement.
  • 2012.07 - 2014.05
    Founder
    R-Eat
    Entrepreneurship: Founded and grew an online food delivery business by enhancing customer experience through UI optimization and personalized marketing, resulting in a 60% sales increase and a loyal customer base of over 2,000.

Awards

Skills

Programming Languages
Python
Java
SQL
R
Technologies & Frameworks
PyTorch
TensorFlow
scikit-learn
Pandas
NumPy
AWS
Large Language Models
Prompting Optimization
RAG
Fine-tuning
LoRA
RLHF
Model Distillation
Evaluation Metrics
Parallel Training
Agentic Frameworks
Machine Learning & Deep Learning
Supervised Learning
Unsupervised Learning
Predictive Modeling
Natural Language Processing
Time Series Analysis
Model Interpretability

Languages

Mandarin
Native speaker
English
Fluent

Projects

  • 2025.01 - Present
    Agentic Q&A Systems for Trustworthy Organ Transplantation Guidance
    • Q&A System: Developed Q&A systems for organ transplantation patient care by integrating retrieval-augmented generation (RAG) with adaptive frameworks that synthesize patient-specific medical history, clinical guidelines, and dynamic risk factors to deliver tailored guidance and support.
    • RAG/Retriever: Engineered a hybrid retrieval pipeline combining both sparse and dense retrievers (DPR, ANCE) to accurately extract and verify medical content from over 100 transplant-specific handbooks.
    • LLM Reasoning Framework: Developed an incentivized reasoning mechanism that optimizes LLM performance by rewarding multi-step chain-of-question sequences, enabling the system to reason freely and proactively request critical diagnostic information.
  • 2024.05 - Present
    LLM Alignment & Multi-Turn Consistency
    • Distributed Training: Architected and optimized distributed training infrastructure for LLMs (Mistral, LLaMA, Deepseek) across 4 nodes with 16 NVIDIA GH200 GPUs, implementing tensor parallelism, pipeline parallelism, and ZeRO optimizer techniques to efficiently scale model training and reduce computational overhead.
    • Fine-tuning: Performed fine-tuning of state-of-the-art LLMs both locally (on-premises clusters) and via OpenAI's official GPT-4o API, successfully improving model consistency, confidence-aware reasoning, and achieving state-of-the-art performance.
    • Evaluation: Proposed a metric to quantitatively assess multi-turn response stability, prioritizing early interaction accuracy and swift recovery from initial errors. Released a comprehensive benchmark to test the model's ability to sustain coherent and contextually accurate responses during dynamic, multi-turn conversations.
  • 2022.07 - 2025.7
    Chronic Kidney Disease Progression Analysis with DL & XAI
    • Deep Learning: Developed Temporal-Feature Cross Attention Mechanism (TFCAM), an attention-based deep learning framework that explicitly captures temporal and feature-level interactions to enhance clinical predictive modeling for Chronic Kidney Disease (CKD) progression.
    • XAI: Provided multi-level explainability including temporal insights, feature importance ranking, and cross-temporal feature interactions, enabling clinicians to interpret model predictions transparently and identify actionable clinical insights.
    • Enhanced Representation & Delivery: Designed an interactive, user-friendly dashboard GUI accessible to diverse user groups; integrated LLMs to automatically generate clear, insightful summaries of predictions and model interpretations, enhancing usability and clinical decision-making support.