Machine Learning @ TikTok Search | CS + Financial Engineering @ WashU

Weizhi Du

building AI systems people actually use.

LLM post-training, AI agents, retrieval and ranking, medical AI, and teaching, with industry work closest to real users.

Industry

Undergrad ML engineer on core search algorithms at Silicon Valley scale.

My industry work has put me close to practical relevance problems: ranking quality at TikTok Search, plus LLM post-training, model quality, and evaluation loops at Meituan.

May 2026 - Aug. 2026 | San Jose, CA

TikTok

Machine Learning Engineer Intern, TikTok Search

fine ranking

Working on the fine ranking core algorithm for TikTok Search, where user intent, relevance quality, and large-scale ranking infrastructure meet.

Jun. 2025 - Aug. 2025 | Beijing, China

Meituan

Machine Learning Engineer Intern, Search and Content Intelligence

LLM post-training | SFT + DPO | 72B to 7B | 35% latency cut | RAG 49.6% to 72.3%

Focused on LLM post-training and enterprise AI quality loops, with model compression, vLLM inference, compliant PII handling, and retrieval evaluation around Meituan-scale products (770M+ users and 15M+ merchants).

Research

Applied AI research with a reliability bias.

My research sits where agents meet accountability: medical image segmentation, grounded clinical reasoning, retrieval, and restoration under hard evaluation.

0.813

FDG-PET Auto-Contouring Agent

Test Dice from validation-gated, safety-aware self-improvement for tumor segmentation.

97.3%

Radiation Oncology RAG

Clinical QA accuracy on ACR TXIT with citation-grounded answer synthesis.

+1.58 dB

DDIM Restoration

Zero-shot restoration gains under severe Gaussian noise and compound degradation.

Submitted abstract: Safety-aware agentic FDG-PET auto-contouring, sole first author, RRS 2025.
Manuscript in preparation: Self-evolving agentic RAG for radiation oncology reasoning, fourth author.

Teaching

Teaching is where my algorithms become explainable.

I have supported algorithms, cloud computing, data mining, ML, and statistics courses through recitations, office hours, grading systems, exam support, rubric calibration, and TA mentorship.

Head TA

CSE 427: Cloud Computing

Head TA for 1 semester; helped students reason through cloud projects, infrastructure debugging, and system tradeoffs.

TA

ESE 326: Probability and Statistics

Clarified probability and statistics through office hours, exam support, and rubric calibration.

TA

CSE 514: Data Mining

Guided applied data mining work, ML assignments, and grading consistency.

TA

ESE 417: Introduction to Machine Learning

Supported model reasoning, programming assignments, and debugging sessions.

TA

CSE 247R: DSA Seminar

Led Data Structures and Algorithms practice from proof ideas to implementation instincts.

Honors + Skills

Credentials, tools, and the technical range behind the work.

This section is intentionally compact: enough signal for context, without turning the site into a transcript.

Academic Award

Ernest D. Weiss Junior Award

One junior selected across all six Computer Science & Engineering majors at WashU for academic excellence.

Scholarship

Howard Nemerov Scholar

Four-year merit scholarship at Washington University in St. Louis.

Consistency

Dean's List

Recognized every semester enrolled.

Early Awards

Pre-College Honors

S.-T. Yau CS Gold Prize, MCM Honorable Mention, Physics Unlimited 11th in the U.S.

Impact Board

Signal, scale, outcomes.

Industry | TikTok Search

Fine Ranking Core Algorithm

Current

High-impact algorithm work inside one of the world's largest recommendation and search ecosystems.

Industry | Meituan

LLM Post-Training

SFT + DPO

Post-training, model quality, PII-safe enterprise use, and retrieval evaluation.

Industry | LLM Systems

72B to 7B Compression

35%

Latency reduction through inference and CUDA profiling work.

Research | Medical AI

FDG-PET Auto-Contouring

0.813

Test Dice from a validation-gated self-improving agent framework.

Research | Clinical RAG

Radiation Oncology Reasoning

97.3%

Accuracy on a radiation oncology benchmark with grounded retrieval.

Project | RL

AlphaZero-Style Self-Play

6.9x

Training speedup for a large-action-space board-game agent.

Project | Vision

Training-Free Visual Grounding

62.8%

Acc@0.5 on RefCOCO-M validation queries.

Project | Cloud

Cloud-Native ML Platform

GKE

Ray, KubeRay, FastAPI, and artifact pipelines for distributed ML training.

Off Hours

Still a person after the metrics.

I write lyrics, cook, backpack through Death Valley, and gravitate toward moody, poetic storytelling like Yorushika. Often with my cat nearby.

"Your Year with ChatGPT," 2025.

Contact

Say hi.

Talk to me about research, internships, projects, classes, or any topic you are excited about. I am always happy to meet and build!