Advancing the Frontier

Data Science, AI, and Statistical Learning

A Symposium in Honor of Jun Liu


July 28, 2025

9am - 4:30pm

Harvard University


Winokur Hall
Science and Engineering Complex
150 Western Ave, Allston, Boston


Upload Portrait/Video/Photo Parking

Important Notice

1. Call for Contributions: As part of the workshop celebration, we warmly invite all attendees to contribute to a tribute honoring Jun. You are welcome to submit a portrait, a short video, event photos, or a short message. These contributions will be compiled into a collective tribute to be shared during the event. If you have already submitted your materials, there is no need to resubmit. We look forward to celebrating Jun’s impact together through your voices, faces, and stories.

2. Parking: Come-and-go block parking will be available from 8am to 5pm here.

3. WiFi: Choose Harvard University (no password needed); then register here as a guest.

Let's celebrate Jun!

We are gathering to celebrate Jun Liu’s remarkable contributions to Data Science, Statistical Learning, and the broader academic community since 1988. From pioneering research to mentoring nearly 80 PhD students and postdoctoral fellows, Jun has profoundly shaped modern statistics across institutions and continents. His influential work in computational biology has also built bridges between statistics and biomedical research.

The workshop will be held at Harvard University on Monday, July 28, 2025. The program will feature academic sessions and discussions on recent developments in Data Science, AI, and related fields. Special thanks to Jun’s former students, friends, and Two Sigma for their generous support, and to the Harvard Department of Statistics for graciously hosting the event. In particular, we thank Bo Jiang for leading the fundraising initiative and securing full support for the event.

Schedule

July 28

Science and Engineering Complex
150 Western Ave, Allston, Boston

Breakfast & Coffee Break @ West Atrium
Workshop @ Winokur Hall

Time Speaker Title
(Click to Abstract)
8:30am Breakfast
9:00am Samuel Kou
Harvard University
Opening Remark
9:10–10:10am Chair: Xuming He
Washington University
in St. Louis
Organizer: Samuel Kou
Bayesian-inspired Methods in AI and Statistics
Yingnian Wu
University of California,
Los Angeles
Latent Thought Language Models with
Variational Bayes Inference-Time Computation
Shane Jensen
University of Pennsylvania
Bayesian Non-Parametric Clustering at Micro and Macro Scales
Xinran Li
University of Chicago
Bootstrap Inference for Partially Identified Parameters
with Unobserved Variables and Over-identification
Ke Deng
Tsinghua University
Integrating Statistical Learning and Deep Learning
for Interpretable Analysis of Complex Unstructured Data
10:10–10:30am Break
10:30–11:30am Chair: Haiyan Huang
University of California,
Berkeley
Organizer: Tianxi Cai, Xihong Lin
Advancing Biological Discovery and Therapeutics
through AI and Statistical Modeling
Hongyu Zhao
Yale University
Modeling and Predicting Single-Cell Multi-Gene Perturbation Responses
Chiara Sabatti
Stanford University
When Is a Variable Important?
Shirley Liu
GV20 Therapeutics /
Dana-Farber Cancer Institute
AI-driven Next-generation Cancer Biotherapeutics

Lunch @ West Atrium
Lightning Talks @ Winokur Hall

Time Speaker Title
(Click to Abstract)
11:30am–1:00pm Lunch, Group Photo
12:10–12:50pm Chair: Ping Ma
University of Georgia
Organizer: Ping Ma
From Spark to Signal: Lightning Talks on
Data Science, AI, and Statistics for Knowledge Discovery
Haiyan Huang
University of California,
Berkeley
A Journey from Mathematics to Genomic Data Analysis
Leo Guo
Bayesian Investment
More Than Numbers
Jinfeng Zhang
Insilicom LLC
A Comprehensive Large-Scale Biomedical Knowledge Graph
for AI-Powered Data-Driven Biomedical Research
Huimin Cheng
Boston University
JADE: Joint Alignment and Deep Embedding
for Multi-Slice Spatial Transcriptomics
Steve Qin
Emory University
Adaptive / Bayesian Hierarchical Model and Its Applications
Xin Xing
Virginia Tech
The Evolution of Statistical Inference: From Classical P-values
Through Mirror Statistics to Modern Generative Models
Junni Zhang
Peking University
Fast, Scalable Bayesian Demography
Yuguo Chen
University of Illinois Urbana-Champaign
Sampling Sequential Importance Sampling
Roee Gutman
Brown University
Record Linkage as a Data Science Challenge
Mayetri Gupta
University of Glasgow
Bayesian Hierarchical Latent Variable-based Modelling
for Large and Complex Genomic Datasets
Scott Schmidler
Duke University
Bayesian Inference for Site-Dependent Evolutionary Models:
Applications to Vaccine Design

Breakfast & Coffee Break @ West Atrium
Workshop @ Winokur Hall

Time Speaker Title
(Click to Abstract)
1:00–2:00pm Chair: Linda Zhao
University of Pennsylvania
Keynote
Augustine Kong
University of Oxford
Some Reflections on How the Field of Statistics Has Changed
in the Last Forty Years and the Role of Chinese Students
2:00–2:30pm Break
2:30–3:30pm Chair: Rong Chen
Rurgers University
Organizer: Linda Zhao
AI in Action: From Complex Data to Credible Discovery
Tony Cai
University of Pennsylvania
Transcending Data Boundaries:
Transfer Knowledge in Statistical Learning
Edoardo Airoldi
Temple University
Valid Statistical Analyses and Reproducible Science
in the Era of High-throughput Biology
Qing Zhou
University of California,
Los Angeles
Causal Discovery on Dependent Data
3:30–4:15pm Moderator: Tianxi Cai
Harvard University
Organizer: Tianxi Cai
Panel Discussion:
AI Meets Statistics — A Two-Way Power Boost
Bo Jiang
Two Sigma
Xiao-Li Meng
Harvard University
Jamie Robins
Harvard University
Yuan Yuan
Google
4:15pm Xihong Lin
Harvard University
Closing Remark

INFORMATION

Lodging

(Recommended hotel. No Special Rate Provided)

CitizenM Back Bay
(near the club)
Sheraton Boston Hote
(near the club)
408 Newbury St, Boston 39 Dalton St, Boston
DoubleTree Suites by Hilton Hotel Boston
(near workshop)
Courtyard Boston Cambridge
(on campus)
Le Méridien Boston Cambridge
(on campus)
400 Soldiers Fld Rd, Boston 777 Memorial Dr, Cambridge 20 Sidney St, Cambridge

Parking

FREE block parking 8:00am to 5:00pm. 5-min walk to SEC.

The Organizing Committee

Linda Zhao (Contact)

University of Pennsylvania
lzhao@wharton.upenn.edu


Ping Ma (Contact)

University of Georgia
pingma@uga.edu


Tianxi Cai

Harvard University

Jiajun Gu

Jump Trading

Bo Jiang

Two Sigma

Sam Kou

Harvard University

Xihong Lin

Harvard University

Shirley Liu

GV20 Therapeutics

Website Design / Maintain

Jeff Cai (University of Notre Dame)
jcai2@nd.edu

Linda Zhao (University of Pennsylvania)
lzhao@wharton.upenn.edu