I am a Postdoctoral Research Fellow at the Fred Hutchinson Cancer Center in Seattle, where I am mentored by Dr. Raphael Gottardo and Dr. Steven Henikoff in terms of statistical modeling and computational analysis of single-cell transcriptomics, proteomics and epigenomics. Additionally, through training and close collaborating with Dr. Cameron Turtle and Dr. Evan Newell, I am also studying the CAR-T cell therapy clinical outcome association with genomic markers from the single-cell multi-omics perspective. Before joining Fred Hutch, I received the Ph.D. in Statistics from the University of Wisconsin-Madison in 2019 under the supervision of Dr. Sündüz Keleş and we stay in close collaboration and extend our previous statistical modelings in 3D genomics to the single-cell domain.
I am inherently drawn to problems at the interface of statistical, biological and biomedical sciences. My current research focuses on:
Statistical modeling and computational analysis of immunological and immunotherapeutic Studies using multi-omics bulk and single-cell genomics data, such as data generated from the CITE-seq, scCUT&Tag, scRNA-seq, flow cytometry, CUT&Tag, and RNA-seq technologies.
Investigating the three-dimensional chromatin organization and the long-range gene regulation through multimodality integrative model and accompanying software, using data such as scHi-C, scRNA-seq, Paired-Tag, scCUT&Tag-pro.
My career goal is to solve biological and clinically important, and methodologically challenging problems by innovating cutting-edge statistical models. I have built a repertoire of collaborative research experience with statisticians, computational biologists, molecular biologists, and immunologists worldwide, working with established principal investigators who leverage the interplay of statistics, computation, and molecular genomics. I am open to discussion and collaboration and look forward to being inspired and motivated by the novel and intriguing problems in other disciplines.
Ph.D. in Statistics - Minor in Quantitative Biology, 2019
University of Wisconsin - Madison
B.E. in Statistics, 2014
Renmin University of China
Single-cell Transcriptomics, Epigenomics and Proteomics:
CAR-T Cell Immunotherapy:
Dissertation Research:
Collaborative Work with the Bresnick Lab:
Projects:
+: Co-first author; ++: Co-corresponding author
ADTnorm: R package for normalization and integration tools for CITE-seq cell surface measurement.
scGAD: R package for extracting the three-dimensional chromatin interaction at the unit of genes and facilitate the integration of single-cell 3D genomcis with other single-cell modalities.
scVI-3D: Normalization and de-noising of single-cell Hi-C data using deep generative modeling using python pipline.
BandNorm: R package for fast band normalization for sing-cell Hi-C data. (Co-developer)
FreeHiC Spike-In: FreeHi-C python pipeline with a user/data-driven spike-in module to allow a comprehensive comparison of differential chromatin interaction detection methods where the ground truth differential chromatin interactions are known.
FreeHiC: Python pipeline using FRagment Interactions Empirical Estimation method for fast simulation of Hi-C and other 3D proximity ligation sequencing data. Major computing parts are accelerated by C.
mHiC: Python pipeline of multi-mapping strategy for Hi-C data by probabilistically assigning reads originatedfrom repetitive regions. Major computing parts are accelerated by C.
permseq: R package for mapping protein-DNA interactions in highly repetitive regions of the genomes with prior-enhanced read mapping.
permseqExample: R package for the permseq package illustration and demo runs. Smaller raw data and demo R scripts are provided for quick runs in order to get to know permseq package.
Daily
Daily
Daily
Accelerting Pipielines
Daily
Daily
STAT 877 - Statistical Methods for Molecular Biology (Fall 2020 Guest Lecturer):
Gave lecture to statistics and biostatistics graduate students about 3D Genomics and Long-range Gene Regulations.
STAT 998 - Statistical Consulting (Fall 2019 Guest Lecturer):
Lead lectures to discuss real-world consulting problem with statistics graduate students utilizing the traditional and modern statistical tools.
STAT 877 - Statistical Methods for Molecular Biology (Spring 2019 Guest Lecturer):
Gave lecture to statistics and biostatistics graduate students about 3D Genomics and Long-range Gene Regulations.
2017-2018 Single-cell Technologies Journal Club (Organizer and Instructor):
Gave lectures about single-cell related research topics, such as scRNA-seq, scATAC-seq and scHi-C, to graduate students and post-docs from statistics background, and led paper review discussions.
2017-2018 Three-dimensional Chromatin Interactions Journal Club (Organizer and Instructor):
Gave lectures about 3D chromatin architecture related research topics to graduate students and post-docs from statistics background, and led paper review discussions.
STAT301 - Introduction to Statistical Methods (Fall 2014 Guest Lecturer for Discussion Sections):
Led undergraduate students discussions for solving hypothesis testing and statistical estimation problems.
Feb. 2022 to Present, Long Nguyen, Bioinformatics Analyst I at Fred Hutchinson Cancer Center:
Single-cell transcriptomics and proteomics integrative analysis for cell atlas construction of CAR-T cell therapy CITE-seq data and association with gene and protein markers with clinical responses.
June 2020 to Present, Siqi Shen, Ph.D. Candidate at UW-Madison:
Co-mentor with Dr. Sunduz Keles on single-cell 3D chromatin organization normalization and integrative analysis with single-cell transcriptomics and epigenomics.
June 2020 to Sep. 2021, Fanding Zhou, VISP student at UW-Madison, Currently Ph.D. student at UC Berkeley:
Co-mentor with Dr. Sunduz Keles on constructing tree-based statistical models for the false discovery rate control of 3D chromatin organization differential detection.
Summer 2019, Olivia Rae Steidl, Summer Undergraduate Student at University of Wisconsin - Madison, Currently Ph.D. student at University of Wisconsin - Madison:
Co-mentor with Dr. Sunduz Keles on investigation of the poly(UG) tails at the end of RNAs and its function in human using eCLIP-seq data.
Genome Medicine
Science Advances
PLOS Computational Biology
BMC Bioinformatics
Life Science Alliance
Annals of Applied Statistics