Welcome

Our main research interest is in understanding the structure and function of genomes, especially those of medical or agricultural importance. The core strength of our research is in developing novel algorithms and computational systems for large-scale biological sequence analysis, including leading algorithms for de novo genome assembly, variant detection, and related –omics assays. Using these advances we have contributed to the de novo genome assemblies of dozens of species; probed the sequence variations related to autism, cancer, and other human diseases; mapped the transcriptional and epigenetic profiles of tomatoes, corn, and other important plant species; and explored the role of microbes in different environments. In response to the deluge of biological sequence data we are now facing, we have also been at the forefront of distributed and parallel computing in genomics, and have pioneered the use of cloud computing as an enabling platform to address the big data challenges we are all facing.

Looking forward, we see ourselves at the intersection of biotechnology and algorithmics, developing systems for probing the structure and function of genomes using the best technologies possible. Our expertise spans from low level computer architecture, through sequencing, de novo assembly, variant identification, transcriptome & other -omics data and up to machine learning approaches to build predictive models of diseases and treatment response. In addition to ongoing projects in autism, cancer, and other human diseases, we also study agricultrual systems to better understand the underlying genotype to phenotype relations. Altogether, we intend to develop powerful new methods for analyzing large collections of genomes to address questions of disease, development, and evolution. Among other recognition, for this work I was granted an NSF CAREER award, a Sloan Foundation Fellowship, and was named a TIME100 recipient in 2022.

Recent News
» Gapless assembly of complete human and plant chromosomes using only nanopore sequencing
Nov 6, 2024
» BEATRICE: Bayesian Fine-mapping from Summary Data using Deep Variational Inference
Oct 3, 2024
» High-coverage nanopore sequencing of samples from the 1000 Genomes Project to build a comprehensive catalog of human genetic variation
Oct 2, 2024
» Solanum pan-genomics and pan-genetics reveal paralogs as contingencies in crop engineering
Sept 14, 2024
» Differences in activity and stability drive transposable element variation in tropical and temperate maize
Sept 9, 2024
(past news)

Upcoming Events

~~ 2024 ~~

» AnVIL Community Conference
CSHL, NY. November 12-13 2024
» Biological Data Science
CSHL, NY. November 13-16 2024
~~ 2025 ~~

» Plant and Animal Genomes Conference (PAG)
San Diego, CA. Jan 2025
» Advances in Genome Biology and Technology (AGBT)
Marco Island, FL. Feb 2025
» Advances in Genome Biology and Technology (AGBT) Ag
Orlando, FL. March 2025
» Biology of Genomes
CSHL, NY. May 2025
» Galaxy Community Conference
CSHL, NY. June 2025
» American Society of Human Genetics
Boston MA, October 14-18. 2024
(presentation archive)



Michael Schatz

Bloomberg Distinguished
Professor of Computer Science
and Biology

CS office and Mailing address:
Johns Hopkins University
Department of Computer Science
3400 N Charles St
Malone Hall 323
Baltimore, MD 21218

Department of Biology:
UTL 391 (office)
UTL 398 (lab and conference room)

Cell: (703) 966-1987
E-mail: mschatz <at> cs.jhu.edu
Twitter: @mike_schatz
LinkedIn: mschatz
BlueSky: mikeschatz.bsky.social