RaPID

Random Projection-based IBD Detection for ultra-fast discovery of identity-by-descent segments in biobank-scale cohorts.

IBD Population Genetics Biobank Scale

Med-BERT

Contextualized embeddings and pretraining framework for structured electronic health records and disease prediction.

EHR Foundation Models Clinical AI

pytorch_ehr

PyTorch codebase for deep learning models over longitudinal electronic health record data.

PyTorch Predictive Modeling EHR

Gene2vec

Distributed representation of genes based on co-expression patterns for downstream bioinformatics modeling.

Gene Embedding Bioinformatics

HapSeq2

Methods for genotype calling and phasing in whole-genome sequencing data.

Phasing WGS

CovRNN

Deep learning framework for modeling clinical trajectories and COVID-19-related clinical outcomes from EHR data.

Clinical Time Series Risk Prediction