STAT718/BIO703

Spring 2026


Genomic Data Science: AI & Bioinformatics

Instructor: Yen-Yi Ho

Office: LeConte 216A

Class Meetings:

Monday/Wednesday 14:20-15:35PM

Classroom: LeConte 206


Dr. Ho's office Hours: Tuesday afternoon 15:00-17:00 PM,  Thursday afternoon 15:00-16:00PM, or by appointment (LeConte 216A)

Email: hoyen@stat.sc.edu

Teaching Assistant:TBA

TA office hours: TBA

Textbook:

1. Introduction to Data Science: Data Wrangling and Visualization with R by Rafael A. Irizarry   (Required)

2. Python Data Science Handbook by Jake VanderPlas

available at https://jakevdp.github.io/PythonDataScienceHandbook/

3. Python for Biologists Tutorial. Available at https://www.pythonforbiologists.org/

4. Machine Learning for Biology Tutorial. Available at https://pythonforbiologists.com/

5. Deep Learning by Ian Goodfellow, Yoshua Bengio and Aaron Courville. Available at https://github.com/janishar/mit-deep-learning-book-pdf or https://www.deeplearningbook.org

6. Deep Learning for Genomics data-driven approaches for genomics applications in life sciences and biotechnology by Upendra Kumar Devisetty (available online from the Thomas Cooper Library)

Recommended

1. Python Crash Course, 3rd Edition: A Hands-On, Project-Based Introduction to Programming by Eric Mattens.

2. Deep Learning by PyTorch: Build, Train, and Tune Neural Networks Using Python Tools by Eli Stevens, Luca Antiga and Thomas Viehmann


Announcements:

Approximate course outline: (Lecture notes will be updated often)

Date Weekly topic
Homework
Code
  Reading         
Week 1
Jan 12
Syllabus


Lecture 1: Introduction to Genomic Data


Getting Started with Jupyter Notebook and JupyterLab




R Markdown (Chapter 20.2 in Rafa)






Google Colab Coupon


Link for Requesting
HPC account

(Choose Research Computing Account creation)


Week 2
Jan 19

Jan 19: No Class

Python Basic 1
  Data Types
   NumPy







Python Basic 1
Notebook1

Python Basic 2
Notebook2


Python Data Science Handbook Chap 2
Week 3
Jan 26

Plots in Python

Python Functions
 
If and Loops

Modules and Packages






Python Basic 3
Notebook3


Python Basic 4
Notebook4

Week 4
Feb 2

 
  
  Data Manipulation with Pandas

   Visualization with Matplotlib
 








Python Data Science Handbook Chap 3 & 4
Week 5
Feb 9

   
     BioPython Tutorial
     Machine Learning Part I

 








Week 6
Feb 16
 
  
    Machine Learning Part II
   

  

 















Week 7
Feb 23



DNA Convolutional Neuron Networks and Applications in Regulatory Genomics: DeepBind

DNA Methlyation: DeepCGC
















Week 8
March 2


      
 
Gene Expression, Splicing and Coexpression

    ResNet
 

  CNN for Gene Coexpression (CNNC) in Single-Cell Data



  

     













Week 9
March 9
  Spring Break: No Classes





Week 10
March 16

    Single-Cell RNA-seq
  










Week 11
March 23


Single-Cell RNA-seq with Autoencoder
   
     MMD-ResNet
      
     DCA
 
     Proust









HPC tutorial

Linux Commands

Linux File Transfer





Week 12
March 30



   Deep Generative Models for Single-Cell Transcriptomics
     
 







Week 13
April 6


   Spatial Transcriptomics
    
    SuperST












Week 14
April 13

    Drug Discovery

    Graphical Neural Networks
t




Week 15
April 20

    Student Presentation

 



Week 16
April 27

    Student Presentation
Final Project Due
Monday May 4 before 5PM