NTU Course
NewsHelpOverview

Computational Skills for Biological Data Analysis

Offered in 112-2
  • Serial Number

    41995

  • Course Number

    Agron5106

  • Course Identifier

    621 U7070

  • No Class

  • 3 Credits
  • Elective

    DEPARTMENT OF AGRONOMY / GRADUATE INSTITUTE OF AGRONOMY, BIOMETRY DIVISION / BIOLOGICAL STATISTICS

      Elective
    • DEPARTMENT OF AGRONOMY

    • GRADUATE INSTITUTE OF AGRONOMY, BIOMETRY DIVISION

    • BIOLOGICAL STATISTICS

  • STEVEN HUNG-HSI WU
  • Tue 6, 7, 8
  • Please contact the department office for more information

  • Type 3

  • 45 Student Quota

    NTU 45

  • No Specialization Program

  • English
  • NTU COOL
  • Notes
    The course is conducted in English。
  • NTU Enrollment Status

    Enrolled
    0/45
    Other Depts
    0/0
    Remaining
    0
    Registered
    0
  • Course Description
    In the era of big data, proficiency in several fundamental computational skills is required to conduct high-quality analysis and reproducible research in multiple disciplines. Within the field of biology and agriculture, large-scale datasets are easily accessible due to the advancement in technology. The amount of data will continue to increase at a dramatic rate over the following decades. Students will be required to have the ability to process and analyse large amounts of data efficiently in the “-omic” and even "post-omic" era. This course will introduce a few fundamental and transferable computational skills for students who work with biological data. These skills include but are not limited to command line interface, working with computer servers, software version control (Git and GitHub) for collaboration, software testing for reproducible analysis, working with the relational database (MySQL), data cleaning and manipulation. Although many of these skill sets are transferable to fields outside of biology, this course will focus on their application to biological data.
  • Course Objective
    At the completion of this course, students will be able to:
    • Work with the command line interface, navigate the filesystem, perform file manipulation, execute commands to solve simple tasks and connect to remote servers.
    • Utilise software version control systems (Git and GitHub) for reproducible work, collaborate with others, and perform software testing and validation.
    • Understand the principles and concepts of the relational database (MySQL), and be able to perform daily tasks, such as storing and retrieving data.
    • Work collaboratively in groups to solve computational challenges.
  • Course Requirement
    • This course will be conducted in English. All lectures, course materials, and assignments will be presented and conducted in English.
    • Cheating and plagiarism in assignments, exams or any other assessments are serious academic misconduct. All instances will be handled according to the university policy.
    • Students are required to have basic statistics knowledge. Any Statistics101 course that covers descriptive statistics, simple linear regression and ANOVA will be sufficient.
    • Students are required to have exposure to at least one programming language. It is recommended that students are familiar with the following basic concepts: declare variables, basic arithmetic operation, basic data type (R: vector, list, data.frame. Python: list, dictionary), declare functions.
  • Expected weekly study hours after class
    2-6 hours
  • Office Hour
    *This office hour requires an appointment
  • Designated Reading
  • References
  • Grading
    72%

    Assignments

    Several updates based on the feedback from last year

    14%

    Software colloboration - Individual

    14%

    Software colloboration - Group

  • Adjustment methods for students
  • Course Schedule
    Feb/20Week 1Introduction
    Feb/27Week 2Basic programming in Python and R – Part I
    Mar/05Week 3Basic programming in Python and R – Part II
    Mar/12Week 4Command line interface - Part I
    Mar/19Week 5Software version control with Git and GitHub
    Mar/26Week 6Introduction to the relational database
    Apr/02Week 7MySQL database - Basic operations
    Apr/09Week 8MySQL database - Advanced queries
    Apr/16Week 9MySQL database - Data manipulation
    Apr/23Week 10MySQL database - Admin
    Apr/30Week 11Collaboration on GitHub
    May/07Week 12Programming in Python and R - testing with unit testing
    May/14Week 13Programming in Python and R - error handling
    May/21Week 14Programming in Python and R- software development
    May/28Week 15Other topics in data analysis
    Jun/04Week 16Final group project presentation