國立中山大學 114學年度第1學期 課程教學大綱

National Sun Yat-sen University 114Academic year1st Semester Course syllabus

中文名稱
Course name(Chinese)

大數據分析、機器學習、與人工智慧方法

課號
Course Code

CM503

英文名稱
Course name(English)

BIG DATA ANALYTICS, MACHINE LEARNING, AND ARTIFICIAL INTELLIGENCE

課程類別
Type of the course

講授類

必選修
Required/Selected

選修

系所
Dept./faculty

管理學院

授課教師
Instructor

康藝晃

學分
Credit

3

課程大綱 Course syllabus

         本課程教學大綱已提供完整英文資訊(本選項僅供統計使用,未提供完整英文資訊者,得免勾記)【Provide information of course syllabus in English.(This is for statistical use only. For those who do not provide information of course syllabus in English, do not check this field.)】

*** 本課程採「同步遠距含錄影」上課,課程與連線相關資訊將置於本校網路大學課程頁面, 請至本校圖資處首頁網路大學登入***
*** 未選上課無法登入的同學,第一堂課上課請至 https://meet.google.com/qii-pwtd-ofj ***
*** 本課程有多個課號(MIS 413/572, CM 503),請修習相關學程的同學注意 ***

Prerequisite:
* Programming Proficiency: It is essential for students to have familiarity with at least one high-level programming language.
Expertise in scientific programming languages, such as R, Matlab, Python, Julia, or SAS, will be especially beneficial.
* Relational Databases & SQL: Students must possess a foundational understanding of relational databases and the Structured Query
Language (SQL). This knowledge is pivotal for many data engineering components within the course.
* Data Structures Fundamentals: A grasp of basic data structures, including but not limited to arrays, lists, sets, and dictionaries,
is vital. This understanding will serve students well as they delve into the intricacies of Data Engineering and Analytics.
* Introductory Statistics: A solid grounding in statistical concepts is required. This includes understanding descriptive analytics,
which encompasses measures of data dispersion and central tendency, as well as diagnostic analytics, such as hypothesis testing. Such
knowledge will pave the way for a deeper comprehension of data analytics methodologies and statistical machine learning algorithms.
* Analytical & Problem-solving Skills: The practical aspects of this course necessitate strong analytical thinking and
problem-solving abilities. Students should be prepared to apply their knowledge to tackle real-world challenges.

For those students who may find themselves lacking in any of the outlined prerequisites, it is strongly advised to pursue supplementary 
coursework or dedicated self-study. This proactive approach will ensure a richer and more effective learning experience throughout the course.




課程目標 Objectives

        

This course offers an in-depth exploration of the multifaceted domains of AI, Machine Learning (ML), and Big Data Analytics. Students will gain a
foundational understanding of Data Science, tracing its evolution and significance across diverse sectors such as business, healthcare, insurance, and finance.
The curriculum addresses essential topics, encompassing Big Data Analytics, Data Engineering, Business Analytics, Machine Learning Design Patterns, and the nuances 
between Supervised & Unsupervised Learning. Enhancing the course content, students will explore foundational concepts like deep neural networks, model architecture
design, functional programming, and ML Interpretability, which aim to demystify the black-box nature of many ML models. A central component of our curriculum
is the transformative potential of large language models, such as the GPT-series, Gemini, and Claude. Students will appreciate how these models are pivotal in data analytics,
especially in their capacity to generate R and Python code for streamlined and automated data processing.
Throughout the course, the emphasis on practical applications ensures that students garner hands-on experience with the R and Python programming languages, 
addressing modern data analytics challenges. This course is tailored for those eager to both grasp and apply the principles of data science and ML/AI in concrete
real-world contexts.




授課方式 Teaching methods

         1. Onsite & online video lectures
2. In-class quiz


















評分方式﹝評分標準及比例﹞Evaluation (Criteria and ratio)等第制單科成績對照表 letter grading reference

        
1.Quiz10%
2.In-class exercise20%
3.Homework30%
4.Midterm Proposal20%
5.Final Project20%

參考書/教科書/閱讀文獻 Reference book/ textbook/ documents
〔請遵守智慧財產權觀念,不可非法影印。教師所提供之教材供學生本人自修學習使用,不得散播及做為商業用途〕
No copies for intellectual property rights. Textbooks provided by the instructor used only for self-study, can not broadcast or commercial use

        

* G. James, D. Witten, T. Hastie, and R. Tibshirani, An Introduction to Statistical Learning: with Applications in R, 2nd Edition, Springer
(Available free online: https://www.statlearning.com/)
* G. James, D. Witten, T. Hastie, and R. Tibshirani, J. Taylor, An Introduction to Statistical Learning: with Applications in Python, Springer
(Available free online: https://www.statlearning.com/)
* F. Buisson, Behavioral Data Analysis with R and Python. O’Reilly Media, Inc., 2021.
* K. Hwang and M. Chen, Big-Data Analytics for Cloud, IoT and Cognitive Computing, 1st ed. Wiley Publishing, 2017.
* EMC Education Services, Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data. John Wiley & Sons, 2015.
* N. Matloff, The Art of R Programming: A Tour of Statistical Software Design, 1st edition. No Starch Press, 2011.
* Kabacoff, Robert. R in Action. Manning Publications Co., 2011
* C. O'Neil and R. Schutt, Doing Data Science: Straight Talk from the Frontline, 1st edition. O'Reilly Media, 2013.
* F. Chollet and J. J. Allaire, Deep Learning with R, 1 edition. Manning Publications, 2018.
* A. Géron, Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems.
O’Reilly Media, Inc., 2019.
* W. McKinney, Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython. O’Reilly Media, Inc., 2017.
* F. Provost, T. Fawcett. Data Science for Business: What you need to know about data mining and data-analytic thinking, O'Reilly Media, Inc., 2013.
* C. Molnar, Interpretable Machine Learning: A Guide For Making Black Box Models Explainable. Munich, Germany: Independently published, 2022.
(https://christophm.github.io/interpretable-ml-book/)
* J. Pearl, M. Glymour, and N. P. Jewell, Causal Inference in Statistics - A Primer, 1st edition. Chichester, West Sussex: Wiley, 2016.
* D. Rothman, Transformers for Natural Language Processing and Computer Vision: Explore Generative AI and Large Language Models with Hugging Face, ChatGPT, GPT-4V, and DALL-E 3. 3rd Edition, Packt Publishing Ltd, 2024.




課程時數規劃 Course Hour Planning

本校自114學年度起實施學期16週,仍以1學分18小時為原則。教師課程時數安排得選擇「16週+自主學習規劃」或「16週+實體上課規劃」。
Starting from the 114th academic year, the university will implement a 16-week course schedule, while maintaining the standard of 18 hours of instruction per credit. Instructors can choose between “16-weeks + Alternative learning periods”or“16-weeks + In-person classes.”

本門課程為「16週+自主學習規劃」:教師需於「A.每週課程內容及預計進度」欄位填寫16週課程進度,並於「B.自主學習規劃」欄位填寫每1學分2小時學生自主學習內容。
16 weeks + alternative learning periods: The instructor will include a 16-week course plan in the weekly scheduled progress section(16 hours of instruction per credit) and provide details of the learning plan (two hours of activity per credit) in the alternative learning period section.
本門課程為「16週+實體上課規劃」:教師需於「A.每週課程內容及預計進度」欄位填寫16週課程進度,並於「C.實體上課規劃」填寫2次授課內容及主題。
16 weeks + in-person classes: The instructor will include a 16-week course plan in the weekly scheduled progress section (16 hours of instruction per credit) and specify the content and topics of the 2 in-person classes in the in-person class plan section.

A.每週課程內容及預計進度 Weekly scheduled progress

        
全英課程之授課內容及主題應以英文或雙語呈現
For courses taught entirely in English, the content and topics should be presented in English or bilingually.
週次日期授課內容及主題
WeekDateContent and topic
12025/09/07~2025/09/13Course Introduction
22025/09/14~2025/09/20Data Engineering — I
32025/09/21~2025/09/27Data Engineering — II
42025/09/28~2025/10/04Fundamentals of Data Analytics — I
52025/10/05~2025/10/11Fundamentals of Data Analytics — II
62025/10/12~2025/10/18Fundamentals of Data Analytics — III
72025/10/19~2025/10/25Introduction to Statistical Learning — I
82025/10/26~2025/11/01Project Proposal Defense – I
92025/11/02~2025/11/08Project Proposal Defense – II
102025/11/09~2025/11/15Introduction to Statistical Learning — II
112025/11/16~2025/11/22Supervised Learning — Regression & Classification
122025/11/23~2025/11/29Unsupervised Learning — Recommendation Systems & Matrix Factorization
132025/11/30~2025/12/06Topics in Interpretable Machine Learning and Causal Inference
142025/12/07~2025/12/13Topics in Text Analytics and Large Language Models
152025/12/14~2025/12/20Term Project Presentation — I
162025/12/21~2025/12/27Term Project Presentation — II

B.自主學習規劃 Alternative learning periods

課程規劃學生自主學習內容(每1學分2小時)
Alternative learning periods planned for the course (with each credit corresponding to two hours of activity)

本門課程規劃學生彈性或自主學習內容(每1學分2小時):
Alternative learning periods planned for the course (each credit corresponds to two hours of activity):
學生自主學習活動
Alternative learning periods
勾選或填寫規劃內容
Place a check in the appropriate box or provide details
時數
Number of hours
學生分組實作及討論
Group work and discussion
參與課程相關作業、作品、實驗
Participation in course-related assignments, work, or experiments
參與校內外活動(研習營、工作坊、參訪)或競賽
Participation in on- or off-campus activities (e.g., seminars, workshops, and visits) or competitions
課外閱讀
Extracurricular reading
線上數位教材學習
Learning with online digital learning materials
其他(請填寫規劃內容)
Other (please provide details)

C.實體上課規劃 In-Person Class Plan

若無規劃學生自主學習,則請教師規劃2次實體上課(每1學分2小時),上課時間由師生自行討論,得利用週三下午4-7點或其他時段進行。
If there are no alternative learning periods planned for the course, the instructor should plan 2 in-person classes (2 hours of activity per credit). Class schedules can be arranged through discussions between instructors and students, utilizing Wednesday 4:00PM-7:00PM or other suitable time slots.
*第一次實體上課內容及主題 (Content and topic for the first In-Person class):
*第二次實體上課內容及主題 (Content and topic for the second In-Person class):

課業討論時間 Office hours

         時段1 Time period 1:
時間 Time:星期一10am-12pm
地點 Office/Laboratory:By appointment
時段2 Time period 2:
時間 Time:星期三10am-12pm
地點 Office/Laboratory:By appointment

系所學生專業能力/全校學生基本素養與核心能力 basic disciplines and core capabilitics of the dcpartment and the university

        
系所學生專業能力/全校學生基本素養與核心能力
basic disciplines and core capabilities of the department and the university
課堂活動與評量方式
Class activities and evaluation
本課程欲培養之能力與素養 This course enables students to achieve.紙筆考試或測驗 Test.課堂討論︵含個案討論︶ Group discussion (case analysis).個人書面報告、作業、作品、實驗 Indivisual paper report/ assignment/ work or experiment.群組書面報告、作業、作品、實驗 Group paper report/ assignment/ work or experiment.個人口頭報告 Indivisual oral presentation.群組口頭報告 Group oral presentation.課程規劃之校外參訪及實習 Off-campus visit and intership.證照/檢定 License.參與課程規劃之校內外活動及競賽 Participate in off-campus/ on-campus activities and competitions.課外閱讀 Outside reading.
※系所學生專業能力 Basic disciplines and core capabilities of the department
※全校學生基本素養與核心能力 Basic disciplines and core capabilities of the university
1.表達與溝通能力。 1. Articulation and communication skills           
2.探究與批判思考能力。 2. Inquisitive and critical thinking abilities           
3.終身學習能力。 3. Lifelong learning           
4.倫理與社會責任。 4. Ethnics and social responsibility           
5.美感品味。 5. Aesthetic appreciation           
6.創造力。 6. Creativity           
7.全球視野。 7. Global perspective           
8.合作與領導能力。 8. Team work and leadership           
9.山海胸襟與自然情懷。 9. Broad-mindedness and the embrace of nature            

本課程與SDGs相關項目:The course relates to SDGs items:

        
SDG1-消除貧窮(No Poverty)
SDG2-消除飢餓 (Zero Hunger)
SDG3-良好健康與福祉(Good Health and Well-being)
SDG4-教育品質(Quality Education)
SDG5-性別平等(Gender Equality)
SDG6-乾淨水源與公共衛生(Clean Water and Sanitation)
SDG7-可負擔乾淨能源(Affordable and Clean Energy)
SDG8-優質工作與經濟成長(Decent Work and Economic Growth)
SDG9-工業、創新和基礎建設(Industry,Innovation and Infrastructure)
SDG10-減少不平等(Reduced Inequalities)
SDG11-永續城市(Sustainable Cities and Communities)
SDG12-責任消費與生產(Responsible Consumption and Production)
SDG13-氣候行動(Climate Action)
SDG14-海洋生態(Life Below Water)
SDG15-陸域生態(Life on Land)
SDG16-和平、正義和穩健的制度(Peace,Justice And Strong Institutions)
SDG17-促進目標實現的全球夥伴關係(Partnership for the Goals)
本課程和SDGS無關

本課程校外實習資訊: This course is relevant to internship:

         本課程包含校外實習(本選項僅供統計使用,無校外實習者,得免勾記)
The course includes internship.(For statistical use only. If the course without internship, please ignore this item.)

實習定義:規劃具有學分或時數之必修或選修課程,且安排學生進行實務與理論課程實習,於實習終了取得考核證明繳回學校後,始得獲得學分;或滿足畢業條件者。(一般校內實習請勿勾選此欄位)

Internship: The required or elective courses should include credits and learning hours. Students should participate in the corporative company or institution to practice and learn the real skills. An internship certification must be handed in at the end of internship to get the credits or to fulfil the graduation requirements.

回上一頁