Uses Python packages (pandas, seaborn) to visualize data from different perspectives. A good starting point for beginners learning how to analyze data and build storytelling skills.
Size: 2.13 MB
Added: Oct 15, 2025
Slides: 48 pages
Slide Content
National Taipei University of Nursing and Health Sciences (NTUNHS)
Data Visualization
Orozco Hsu
2025-10-13
About me
•Education
•NCU (MIS), NCCU (CS)
•Experiences
•Telecom big data Innovation
•Retail Media Network (RMN)
•Customer Data Platform (CDP)
•Know-your-customer (KYC)
•Digital Transformation
•LLM Architecture & Development
•Research
•Data Ops (ML Ops)
•Generative AI research
•Business Data Analysis, AI
Reference: https://commons.wikimedia.org/wiki/File:Data_visualization_process_v1.png
EDA is used to explore data; the goal is either BI (80%) or modeling (20%)
Data Sanity Check
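•EDA is the first step
•EDA helps us understand what the data looks like
•Total record count, missing-value ratio, how missing values are handled, distribution of field values, plausibility of field values (business domain knowledge)
•EDA helps later model prediction
•Preprocessing (Normalization and Standardization)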
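The checks listed above can be sketched in pandas roughly as follows. This is a minimal illustration, not the method used in the slides: the toy DataFrame, its column names, the median fill strategy, and the 0 to 120 valid age range are all assumptions made for the example.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data; columns and values are illustrative only.
df = pd.DataFrame({
    "age": [25, 32, np.nan, 41, 130],
    "income": [30000.0, 52000.0, 47000.0, np.nan, 61000.0],
    "city": ["Taipei", "Taipei", "Tainan", None, "Taichung"],
})

# 1. Total number of records.
n_rows = len(df)

# 2. Proportion of missing values per column.
missing_ratio = df.isna().mean()

# 3. One possible way to handle missing values (assumption):
#    fill numeric columns with their median.
numeric_cols = ["age", "income"]
filled = df[numeric_cols].fillna(df[numeric_cols].median())

# 4. Distribution of field values.
summary = filled.describe()
city_counts = df["city"].value_counts(dropna=False)

# 5. Plausibility check from domain knowledge
#    (assumed valid age range: 0 to 120).
implausible_age = df[(df["age"] < 0) | (df["age"] > 120)]

# 6. Normalization (min-max scaling to [0, 1]).
normalized = (filled - filled.min()) / (filled.max() - filled.min())

# 7. Standardization (z-score: zero mean, unit variance).
standardized = (filled - filled.mean()) / filled.std(ddof=0)

print("rows:", n_rows)
print(missing_ratio)
print(summary)
print(city_counts)
print(implausible_age)
```

In practice the plausibility rule and the missing-value strategy depend on the business domain, so both would normally be decided together with someone who knows the data.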
EDA is an approach to analyzing datasets to summarize their main characteristics,
often with visual methods (Wikipedia)