Home > Subject Guide - Applied Data Science

Basic information about Applied Data Science
This guide provides an overview of the library resources available for Applied Data Science students.
Programme website
Course Reading
Code Course Name
ADS100Introduction to Data Science
ADS130Probability and Statistics
ADS151Python for Data Science
ADS210DIGITAL HUMANITIES: THEORIES AND METHODS
ADS220Python for Data Science
ADS230Introduction to Database Systems
ADS240Computer Programming & Problem Solving
ACM Digital Library
ACM Digital Library is a service offered by the Association for Computing Machinery. It includes journals, conference proceedings, magazines and newsletters. The platform provides the full images of journals, magazines, transactions, and conference proceedings. 

Statista
Statista is one of the world's leading statistics portals offering statistics from over 22,500 sources as well as their own analytics on digital & consumer markets.

GitHub - Open Source Code
GitHub is a web-based platform that serves as a hub for software development. It provides a collaborative environment for developers to work together on projects, track changes, and manage version control. GitHub is widely used for open-source projects, where developers worldwide can contribute code and collaborate on projects. It also offers various features such as bug tracking, project management, and code review tools. GitHub has become an essential tool for modern software development, enabling teams to work together efficiently and effectively.

Kaggle - Dataset
Kaggle is a popular online platform that hosts data science competitions, datasets, and machine learning models. It was founded in 2010 and has since grown into a vibrant community of data scientists, machine learning engineers, and researchers. Kaggle provides a platform for individuals and teams to compete against each other in solving real-world data problems by using machine learning and statistical techniques. The competitions hosted on Kaggle cover a wide range of topics, including image recognition, natural language processing, and predictive modeling. In addition to competitions, Kaggle also provides access to a vast collection of high-quality datasets, notebooks, and tutorials, making it an excellent resource for learning and practicing data science skills. With its strong community and resources, Kaggle has become a go-to platform for data enthusiasts and professionals worldwide.
data.world - Dataset
Data.world is a collaborative platform for data scientists, analysts, and enthusiasts to discover, share, and analyze data. It was founded in 2015 and has since grown into a thriving community of data professionals who use the platform to find and work with data. Data.world offers a variety of features, including a searchable repository of public datasets, tools for data analysis and visualization, and collaboration tools for teams. Users can upload their own datasets, collaborate with others on data projects, and share insights and findings with the broader community. The platform also offers integrations with popular data analysis tools like Tableau, R, and Python. With its focus on collaboration and community, Data.world has become a valuable resource for anyone looking to work with data, from beginners to seasoned professionals.
UC Irvine Machine Learning Repository - Dataset
The UC Irvine Machine Learning Repository is a collection of datasets that are widely used in the machine learning community for research and education purposes. It was created in 1987 as a way to make datasets more widely available to researchers and has since grown to include over 500 datasets. The datasets cover a wide range of topics, including classification, regression, clustering, and recommendation systems. Many of the datasets have been preprocessed and cleaned, making them suitable for use in machine learning experiments. The repository also provides various tools for accessing and working with the data, including software libraries and data visualization tools. With its extensive collection of datasets and resources, the UC Irvine Machine Learning Repository has become a valuable resource for researchers, students, and machine learning practitioners worldwide.
Physical Books
  Python Programming For Beginners In 2021: learning python in 5 days with step-by-step guidance, hands-on exercises and solution [fun tutorial for novice programmers]
Publication Date : 2021
Call number : 005.133 TUD 2021
Location : English Book (4/F)
  以Python取勝 : 計量交易快速上手
Publication Date : 2021
Call number : 563.53029 1612 2021
Location : Chinese Book (2/F)
  Data mining for business analytics : concepts, techniques and applications in python
Publication Date : 2020
Call number : 005.54 DAT 2020
Location : English Book (4/F)
  超圖解資料科學 ✕ 機器學習實戰探索 : 使用 Python = Practical exploration
Publication Date : 2021
Call number : 312.831 1214 2021
Location : Chinese Book (2/F)
  大數據分析與資料挖礦
Publication Date : 2018
Call number : 312.74 1814 2018
Location : Chinese Book (2/F)
Chinese eBooks
  大數據時代超吸睛視覺化工具與技術:Tableau打造30個經典數據圖表
Publication Date : 2021
Access via 華藝電子書 [ebook]
  TensorFlow自然語言處理:善用Python深度學習函式庫 教機器學會自然語言
Publication Date : 2019
Access via HyRead [ebook]
  Python零基礎入門班:一次打好程式設計、運算思維與邏輯訓練基本功! Publication Date : 2021
Access via HyRead [ebook]
  圖說演算法:使用Python:理解零負擔.採高CP值Python語言實作
Publication Date : 2018
Access via HyRead [ebook]
  圖解統計與大數據:圖解讓統計與大數據更簡單
Publication Date : 2018
Access via HyRead [ebook]
English eBooks
  Meta-learning : theory, algorithms and applications
Publication Date : 2023
Access via BSCOhost [ebook]
  Using AI for dialoguing with texts : from psychology to cinema and literature
Publication Date : 2023
Access via Ebook Central Perpetual [ebook]
  Statistics and data visualisation with Python
Publication Date : 2023
Access via Ebook Central Perpetual [ebook]
  Deep learning in practice
Publication Date : 2022
Access via Ebook Central Perpetual [ebook]
  Handbook of computer programming with Python
Publication Date : 2022
Access via Ebook Central Perpetual [ebook]