IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Data Mining for Junior Data Scientists: Data Analytics With Python

Data Mining for Junior Data Scientists: Data Analytics With Python
View Sample PDF
Copyright: 2023
Pages: 56
Source title: Principles and Theories of Data Mining With RapidMiner
Source Author(s)/Editor(s): Sarawut Ramjan (Thammasat University, Thailand)and Jirapon Sunkpho (Thammasat University, Thailand)
DOI: 10.4018/978-1-6684-4730-7.ch012

Purchase

View Data Mining for Junior Data Scientists: Data Analytics With Python on the publisher's website for pricing and purchasing information.

Abstract

It is crucial for junior data scientists to learn computer programming as data science software packages may not always cater to the requirements of data analysis. Python provides a vast library of algorithms for data analysis, including NumPy, Pandas, Matplotlib, Seaborn, and Scikit-learn. NumPy and Pandas aid in organizing datasets as part of the pre-processing stage, while Matplotlib and Seaborn offer a range of data visualization commands. These visualization tools are instrumental in data exploration processes, such as creating histograms and scatter plots, and displaying data mining results like cluster analysis outcomes. Scikit-learn is a popular library in the data science industry that offers various data mining commands for regression, decision constructs, and cluster analysis, covering both supervised and unsupervised learning. Therefore, junior data scientists must learn Python programming for data science applications, especially when using software packages that require editing the model using Python commands.

Related Content

. © 2023. 34 pages.
. © 2023. 15 pages.
. © 2023. 15 pages.
. © 2023. 18 pages.
. © 2023. 24 pages.
. © 2023. 32 pages.
. © 2023. 21 pages.
Body Bottom