TY - GEN
T1 - CRISP-DM for Data Science
T2 - 2021 IEEE International Conference on Big Data, Big Data 2021
AU - Saltz, Jeffrey S.
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - This paper explores the strengths and weaknesses of CRISP-DM when used for data science projects. The paper then explores what key actions data science teams using CRISP-DM should consider that addresses CRISP-DM's weaknesses. In brief, CRISP-DM, which is the most popular framework teams use to execute data science projects, provides an easy to understand description of the data science project workflow (i.e., the data science life cycle). However, CRISP-DM's project phases miss some key aspects of the data science project life cycle. In addition, CRISP-DM's task-focused approach fails to address how a team should prioritize tasks, and in general, collaborate and communicate. Hence, this paper also describes how CRISP-DM could be combined with a team coordination framework, such as Scrum or Data Driven Scrum, which is a newer collaboration framework developed to address the unique data science coordination challenges.
AB - This paper explores the strengths and weaknesses of CRISP-DM when used for data science projects. The paper then explores what key actions data science teams using CRISP-DM should consider that addresses CRISP-DM's weaknesses. In brief, CRISP-DM, which is the most popular framework teams use to execute data science projects, provides an easy to understand description of the data science project workflow (i.e., the data science life cycle). However, CRISP-DM's project phases miss some key aspects of the data science project life cycle. In addition, CRISP-DM's task-focused approach fails to address how a team should prioritize tasks, and in general, collaborate and communicate. Hence, this paper also describes how CRISP-DM could be combined with a team coordination framework, such as Scrum or Data Driven Scrum, which is a newer collaboration framework developed to address the unique data science coordination challenges.
UR - http://www.scopus.com/inward/record.url?scp=85125312919&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85125312919&partnerID=8YFLogxK
U2 - 10.1109/BigData52589.2021.9671634
DO - 10.1109/BigData52589.2021.9671634
M3 - Conference contribution
AN - SCOPUS:85125312919
T3 - Proceedings - 2021 IEEE International Conference on Big Data, Big Data 2021
SP - 2337
EP - 2344
BT - Proceedings - 2021 IEEE International Conference on Big Data, Big Data 2021
A2 - Chen, Yixin
A2 - Ludwig, Heiko
A2 - Tu, Yicheng
A2 - Fayyad, Usama
A2 - Zhu, Xingquan
A2 - Hu, Xiaohua Tony
A2 - Byna, Suren
A2 - Liu, Xiong
A2 - Zhang, Jianping
A2 - Pan, Shirui
A2 - Papalexakis, Vagelis
A2 - Wang, Jianwu
A2 - Cuzzocrea, Alfredo
A2 - Ordonez, Carlos
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 15 December 2021 through 18 December 2021
ER -