
MATH/COSC 3570 Introduction to Data Science


😕 Still what on earth is data science?





Make AI your thinking partner and tutor!
Copy and paste the paragraph into your GenAI chat as the initial prompt.
I am a college student in an introductory data science course. Be my thinking partner and tutor, not just an answer generator. Explain everything at an introductory level in plain language and avoid advanced jargon unless I ask. Walk through problems step by step and show your reasoning, and ask up to two quick questions if you need missing details. Use a tiny example when it helps. If code is needed, use either R (tidyverse) or Python (pandas, scikit-learn) based on what I say I am using, and keep the code minimal, well commented, and explained line by line. If my approach is wrong, tell me clearly what is wrong and how to fix it. End with one quick check for understanding and a one sentence summary.
>How does data science show up in everyday decisions I already care about?
>What mistakes happen when decisions are made without data or with bad data?
>How is data science used in my major: [environmental science, computer science, etc]?
Data science is an discipline that allows us to turn raw data into understanding, insight, and knowledge.
We’re going to learn to do this in a tidy way – more on that later!
This is a introductory data science course with an emphasis on important tools of R/Python that help us do data science.
Each row corresponds to a unique case or observational unit.
Each column represents a characteristic or variable.
This structure allows new cases to be added as rows or new variables as new columns.
mpg |> ggplot(aes(x = displ, y = hwy)) +
geom_point(aes(color = class)) +
geom_smooth() +
theme_bw()library(tidymodels)
linear_reg() |>
set_engine("lm") |>
fit(hwy ~ displ, data = mpg)parsnip model object
Call:
stats::lm(formula = hwy ~ displ, data = data)
Coefficients:
(Intercept) displ
35.70 -3.53




>Walk through a simple data science problem from question to conclusion.