Great averages so far:
Wednesday: A1 grades posted, A2 due BEFORE class, go over A2 in class
Awesome feedback!
1
In if statement, when there is no alternative action needed, maybe there is no need to include "else: pass"
2
I enjoyed reading this code because it was very simple to follow and efficient. Nothing seemed unnecessary and the thought process/logic was clear. Something that I learned was how you accomplished Q7. I struggled with this a bit and it was great to learn how you did it. Nesting for loops seems like the most clear way to do this and now that I have seen it work in action I completely understand why it works.
eda.py
df.describe()
) ... much fasterSomeone(s): tell me some favorite plots you've seen?
The ability to plot large datasets is both powerful and exciting
(Take notes during this period)
Important: Data viz (and analysis) is iterative: you learn what's worth looking at only as you go
Overlaps with our ABCD rule: PLOT A LOT! A LOT! A LOT!
("sns" means the "seaborn" package)
describe
part of distributiontd_a
) has a mean of 0.24 and std 0.38Read Chapter 3.3 for much more discussion of plotting (the whys and the hows)
# | Step | Note |
---|---|---|
0 | Ask a question about the data | Ex: What is the distribution of unemployment in each state? |
1 | Q > What the plot should look like. Draw it! | Draw it on paper! |
2 | Plot appearance > which plot function/options to use | Find a pd or sns plot example that looks like that. |
3 | The function dictates how data should be formatted before you call the plot | Key: Wide or tall? |