Great averages so far:
Wednesday: A1 grades posted, A2 due BEFORE class, go over A2 in class
Awesome feedback!
1
In if statement, when there is no alternative action needed, maybe there is no need to include "else: pass"
2
I enjoyed reading this code because it was very simple to follow and efficient. Nothing seemed unnecessary and the thought process/logic was clear. Something that I learned was how you accomplished Q7. I struggled with this a bit and it was great to learn how you did it. Nesting for loops seems like the most clear way to do this and now that I have seen it work in action I completely understand why it works.
eda.py df.describe()) ... much fasterThe ability to plot large datasets is both powerful and exciting
(Take notes during this period)
Important: Data viz (and analysis) is iterative: you learn what's worth looking at only as you go
Overlaps with our ABCD rule: PLOT A LOT! A LOT! A LOT!
("sns" means the "seaborn" package)
describe part of distributiontd_a) has a mean of 0.24 and std 0.38Read Chapter 3.3 for much more discussion of plotting (the whys and the hows)
| # | Step | Note |
|---|---|---|
| 0 | Ask a question about the data | Ex: What is the distribution of unemployment in each state? |
| 1 | Q > What the plot should look like. Draw it! | Draw it on paper! |
| 2 | Plot appearance > which plot function/options to use | Find a pd or sns plot example that looks like that. |
| 3 | The function dictates how data should be formatted before you call the plot | Key: Wide or tall? |