Exploratory Analysis

Data visualization, part 1. Code for Quiz 7.

1. Load the R package we will use.

2. Quiz questions

Question: modify slide 34

Question: Modify intro-slide 35

ggplot(faithful) + 
   geom_point(aes(x = eruptions, y = waiting),
              colour = "purple")   

Question: Modify intro-slide 36

ggplot(faithful) + 
   geom_histogram(aes(x = waiting))

Question: Modify geom-ex-1

ggplot(faithful) + 
   geom_point(aes(x = eruptions, y = waiting), 
   shape = "cross", size = 4, alpha =0.3)

Question: Modify geom-ex-2

ggplot(faithful) + 
   geom_histogram(aes(x = eruptions, fill = eruptions > 3.2 ))

Question: Modify stat-slide-40

ggplot(mpg) + 
   geom_bar(aes(x = manufacturer))

Question: Modify stat-slide-41

mpg_counted <- mpg %>% 
  count(manufacturer, name = 'count')
ggplot(mpg_counted) + 
  geom_bar(aes(x = manufacturer, y = count), stat = 'identity')

Question: Modify stat-slide-43

ggplot(mpg) + 
  geom_bar(aes(x = manufacturer, y = after_stat(100 * count / sum(count))))

Question: Modify answer to stat-ex-2

for reference see: https://ggplot2.tidyverse.org/reference/stat_summary.html?q=stat%20_%20summary#examples

Use stat_summary() to add a dot at the median of each group

ggplot(mpg) + 
  geom_jitter(aes(x = class, y = hwy), width = 0.2) +
  stat_summary(aes(x = class, y = hwy), geom = "point", 
  fun = "median", color = "dodgerblue", 
  shape = "plus", size = 2 )

ggsave(filename = "preview.png", 
       path = here::here("_posts", "2022-03-18-exploratory-analysis"))