Summarizing rows

Author

Jeffrey R. Stevens

Published

February 21, 2025

For these exercises, we’ll use a new clean version of the dog breed traits data set.

  1. Import the data from https://jeffreyrstevens.github.io/dpavir2025/data/dog_breed_traits_clean.csv and assign it to traits.
# >
  1. What is the overall mean rating for affectionate?
# >
  1. What is the overall mean rating for each of the rating columns, ignoring NAs?
# >
  1. How many breeds are there in each coat type?
# >
  1. What is the median grooming rating for each coat type?
# >
  1. What is the lowest rating per coat length for each of the rating columns, ignoring NAs?
# >
  1. For dogs with medium coat length, what are the sample size, mean, and standard deviation of shedding ratings for each coat type? Include only coat types with 5 or more samples, and sort the results from largest to smallest sample size. Note: the order mentioned here is not necessarily the correct order for your pipeline.
# >
  1. Calculate each breed’s mean rating across all of the rating columns and return a data frame with the highest of these mean ratings for each coat type. Don’t forget to undo rowwise() with ungroup() before further calculations.
# >
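
As a hint for the last exercise, here is a minimal sketch of the rowwise() then ungroup() pattern. It assumes dplyr is installed and uses a small made-up data frame: toy, item, group, score_1, and score_2 are hypothetical names, not columns from the dog breed traits data. It is one possible way to structure the pipeline, not the intended solution.

```r
library(dplyr)

# Hypothetical toy data: names and values are invented for illustration
# and are not taken from the dog breed traits data set.
toy <- tibble(
  item    = c("a", "b", "c", "d"),
  group   = c("x", "x", "y", "y"),
  score_1 = c(1, 4, 2, 5),
  score_2 = c(3, 4, NA, 1)
)

# Step 1: compute a mean across columns for each row.
# rowwise() makes mean() operate within each row; c_across() collects
# the selected columns of that row into a single vector.
item_means <- toy |>
  rowwise() |>
  mutate(mean_score = mean(c_across(starts_with("score_")), na.rm = TRUE)) |>
  ungroup()  # drop the row-wise structure before summarizing further

# Step 2: summarize the row means by group.
group_max <- item_means |>
  group_by(group) |>
  summarise(max_mean = max(mean_score))

group_max
```

If ungroup() were left out and summarise() were called directly on the row-wise data frame, the summary would be computed one row at a time rather than over the whole column, which is why the exercise asks you to undo rowwise() first.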