Chapter 2 Datasets
I have downloaded the relevant dataset put up by the authors. The following gives the list of datasets
<- data(package = 'resampledata')
dnames <- as_tibble(dnames$results[,c(3,4)])
df <- dim(df)[1]
n_datasets ::kable(df) knitr
Item | Title |
---|---|
Alelager | Calories and alcohol content for ales and lagers. |
Bangladesh | Bangladesh |
Batters2005 | Batters2005 |
Beerwings | Beer and hot wings consumption. |
BookPrices | Price of hardcover textbooks from mathematics and the natural sciences and the social sciences. |
Bushmeat | Bushmeat in Ghana. |
Bushmeat2 | Bushmeat in Ghana. |
Cameras | Prices of a sample of point-and-shoot digital cameras. |
Cereals | Information on various cereals. |
Challenger | Data on 23 Challenger flights. |
ChiMarathonMen | Data on Marathon times. |
Cuckoos | Cuckoos |
Diving2017 | Diving times in 2017 |
Fatalities | Random sample of 100 driver fatalities in 2009 in Pennsylvania. |
FishMercury | Mercury levels (parts per million) for 30 fish caught in lakes in Minnesota. |
FlightDelays | Information on 4029 United and American airlines departures from LGA during May and June 2009. |
GSS2002 | Results from 2002 General Society Survey. |
GSS2006 | Results from 2006 General Society Survey. |
Girls2004 | Random sample of 40 baby girls born in Alaska and 40 baby girls born in Wyoming. |
Groceries | Groceries. |
ILBoys | IL Boys |
IceCream | Calorie information for a sample of brands of chocolate and vanilla ice cream. |
Illiteracy | Data on a sample of countries where female illiteracy is more that 5 percent. |
Lottery | Winning numbers for the daily games from May 5, 2010 through August 15, 2010. |
MathAnxiety | Math Anxiety |
Maunaloa | Data on average CO2 levels (ppm) for the month of May from 1990 to 2010. |
MnGroundwater | Measurements on water quality of 895 randomly selected wells in Minnesota. |
MobileAds | Mobile Ads. |
NBA1617 | NBA 2016-2017 data. |
NCBirths2004 | Random sample of 1009 babies born in North Carolina during 2004. |
Nasdaq | NASDAQ Data. |
Olympics2012 | 2012 Olympics Data. |
Phillies2009 | Data from the 2009 season for the baseball team of the Philadelphia Phillies. |
Pitchers2005 | Pitchers2005 |
Quakes | Time between earthquakes (in days). |
Quetzal | Quetzal. |
RangersTwins2016 | Rangers/Twins 2016 Baseball Data. |
Reading | Children’s reading abilities. |
Recidivism | Recidivism data. |
Salaries | Salaries |
Sat2008 | Sat2008 |
Service | Service times (in minutes) for 174 customers at a college snack bar. |
Skateboard | Skateboarding data |
Skating2010 | Scores from the short program and free skate for men’s figure skating in the 2010 Olympics. |
Spruce | Study of factors affecting the growth of the black spruce. |
Starcraft | Starcraft |
TV | TV |
TXBirths2004 | Random sample of 1587 babies born in Texas in 2004. |
Titanic | Data on male passengers on the Titanic. |
Turbine | Wind Speeds (m/s) from Carleton College Turbine. |
Verizon | Random sample of repair times for 1664 ILEC and 23 CLEC customers. |
Volleyball2009 | Volleyball2009 |
Walleye | Length and weight measurements for a sample of 60 walleye caught in Minnesota lakes. |
Watertable | Relationship between seedling growth and water table depth for a sample of seedlings. |
corrExerciseA | Correlation Exercise A p.294 |
corrExerciseB | Correlation Exercise B p.294 |
manatees | manatees |
wafers | Wafers |
Hopefully once I work through 58 datasets, I will have a good enough
experience with revisting the resampling techniques in R
.