| Title: | A Comprehensive Collection of Penguin Datasets for Statistical Analysis and Experimental Design |
|---|---|
| Description: | Offers a comprehensive collection of penguin-related datasets suitable for descriptive statistics, hypothesis testing, and experimental design. Derived from open ecological and biological sources such as Palmer Station studies, the package integrates datasets covering adult morphology, clutch size, blood isotope composition, and heart rate. It is designed for researchers, students, and educators to explore statistical methods including ANOVA, regression, multivariate analysis, and design of experiments in an accessible and reproducible context. |
| Authors: | Juan Pablo Vargas Perez [aut, cre] |
| Maintainer: | Juan Pablo Vargas Perez <[email protected]> |
| License: | GPL-3 |
| Version: | 0.1.0 |
| Built: | 2026-06-01 08:01:06 UTC |
| Source: | https://github.com/jpablovargas340/penguinr |
This dataset, peng_df, is a data frame containing morphometric measurements for penguins observed near Palmer Station in the Palmer Archipelago, Antarctica. It includes data on three species across multiple islands, with variables for bill dimensions, flipper length, body mass, sex, and year of observation. Some observations may contain missing values.
data(peng_df)
A data frame with 333 observations and 8 variables:
Penguin species (factor with 3 levels).
Island of observation (factor with 3 levels).
Bill length in millimeters (numeric).
Bill depth in millimeters (numeric).
Flipper length in millimeters (integer).
Body mass in grams (integer).
Sex of the penguin (factor with 2 levels).
Year of observation (integer).
The data were originally sourced from the palmerpenguins dataset and are provided in the heplots package for illustrative and educational use in multivariate statistical analysis examples.
The dataset name has been kept as 'peng_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the PenguinR package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the heplots package version 1.7.8
Offers a comprehensive collection of penguin-related datasets suitable for descriptive statistics, hypothesis testing, and experimental design.
PenguinR: A Comprehensive Collection of Penguin Datasets for Statistical Analysis and Experimental Design
Maintainer: Juan Pablo Vargas Perez [email protected]
This dataset, penguins_df, is a data frame with measurements on adult penguins from three species and three islands in the Palmer Archipelago, Antarctica. Variables include flipper length, body mass, bill length and depth, sex, and year of observation. Some observations may contain missing values.
data(penguins_df)
A data frame with 344 observations and 8 variables:
Penguin species (factor with 3 levels).
Island of observation (factor with 3 levels).
Bill length in millimeters (numeric).
Bill depth in millimeters (numeric).
Flipper length in millimeters (integer).
Body mass in grams (integer).
Sex of the penguin (factor with 2 levels).
Year of observation (integer).
The columns in penguins_df are a subset of those in the more extensive penguins_raw data frame, which also includes nesting observations and blood isotope data. There are differences in column names and data types between the two datasets. See the Format section for details.
The dataset name has been kept as 'penguins_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the PenguinR package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the palmerpenguins package version 0.1.1
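Because some observations may contain missing values, it can help to count them per column before fitting a model. The following is a minimal sketch in base R, not part of the package: the helper name `na_summary` is hypothetical, and the package name passed to `data()` in the commented usage lines is an assumption based on the package title.

```r
# Hypothetical helper (not provided by the package):
# report the class and the number of missing values for each column.
na_summary <- function(df) {
  stopifnot(is.data.frame(df))
  data.frame(
    variable  = names(df),
    class     = unname(vapply(df, function(x) class(x)[1], character(1))),
    n_missing = unname(vapply(df, function(x) sum(is.na(x)), integer(1))),
    stringsAsFactors = FALSE
  )
}

# Usage, assuming the package is installed under the name "PenguinR":
# data(penguins_df, package = "PenguinR")
# na_summary(penguins_df)
# complete_rows <- penguins_df[complete.cases(penguins_df), ]
```

Dropping incomplete rows with `complete.cases()` is only one option; many modelling functions in R also accept an `na.action` argument.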
This dataset, penguins_raw_df, is a data frame containing comprehensive biological data for adult penguins observed near Palmer Station, Antarctica. It includes size measurements, clutch completion status, blood isotope values, and additional observational metadata. The dataset covers three penguin species from three islands in the Palmer Archipelago.
data(penguins_raw_df)
A data frame with 344 observations and 17 variables:
Name of the study or project.
Unique numeric sample identifier.
Species name (character).
Region of the Palmer Archipelago (character).
Island of observation (character).
Stage of breeding or observation period (character).
Unique penguin identifier (character).
Clutch completion status (character).
Date the egg was recorded (Date).
Bill length in millimeters (numeric).
Bill depth in millimeters (numeric).
Flipper length in millimeters (numeric).
Body mass in grams (numeric).
Sex of the penguin (character).
Nitrogen isotope ratio (per mil).
Carbon isotope ratio (per mil).
Additional notes and remarks (character).
The columns of penguins_raw_df extend those of the penguins_df dataset, adding details such as study name, individual ID, clutch information, isotope composition (δ15N and δ13C), and additional observational notes. Some variables include missing values.
The dataset name has been kept as 'penguins_raw_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the PenguinR package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the palmerpenguins package version 0.1.1
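Several columns that are factors in penguins_df are stored as character in penguins_raw_df, so grouped analyses may first need a type conversion. The sketch below shows one possible approach in base R; the helper name `chars_to_factors` is hypothetical and is not part of the package.

```r
# Hypothetical helper (not provided by the package):
# convert every character column of a data frame to a factor,
# leaving numeric, integer, and Date columns untouched.
chars_to_factors <- function(df) {
  stopifnot(is.data.frame(df))
  is_chr <- vapply(df, is.character, logical(1))
  df[is_chr] <- lapply(df[is_chr], factor)
  df
}

# Usage, assuming penguins_raw_df has been loaded with data():
# raw_f <- chars_to_factors(penguins_raw_df)
# str(raw_f)
```

This converts all character columns, including identifiers and free-text notes. For those it may be preferable to keep the character type and convert only the grouping variables of interest.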
This dataset, pinguinos_df, is a data frame containing the slopes of regressions of mass-specific metabolic rate on heart rate for three groups of Macaroni Penguins. The data summarize physiological relationships between metabolic and cardiac activity under different breeding and sex conditions.
data(pinguinos_df)
A data frame with 24 observations and 2 variables:
Penguin group (factor with 3 levels: "breeding female", "breeding male", "nonbreeding female").
Slope of the regression of mass-specific metabolic rate on heart rate (numeric).
The dataset includes one factor variable indicating the penguin group and one numeric variable representing the slope of the regression for each observation. Some observations may contain missing values.
The dataset name has been kept as 'pinguinos_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the PenguinR package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the abd package version 0.2-8
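With one grouping factor and one numeric response, this dataset suits a one-way ANOVA. The sketch below is illustrative only: the helper name `group_anova` is hypothetical, and it refers to columns by position (group first, slope second, as documented above) rather than by name, since the column names are an assumption here.

```r
# Hypothetical helper (not provided by the package):
# one-way ANOVA of the second column (numeric response)
# on the first column (grouping factor).
group_anova <- function(df) {
  stopifnot(is.data.frame(df), ncol(df) >= 2)
  d <- data.frame(group = factor(df[[1]]), slope = as.numeric(df[[2]]))
  d <- d[complete.cases(d), ]          # drop rows with missing values
  fit <- aov(slope ~ group, data = d)  # fit the one-way model
  list(
    means = tapply(d$slope, d$group, mean),  # mean response per group
    table = summary(fit)                     # ANOVA table
  )
}

# Usage, assuming pinguinos_df has been loaded with data():
# res <- group_anova(pinguinos_df)
# res$means
# res$table
```

A significant F test indicates only that at least one group mean differs. A follow-up such as `TukeyHSD()` on the fitted `aov` object can identify which pairs differ.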