| Title: | A Curated Collection of Digestive System and Gastrointestinal Disease Datasets |
|---|---|
| Description: | Provides an extensive and curated collection of datasets related to the digestive system, stomach, intestines, liver, pancreas, and associated diseases. This package includes clinical trials, observational studies, experimental datasets, cohort data, and case series involving gastrointestinal disorders such as gastritis, ulcers, pancreatitis, liver cirrhosis, colon cancer, colorectal conditions, Helicobacter pylori infection, irritable bowel syndrome, intestinal infections, and post-surgical outcomes. The datasets support educational, clinical, and research applications in gastroenterology, public health, epidemiology, and biomedical sciences. Designed for researchers, clinicians, data scientists, students, and educators interested in digestive diseases, the package facilitates reproducible analysis, modeling, and hypothesis testing using real-world and historical data. |
| Authors: | Renzo Caceres Rossi [aut, cre] (ORCID: <https://orcid.org/0009-0005-0744-854X>) |
| Maintainer: | Renzo Caceres Rossi <[email protected]> |
| License: | GPL-3 |
| Version: | 0.2.0 |
| Built: | 2026-06-03 09:45:02 UTC |
| Source: | https://github.com/lightbluetitan/digestivedatasets |
This dataset, anorexia_weight_change_df, is a data frame containing weight change data for young female anorexia patients. It includes pre- and post-treatment weights, along with the type of treatment administered.
data(anorexia_weight_change_df)data(anorexia_weight_change_df)
A data frame with 72 observations and 3 variables:
Factor indicating the treatment type (3 levels)
Numeric vector indicating the patient's weight before treatment (in kilograms)
Numeric vector indicating the patient's weight after treatment (in kilograms)
The dataset name has been kept as 'anorexia_weight_change_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the MASS package version 7.3-65.
This dataset, bleeding_ulcers_df, is a data frame containing data from 40 experiments designed to compare a new surgery for stomach ulcer with an older surgery.
data(bleeding_ulcers_df)data(bleeding_ulcers_df)
A data frame with 80 observations and 9 variables:
Factor indicating the author of the study (20 levels)
Integer indicating the year of the study
Integer representing the quality score of the experiment
Integer indicating the age of the patients
Integer indicating the number of recurrent bleeds
Integer indicating the total number of patients
Integer indicating bleeding events
Factor indicating treatment type (6 levels)
Factor representing the experiment table (40 levels)
The dataset name has been kept as 'bleeding_ulcers_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the SMPracticals package version 1.4-3.1.
This dataset, campylobacter_infections_ts, is a time series object containing the number of cases of campylobacter infections in northern Quebec (Canada), recorded in four-week intervals from January 1990 to October 2000. Campylobacterosis is an acute bacterial infectious disease attacking the digestive system.
data(campylobacter_infections_ts)data(campylobacter_infections_ts)
A time series object ('ts') with 140 observations:
c(1990, 1)
c(2000, 10)
13 (observations per year)
The dataset name has been kept as 'campylobacter_infections_ts' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'ts' indicates that the dataset is a time series object. The original content has not been modified in any way.
Data taken from the tscount package version 1.4.3. Original source: Ferland, R., Latour, A. and Oraichi, D., "Integer-valued GARCH process". Journal of Time Series Analysis, 2006; 27(6): 923–942.
This dataset, cholera_deaths_1849_tbl_df, is a tibble containing daily deaths from Cholera and Diarrhaea in England for each day of the 12 months of 1849. It includes the month, cause of death, day of month, number of deaths, date, and day of week for each observation.
data(cholera_deaths_1849_tbl_df)data(cholera_deaths_1849_tbl_df)
A tibble with 730 observations and 6 variables:
Character indicating the month of observation
Factor with 2 levels indicating cause of death (Cholera or Diarrhaea)
Character indicating the day of the month
Numeric value indicating the number of deaths
Date object indicating the exact date
Ordered factor with 7 levels indicating the day of week
The dataset name has been kept as 'cholera_deaths_1849_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Data taken from the HistData package version 0.9-3. Original source: Bingham P., Verlander, N. Q., Cheal M. J. (2004). "John Snow, William Farr and the 1849 outbreak of cholera that affected London: a reworking of the data highlights the importance of the water supply". Public Health, 118(6), 387–394, Table 2.
This dataset, colon_stageBC_chemo_df, is a data frame containing data from one of the first successful trials of adjuvant chemotherapy for stage B/C colon cancer. The dataset includes 1858 observations (with two records per patient: one for recurrence and one for death) and 16 clinical variables.
data(colon_stageBC_chemo_df)data(colon_stageBC_chemo_df)
A data frame with 1858 observations and 16 variables:
Numeric patient identifier
Numeric study code
Factor with 3 levels indicating treatment group
Numeric gender code
Numeric age in years
Numeric obstruction status
Numeric perforation status
Numeric adhesion status
Numeric count of lymph nodes
Numeric event status
Numeric differentiation grade
Numeric tumor extent
Numeric surgery code
Numeric node4 status
Numeric follow-up time
Numeric event type
The dataset name has been kept as 'colon_stageBC_chemo_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the OncoDataSets package version 0.1.0.
This dataset, colonoscopy_features_tbl_df, is a tibble containing features extracted from 76 colonoscopic videos. Each video was recorded using both White Light (WL) and Narrow Band Imaging (NBI). The dataset includes histology results (classification ground truth), the opinion of endoscopists (4 experts and 3 beginners), and 698 features derived from patients with gastrointestinal lesions.
data(colonoscopy_features_tbl_df)data(colonoscopy_features_tbl_df)
A tibble with 76 observations and 7 variables:
Numeric feature extracted from colonoscopic videos
Numeric feature extracted from colonoscopic videos
Numeric feature extracted from colonoscopic videos
Numeric feature extracted from colonoscopic videos
Numeric score representing agreement among endoscopists
Numeric indicator for missing labels
Character string representing the histology-based classification
The dataset name has been kept as 'colonoscopy_features_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Data taken from the gmmsslm package version 1.1.6.
This dataset, crc_mirnas_pubmed_tbl_df, is a tibble containing information from PubMed abstracts related to microRNAs (miRNAs) in colorectal cancer. The data provides publication metadata, article abstracts, and associated miRNAs across 508 observations with 8 variables.
data(crc_mirnas_pubmed_tbl_df)data(crc_mirnas_pubmed_tbl_df)
A tibble with 508 observations and 8 variables:
Numeric PubMed identifier
Numeric publication year
Character article title
Character full abstract text
Character publication language
Character article type
Character research topic
Character microRNA identifiers
The dataset name has been kept as 'crc_mirnas_pubmed_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Data taken from the OncoDataSets package version 0.1.0.
This dataset, cystic_fibrosis_snps_df, is a data frame containing genetic association data for cystic fibrosis, including a case-control indicator and 23 single nucleotide polymorphisms (SNPs) with specified inter-marker distances. The dataset contains 186 observations across 24 variables.
data(cystic_fibrosis_snps_df)data(cystic_fibrosis_snps_df)
A data frame with 186 observations and 24 variables:
Integer case-control indicator
Integer SNP genotype at location 1
Integer SNP genotype at location 2
Integer SNP genotype at location 3
Integer SNP genotype at location 4
Integer SNP genotype at location 5
Integer SNP genotype at location 6
Integer SNP genotype at location 7
Integer SNP genotype at location 8
Integer SNP genotype at location 9
Integer SNP genotype at location 10
Integer SNP genotype at location 11
Integer SNP genotype at location 12
Integer SNP genotype at location 13
Integer SNP genotype at location 14
Integer SNP genotype at location 15
Integer SNP genotype at location 16
Integer SNP genotype at location 17
Integer SNP genotype at location 18
Integer SNP genotype at location 19
Integer SNP genotype at location 20
Integer SNP genotype at location 21
Integer SNP genotype at location 22
Integer SNP genotype at location 23
The dataset name has been kept as 'cystic_fibrosis_snps_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the gap.datasets package version 0.0.6. Original source: Liu JS, Sabatti C, Teng J, Keats BJB, Risch N (2001). "Bayesian Analysis of Haplotypes for Linkage Disequilibrium Mapping". Genome Research, 11:1716–1724.
This dataset, digestive_cancer_survival_df, is a data frame containing survival times (in days) of cancer patients with advanced cancer of the stomach, bronchus, colon, ovary, or breast. All patients included in this dataset received treatment that involved supplemental ascorbate.
data(digestive_cancer_survival_df)data(digestive_cancer_survival_df)
A data frame with 17 observations and 5 variables:
Integer values indicating survival times (in days) for patients with stomach cancer
Integer values indicating survival times (in days) for patients with bronchial cancer
Integer values indicating survival times (in days) for patients with colon cancer
Integer values indicating survival times (in days) for patients with ovarian cancer
Integer values indicating survival times (in days) for patients with breast cancer
The dataset name has been kept as 'digestive_cancer_survival_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the RbyExample package version 0.0.100.
This package provides a wide variety of datasets focused on the digestive system, stomach, intestines, liver, pancreas, and associated diseases, including clinical trials, observational studies, experimental datasets, cohort data, and case series involving gastrointestinal disorders such as gastritis, ulcers, pancreatitis, liver cirrhosis, colon cancer, colorectal conditions, Helicobacter pylori infection, irritable bowel syndrome, intestinal infections, and post-surgical outcomes.
DigestiveDataSets: A Curated Collection of Digestive System and Gastrointestinal Disease Datasets
A Curated Collection of Digestive System and Gastrointestinal Disease Datasets.
Maintainer: Renzo Caceres Rossi [email protected]
Useful links:
This dataset, ecoli_infections_df, is a data frame containing the weekly number of reported disease cases caused by Escherichia coli in the state of North Rhine-Westphalia (Germany) from January 2001 to May 2013, excluding cases of EHEC and HUS.
data(ecoli_infections_df)data(ecoli_infections_df)
A data frame with 646 observations and 3 variables:
Numeric value indicating the year of observation
Numeric value indicating the week of observation
Numeric value indicating the number of reported E. coli cases
The dataset name has been kept as 'ecoli_infections_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the tscount package version 1.4.3.
This dataset, gastric_cancer_trial_df, is a data frame containing data from a randomized clinical trial conducted by the Gastrointestinal Tumor Study Group on patients with gastric cancer. It includes survival time, event occurrence, and group assignment.
data(gastric_cancer_trial_df)data(gastric_cancer_trial_df)
A data frame with 90 observations and 3 variables:
Numeric vector representing survival time
Numeric vector indicating event occurrence (e.g., death or relapse)
Factor with 2 levels representing treatment groups
The dataset name has been kept as 'gastric_cancer_trial_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the package coin version 1.4-3.
This dataset, gi_damage_prevention_df, is a data frame containing results from four randomised clinical trials on the prevention of gastrointestinal damages by Misoprostol, reported by Lanza et al. (1987–1989).
data(gi_damage_prevention_df)data(gi_damage_prevention_df)
A data frame with 198 observations and 3 variables:
Factor indicating the clinical trial (4 levels)
Factor indicating the treatment group (2 levels: control or Misoprostol)
Ordered factor indicating the degree of gastrointestinal damage (5 levels)
The dataset name has been kept as 'gi_damage_prevention_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the HSAUR3 package version 1.0-15.
This dataset, helicobacter_children_tbl_df, is a tibble containing the prevalence of Helicobacter pylori infection in preschool children according to parental history of duodenal or gastric ulcer.
data(helicobacter_children_tbl_df)data(helicobacter_children_tbl_df)
A tibble with 863 observations and 2 variables:
Factor with 2 levels indicating parental history of duodenal or gastric ulcer
Factor with 2 levels indicating Helicobacter pylori infection status
The dataset name has been kept as 'helicobacter_children_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Data taken from the package pubh version 2.0.0.
This dataset, horse_colic_surgery_df, is a data frame containing clinical observations of horses with colic, where the primary task is to determine if the lesion requires surgery. The data consists of 300 cases with 31 clinical variables, modified from the original UCI repository version with adjusted factor levels.
data(horse_colic_surgery_df)data(horse_colic_surgery_df)
A data frame with 300 observations and 31 variables:
Factor with 2 levels indicating surgical requirement
Factor with 1 level (age group)
Integer hospital identifier
Numeric rectal temperature
Numeric pulse rate
Numeric respiratory rate
Factor with 4 levels (temperature extremes)
Factor with 4 levels (peripheral pulse)
Factor with 3 levels (capillary refill time)
Numeric pain score
Numeric peristalsis measure
Numeric distension score
Numeric tube measure
Numeric reflux quantity
Numeric reflux pH
Numeric exam result
Numeric abdomen assessment
Numeric cell volume
Numeric protein level
Numeric appearance score
Numeric protein measure
Factor with 3 levels (outcome status)
Factor with 2 levels (lesion type)
Factor with 60 levels (primary lesion type)
Integer secondary lesion code
Integer tertiary lesion code
Factor with 2 levels (CP data)
Ordered factor with 4 levels (temperature)
Numeric temperature measure
Factor with 6 levels (membrane color)
Factor with 5 levels (membrane group)
The dataset name has been kept as 'horse_colic_surgery_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way beyond factor level adjustments.
Data taken from the VIM package version 6.2.2 (originally from UCI repository).
This dataset, ibs_cam_trials_df, is a data frame containing results from 19 clinical trials examining complementary and alternative medicine (CAM) interventions for irritable bowel syndrome (IBS). The dataset includes 12 variables characterizing each trial and its outcomes.
data(ibs_cam_trials_df)data(ibs_cam_trials_df)
A data frame with 19 observations and 12 variables:
Integer trial identifier
Character study name/location
Integer publication year
Character country where study was conducted
Character IBS diagnostic criteria used
Integer study duration in days
Integer number of study visits
Integer Jadad score for study quality
Integer active treatment events
Integer active treatment sample size
Integer placebo group events
Integer placebo group sample size
The dataset name has been kept as 'ibs_cam_trials_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the metadat package version 1.4-0.
This dataset, intestinal_smartpill_df, is a data frame from a prospective cohort study evaluating gastric emptying, small bowel transit time, and total intestinal transit time using a SmartPill motility capsule. The study involved 8 critically ill trauma patients and 87 healthy volunteers. The capsule wirelessly transmitted pH, pressure, and temperature to a recorder attached to each subject's abdomen.
data(intestinal_smartpill_df)data(intestinal_smartpill_df)
A data frame with 95 observations and 22 variables:
Numeric indicator of group membership
Numeric indicator of gender
Numeric code indicating racial background
Height in centimeters
Weight in kilograms
Age in years
Gastric emptying time (minutes)
Small bowel transit time (minutes)
Colon transit time (minutes)
Whole gut transit time (minutes)
Number of contractions in the stomach
Sum of contraction amplitudes in the stomach
Mean peak amplitude in the stomach
Mean pH level in the stomach
Number of contractions in the small bowel
Sum of contraction amplitudes in the small bowel
Mean peak amplitude in the small bowel
Mean pH level in the small bowel
Number of contractions in the colon
Sum of contraction amplitudes in the colon
Mean peak amplitude in the colon
Mean pH level in the colon
The dataset name has been kept as 'intestinal_smartpill_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the medicaldata package version 0.2.0. Original source: Rauch et al., "Use of Wireless Utility Capsule to Determine Gastric Emptying and Small Intestinal Transit Times in Critically Ill Trauma Patients". Journal of Critical Care, 2012; 27(5): 534.e7–534.e12.
This dataset, intestinal_surgery_df, is a data frame containing intestinal surgery data from 844 cancer patients. The data consists of pairs (n_i, s_i) where n_i is the number of satellites removed and s_i is the number of satellites found to be malignant.
data(intestinal_surgery_df)data(intestinal_surgery_df)
A data frame with 844 observations and 2 variables:
Numeric value representing the number of satellites removed
Numeric value representing the number of malignant satellites found
The dataset name has been kept as 'intestinal_surgery_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the deconvolveR package version 1.2-1. Original source: Efron, B. (2016). "Empirical Bayes deconvolution estimates". Biometrika, 103(1), 1–20.
This dataset, liver_cirrhosis_prednisone_df, is a data frame containing data from a randomized control trial comparing prednisone (n=251) versus placebo (n=237) in 488 liver cirrhosis patients. The dataset includes both survival and longitudinal measurements of prothrombin index development over time, with 2968 total observations across 9 variables.
data(liver_cirrhosis_prednisone_df)data(liver_cirrhosis_prednisone_df)
A data frame with 2968 observations and 9 variables:
Integer patient identifier
Numeric time measurement
Integer death indicator
Numeric observation time
Integer prothrombin index value
Factor with 2 levels indicating treatment group (prednisone/placebo)
Numeric start time
Numeric stop time
Numeric event indicator
The dataset name has been kept as 'liver_cirrhosis_prednisone_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the JSM package version 1.0.1.
This dataset, lynch_ontario_families_df, is a data frame containing data from 32 Lynch Syndrome families segregating mismatch repair mutations selected from the Ontario Familial Colorectal Cancer Registry. The dataset includes 765 individuals (both probands and relatives) with 11 variables per observation.
data(lynch_ontario_families_df)data(lynch_ontario_families_df)
A data frame with 765 observations and 11 variables:
Integer family identifier
Integer individual identifier
Integer father's identifier
Integer mother's identifier
Integer gender code
Integer disease status
Integer time variable
Integer current age
Integer mutation gene status
Integer proband indicator
Integer relationship code
The dataset name has been kept as 'lynch_ontario_families_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the FamEvent package version 3.2.
This dataset, norovirus_derbyshire_df, is a data frame describing an outbreak of norovirus in the summer of 2001 in a primary school and nursery in Derbyshire, England. It contains 492 observations across 5 variables tracking illness patterns among students.
data(norovirus_derbyshire_df)data(norovirus_derbyshire_df)
A data frame with 492 observations and 5 variables:
Factor with 15 levels representing school classes
Integer day of absence
Integer day when illness started
Integer day when illness ended
Integer day when vomiting occurred
The dataset name has been kept as 'norovirus_derbyshire_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the outbreaks package version 1.9.0. Original source: O'Neill and Marks (2005).
This dataset, pancreatic_cancer_df, is a data frame containing data from a Phase II clinical trial of patients with locally advanced or metastatic pancreatic cancer. It includes time-to-event data for disease progression and death, as well as staging information.
data(pancreatic_cancer_df)data(pancreatic_cancer_df)
A data frame with 41 observations and 4 variables:
Factor indicating disease stage (locally advanced or metastatic)
Factor indicating time (in days) from enrollment
Factor indicating time (in days) to disease progression
Factor indicating time (in days) to death
The dataset name has been kept as 'pancreatic_cancer_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the asaur package version 0.50.
This dataset, pbc_mayo_survival_df, is a data frame containing data from a randomized control trial conducted at Mayo Clinic from 1974 to 1984, studying the progression of primary biliary cirrhosis. The dataset includes both survival and longitudinal measurements with 1945 observations across 16 clinical variables.
data(pbc_mayo_survival_df)data(pbc_mayo_survival_df)
A data frame with 1945 observations and 16 variables:
Integer patient identifier
Numeric time measurement
Numeric death indicator
Numeric observation time
Numeric serum bilirubin measurement
Numeric serum albumin measurement
Integer alkaline phosphatase level
Integer platelet count
Factor with 2 levels indicating treatment group
Numeric age in years
Factor with 2 levels indicating patient sex
Factor with 2 levels indicating presence of ascites
Factor with 2 levels indicating presence of hepatomegaly
Numeric start time for interval
Numeric stop time for interval
Numeric event indicator
The dataset name has been kept as 'pbc_mayo_survival_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the JSM package version 1.0.1.
This dataset, post_ercp_pancreatitis_tbl_df, is a tibble containing results from a randomized, placebo-controlled, prospective 2-arm trial of rectal indomethacin (100 mg) versus placebo to prevent post-ERCP pancreatitis in 602 participants, as reported by Elmunzer, Higgins, et al. (2012) in the New England Journal of Medicine.
data(post_ercp_pancreatitis_tbl_df)data(post_ercp_pancreatitis_tbl_df)
A tibble with 602 observations and 33 variables:
Numeric subject identifier
Factor indicating study site (4 levels)
Numeric age of the participant
Numeric risk score
Factor indicating gender (2 levels)
Factor indicating study outcome (2 levels)
Factor indicating presence of sphincter of Oddi dysfunction (2 levels)
Factor indicating presence of post-ERCP pancreatitis (2 levels)
Factor indicating recurrent pancreatitis (2 levels)
Factor indicating pancreatic sphincterotomy (2 levels)
Factor indicating precut sphincterotomy (2 levels)
Factor indicating difficult cannulation (2 levels)
Factor indicating pneumatic dilation (2 levels)
Factor indicating ampullary interventions (2 levels)
Factor indicating pancreatic injury (2 levels)
Factor indicating acinarization (2 levels)
Factor indicating brushing procedures (2 levels)
Factor indicating ASA 81 mg use (3 levels)
Factor indicating ASA 325 mg use (3 levels)
Factor indicating ASA status (3 levels)
Factor indicating prophylactic stent placement (2 levels)
Factor indicating therapeutic stent use (2 levels)
Factor indicating pancreatic duct stent (2 levels)
Factor indicating somatostatin use for SOD (2 levels)
Factor indicating biliary sphincterotomy (2 levels)
Factor indicating biliary stent (2 levels)
Factor indicating cholecystectomy (2 levels)
Factor indicating presence of pancreaticobiliary malignancy (2 levels)
Factor indicating if performed by trainee (2 levels)
Factor indicating trial status (2 levels)
Factor indicating procedure type (4 levels)
Factor indicating treatment group: placebo or indomethacin (2 levels)
Numeric bleeding indicator
The dataset name has been kept as 'post_ercp_pancreatitis_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Data taken from the medicaldata package version 0.2.0.
This dataset, ugi_bleeding_df, is a data frame containing results from 27 studies examining the effectiveness of histamine H2 antagonists (cimetidine or ranitidine) in treating acute upper gastrointestinal hemorrhage, with 14 variables per study.
data(ugi_bleeding_df)data(ugi_bleeding_df)
A data frame with 27 observations and 14 variables:
Integer study identifier
Character trial name/location
Integer publication year
Integer reference number
Character treatment description
Character control description
Integer treatment group sample size
Integer treatment group bleeding events
Integer treatment group other events
Integer treatment group deaths
Integer control group sample size
Integer control group bleeding events
Integer control group other events
Integer control group deaths
The dataset name has been kept as 'ugi_bleeding_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the metadat package version 1.4-0.
This function lists all datasets available in the 'DigestiveDataSets' package. If the 'DigestiveDataSets' package is not loaded, it stops and shows an error message. If no datasets are available, it returns a message and an empty vector.
view_datasets_DigestiveDataSets()view_datasets_DigestiveDataSets()
A character vector with the names of the available datasets. If no datasets are found, it returns an empty character vector.
if (requireNamespace("DigestiveDataSets", quietly = TRUE)) { library(DigestiveDataSets) view_datasets_DigestiveDataSets() }if (requireNamespace("DigestiveDataSets", quietly = TRUE)) { library(DigestiveDataSets) view_datasets_DigestiveDataSets() }
This dataset, weight_loss_df, is a data frame containing the weight, in kilograms, of an obese patient measured at 52 time points over an 8-month period as part of a weight rehabilitation programme.
data(weight_loss_df)data(weight_loss_df)
A data frame with 52 observations and 2 variables:
Integer vector indicating the number of days since the beginning of the programme
Numeric vector indicating the weight (in kilograms) of the patient at each time point
The dataset name has been kept as 'weight_loss_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the MASS package version 7.3-65.