| Title: | Access Japanese Data via Public APIs and Curated Datasets |
|---|---|
| Description: | Provides functions to access data from public RESTful APIs including 'Nager.Date', 'World Bank API', and 'REST Countries API', retrieving real-time or historical data related to Japan, such as holidays, economic indicators, and international demographic and geopolitical indicators. Additionally, the package includes one of the largest curated collections of open datasets focused on Japan, covering topics such as natural disasters, economic production, vehicle industry, air quality, demographics, and administrative divisions. The package supports reproducible research and teaching by integrating reliable international APIs and structured datasets from public, academic, and government sources. For more information on the APIs, see: 'Nager.Date' <https://date.nager.at/Api>, 'World Bank API' <https://datahelpdesk.worldbank.org/knowledgebase/articles/889392>, and 'REST Countries API' <https://restcountries.com/>. |
| Authors: | Renzo Caceres Rossi [aut, cre] (ORCID: <https://orcid.org/0009-0005-0744-854X>) |
| Maintainer: | Renzo Caceres Rossi <[email protected]> |
| License: | MIT + file LICENSE |
| Version: | 0.1.1 |
| Built: | 2026-05-27 06:56:57 UTC |
| Source: | https://github.com/lightbluetitan/japanapis |
This dataset, atomic_bomb_survivors_df, is a data frame containing frequencies of cancer deaths among Japanese atomic bomb survivors, categorized by extent of exposure, years after exposure, and age group. The dataset was used in the journal Statistical Sleuth and analyzed by Gore et al. (2006).
data(atomic_bomb_survivors_df)data(atomic_bomb_survivors_df)
A data frame with 84 observations and 4 variables:
Radiation exposure level (integer)
Type of count (factor)
Age group at time of observation (factor)
Frequency of cancer deaths (integer)
The dataset name has been kept as 'atomic_bomb_survivors_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the ACSWR package version 1.0
This dataset, centenarian_df, is a data frame containing the number of deaths at each year from 1963 to 1980 for Japanese male centenarians. The data is based on Table 2 of Emura and Murotani (2015), and also references the original work of Sibuya & Hanayama (2004).
data(centenarian_df)data(centenarian_df)
A data frame with 21 observations and 19 variables:
Age or identifier (numeric)
Number of deaths in 1963 (integer)
Number of deaths in 1964 (integer)
Number of deaths in 1965 (integer)
Number of deaths in 1966 (integer)
Number of deaths in 1967 (integer)
Number of deaths in 1968 (integer)
Number of deaths in 1969 (integer)
Number of deaths in 1970 (integer)
Number of deaths in 1971 (integer)
Number of deaths in 1972 (integer)
Number of deaths in 1973 (integer)
Number of deaths in 1974 (integer)
Number of deaths in 1975 (integer)
Number of deaths in 1976 (integer)
Number of deaths in 1977 (integer)
Number of deaths in 1978 (integer)
Number of deaths in 1979 (integer)
Number of deaths in 1980 (integer)
The dataset name has been kept as 'centenarian_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the depend.truncation package version 3.0
This dataset, earthquake_station_sf, is a spatial features (sf) tibble containing information about 671 earthquake observation stations managed by the Japan Meteorological Agency. It includes details such as the prefecture, area, station name, address, and observation period.
data(earthquake_station_sf)data(earthquake_station_sf)
An sf object with 671 observations and 7 variables:
Name of the prefecture (character)
Area within the prefecture (character)
Name of the observation station (character)
Physical address of the station (character)
Start date of observation (character)
End date of observation, if available (character)
XY coordinates of the station (sf geometry column)
The dataset name has been kept as 'earthquake_station_sf' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'sf' indicates that the dataset is a spatial features object. The original content has not been modified in any way.
Data taken from the jmastats package version 0.3.0
Retrieves selected, essential information about Japan using the REST Countries API. The function returns a tibble with core details such as population, area, capital, region, and official language(s).
See the API documentation at https://restcountries.com/. Example API usage: https://restcountries.com/v3.1/name/japan?fullText=true.
get_country_info_jp()get_country_info_jp()
The function sends a GET request to the REST Countries API. If the API returns data for Japan,
the function extracts and returns selected fields as a tibble. If the request fails or
Japan is not found, it returns NULL and prints an informative message.
A tibble with the following 8 columns, or NULL if the API is unavailable:
name_common: Common name of the country.
name_official: Official name of the country.
region: Geographical region.
subregion: Subregion within the continent.
capital: Capital city.
area: Area in square kilometers.
population: Population of the country.
languages: Languages spoken in the country, as a comma-separated string.
Requires internet connection. The data is retrieved in real time from the REST Countries API.
REST Countries API: https://restcountries.com/
# Requires internet connection result <- get_country_info_jp() if (!is.null(result)) { print(result) }# Requires internet connection result <- get_country_info_jp() if (!is.null(result)) { print(result) }
Retrieves Japan's under-five mortality rate (per 1,000 live births)
for the years 2010 to 2022 using the World Bank Open Data API.
The indicator used is SH.DYN.MORT.
get_japan_child_mortality()get_japan_child_mortality()
This function sends a GET request to the World Bank API.
If the API request fails or returns an error status code,
the function returns NULL with an informative message.
A tibble with the following columns, or NULL if the API is unavailable:
indicator: Indicator name (e.g., "Mortality rate, under-5 (per 1,000 live births)")
country: Country name ("Japan")
year: Year of the data (integer)
value: Under-5 mortality rate per 1,000 live births (numeric)
Requires internet connection.
World Bank Open Data API: https://data.worldbank.org/indicator/SH.DYN.MORT
# Requires internet connection result <- get_japan_child_mortality() if (!is.null(result)) { print(result) }# Requires internet connection result <- get_japan_child_mortality() if (!is.null(result)) { print(result) }
Retrieves Japan's Consumer Price Index (2010 = 100)
for the years 2010 to 2022 using the World Bank Open Data API.
The indicator used is FP.CPI.TOTL.
get_japan_cpi()get_japan_cpi()
The function sends a GET request to the World Bank API.
If the API request fails or returns an error status code, the function returns NULL with an informative message.
A tibble with the following columns, or NULL if the API is unavailable:
indicator: Indicator name (e.g., "Consumer price index (2010 = 100)")
country: Country name ("Japan")
year: Year of the data (integer)
value: Consumer Price Index value in numeric form
Requires internet connection. The data is retrieved in real time from the World Bank API.
World Bank Open Data API: https://data.worldbank.org/indicator/FP.CPI.TOTL
# Requires internet connection result <- get_japan_cpi() if (!is.null(result)) { print(result) }# Requires internet connection result <- get_japan_cpi() if (!is.null(result)) { print(result) }
Retrieves Japan's energy use per capita, measured in kilograms of oil equivalent,
for the years 2010 to 2022 using the World Bank Open Data API.
The indicator used is EG.USE.PCAP.KG.OE.
get_japan_energy_use()get_japan_energy_use()
This function sends a GET request to the World Bank API.
If the API request fails or returns an error status code,
the function returns NULL with an informative message.
A tibble with the following columns, or NULL if the API is unavailable:
indicator: Indicator name (e.g., "Energy use (kg of oil equivalent per capita)")
country: Country name ("Japan")
year: Year of the data (integer)
value: Energy use in kilograms of oil equivalent per capita
Requires internet connection.
World Bank Open Data API: https://data.worldbank.org/indicator/EG.USE.PCAP.KG.OE
# Requires internet connection result <- get_japan_energy_use() if (!is.null(result)) { print(result) }# Requires internet connection result <- get_japan_energy_use() if (!is.null(result)) { print(result) }
Retrieves Japan's Gross Domestic Product (GDP) in current US dollars
for the years 2010 to 2022 using the World Bank Open Data API.
The indicator used is NY.GDP.MKTP.CD.
get_japan_gdp()get_japan_gdp()
The function sends a GET request to the World Bank API.
If the API request fails or returns an error status code, the function returns NULL with an informative message.
A tibble with the following columns, or NULL if the API is unavailable:
indicator: Indicator name (e.g., "GDP (current US$)")
country: Country name ("Japan")
year: Year of the data (integer)
value: GDP value in numeric form
value_label: Formatted GDP value (e.g., "1,466,464,899,304")
Requires internet connection. The data is retrieved in real time from the World Bank API.
World Bank Open Data API: https://data.worldbank.org/indicator/NY.GDP.MKTP.CD
GET, fromJSON, as_tibble, comma
# Requires internet connection result <- get_japan_gdp() if (!is.null(result)) { print(result) }# Requires internet connection result <- get_japan_gdp() if (!is.null(result)) { print(result) }
Retrieves the list of official public holidays in Japan for a specific year using the Nager.Date public holidays API. This function returns a tibble containing the date of the holiday, the name in the local language (Japanese), and the English name. It is useful for academic, planning, and data analysis purposes. The information is retrieved directly from the Nager.Date API and reflects the current status of holidays for the requested year. The field names returned are consistent with the API structure.
get_japan_holidays(year)get_japan_holidays(year)
year |
An integer indicating the year (e.g., 2024 or 2025). |
A tibble with the following columns, or NULL if the API is unavailable:
date: Date of the holiday (class Date)
local_name: Holiday name in the local language (Japanese)
name: Holiday name in English
Data obtained from the Nager.Date API: https://date.nager.at/
# Requires internet connection result <- get_japan_holidays(2024) if (!is.null(result)) { print(result) } result <- get_japan_holidays(2025) if (!is.null(result)) { print(result) }# Requires internet connection result <- get_japan_holidays(2024) if (!is.null(result)) { print(result) } result <- get_japan_holidays(2025) if (!is.null(result)) { print(result) }
Retrieves data on the number of hospital beds per 1,000 people in Japan
from 2010 to 2022 using the World Bank Open Data API.
The indicator used is SH.MED.BEDS.ZS.
get_japan_hospital_beds()get_japan_hospital_beds()
This function sends a GET request to the World Bank API.
If the API request fails or returns an error status code,
the function returns NULL with an informative message.
A tibble with the following columns, or NULL if the API is unavailable:
indicator: Indicator name (e.g., "Hospital beds (per 1,000 people)")
country: Country name ("Japan")
year: Year of the data (integer)
value: Hospital beds per 1,000 people (numeric)
Requires internet connection.
World Bank Open Data API: https://data.worldbank.org/indicator/SH.MED.BEDS.ZS
# Requires internet connection result <- get_japan_hospital_beds() if (!is.null(result)) { print(result) }# Requires internet connection result <- get_japan_hospital_beds() if (!is.null(result)) { print(result) }
Retrieves Japan's life expectancy at birth (in years) for the years 2010 to 2022
using the World Bank Open Data API. The indicator used is SP.DYN.LE00.IN.
get_japan_life_expectancy()get_japan_life_expectancy()
The function sends a GET request to the World Bank API.
If the API request fails or returns an error status code, the function returns NULL with an informative message.
A tibble with the following columns, or NULL if the API is unavailable:
indicator: Indicator name (e.g., "Life expectancy at birth, total (years)")
country: Country name ("Japan")
year: Year of the data (integer)
value: Life expectancy value in numeric form (years)
Requires internet connection. The data is retrieved in real time from the World Bank API.
World Bank Open Data API: https://data.worldbank.org/indicator/SP.DYN.LE00.IN
# Requires internet connection result <- get_japan_life_expectancy() if (!is.null(result)) { print(result) }# Requires internet connection result <- get_japan_life_expectancy() if (!is.null(result)) { print(result) }
Retrieves Japan's literacy rate for adults aged 15 and above,
expressed as a percentage, for the years 2010 to 2022 using the World Bank Open Data API.
The indicator used is SE.ADT.LITR.ZS.
get_japan_literacy_rate()get_japan_literacy_rate()
The function sends a GET request to the World Bank API.
If the API request fails or returns an error status code, the function returns NULL with an informative message.
A tibble with the following columns, or NULL if the API is unavailable:
indicator: Indicator name (e.g., "Literacy rate, adult total (
country: Country name ("Japan")
year: Year of the data (integer)
value: Literacy rate as numeric percentage
Requires internet connection. The data is retrieved in real time from the World Bank API. Note that literacy rate data for Japan may not be available for all years in the World Bank database.
World Bank Open Data API: https://data.worldbank.org/indicator/SE.ADT.LITR.ZS
# Requires internet connection result <- get_japan_literacy_rate() if (!is.null(result)) { print(result) }# Requires internet connection result <- get_japan_literacy_rate() if (!is.null(result)) { print(result) }
Retrieves Japan's total population for the years 2010 to 2022
using the World Bank Open Data API. The indicator used is SP.POP.TOTL.
get_japan_population()get_japan_population()
The function sends a GET request to the World Bank API.
If the API request fails or returns an error status code, the function returns NULL with an informative message.
A tibble with the following columns, or NULL if the API is unavailable:
indicator: Indicator name (e.g., "Population, total")
country: Country name ("Japan")
year: Year of the data (integer)
value: Population as a numeric value
value_label: Formatted population with commas (e.g., "126,000,000")
Requires internet connection. The data is retrieved in real time from the World Bank API.
World Bank Open Data API: https://data.worldbank.org/indicator/SP.POP.TOTL
GET, fromJSON, as_tibble, comma
# Requires internet connection result <- get_japan_population() if (!is.null(result)) { print(result) }# Requires internet connection result <- get_japan_population() if (!is.null(result)) { print(result) }
Retrieves Japan's Unemployment, total (
for the years 2010 to 2022 using the World Bank Open Data API.
The indicator used is SL.UEM.TOTL.ZS.
get_japan_unemployment()get_japan_unemployment()
The function sends a GET request to the World Bank API.
If the API request fails or returns an error status code, the function returns NULL with an informative message.
A tibble with the following columns, or NULL if the API is unavailable:
indicator: Indicator name (e.g., "Unemployment, total (
country: Country name ("Japan")
year: Year of the data (integer)
value: Unemployment rate as percentage in numeric form
Requires internet connection. The data is retrieved in real time from the World Bank API.
World Bank Open Data API: https://data.worldbank.org/indicator/SL.UEM.TOTL.ZS
# Requires internet connection result <- get_japan_unemployment() if (!is.null(result)) { print(result) }# Requires internet connection result <- get_japan_unemployment() if (!is.null(result)) { print(result) }
This dataset, hiroshima_tbl_df, is a tibble containing data on the number of deaths from leukemia and other cancers among survivors of the Hiroshima atom bomb. The data cover deaths that occurred during the period 1950–1959 among survivors who were aged 25 to 64 years in 1950.
data(hiroshima_tbl_df)data(hiroshima_tbl_df)
A tibble with 6 observations and 4 variables:
Radiation exposure category (character)
Number of deaths from leukemia (numeric)
Number of deaths from other cancers (numeric)
Total number of cancer deaths (numeric)
The dataset name has been kept as 'hiroshima_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble object. The original content has not been modified in any way.
Data taken from the dobson package version 0.4
This dataset, J1League_tbl_df, is a tibble containing results of Japan's J1 League matches from 2012 to 2022. The dataset includes information such as the season, match date and time, home and away teams, goals scored, and final match result. The dataset preserves the original structure from its source on Kaggle.
data(J1League_tbl_df)data(J1League_tbl_df)
A tibble with 3,213 observations and 7 variables:
Season year of the match (numeric)
Date and time of the match (POSIXct)
Home team name (character)
Away team name (character)
Number of goals scored by the home team (numeric)
Number of goals scored by the away team (numeric)
Final result of the match (character)
The dataset name has been kept as 'J1League_tbl_df' to maintain consistency with the naming conventions in the JapanAPIs package. The suffix 'tbl_df' indicates that this is a tibble data frame. The original content has not been modified in any way.
Data obtained from Kaggle: https://www.kaggle.com/datasets/irkaal/japanese-j1-league
This dataset, japan_birth_stats_tbl_df, is a tibble containing Japan’s birth-related demographic statistics from 1899 to 2022. The dataset includes birth counts by gender, birth rate, fertility rate, infant deaths, stillbirths, birth order, gestational weeks, average parental ages, and legitimacy of births. Some data are missing between the years 1944 and 1946 due to records lost during World War II. The dataset preserves the original structure from its source on Kaggle.
data(japan_birth_stats_tbl_df)data(japan_birth_stats_tbl_df)
A tibble with 124 observations and 51 variables:
Index column (numeric)
Year of observation (numeric)
Total number of births (numeric)
Number of male births (numeric)
Number of female births (numeric)
Birth rate per 1,000 population (numeric)
Male to female birth ratio (numeric)
Total fertility rate (numeric)
Total population (numeric)
Male population (numeric)
Female population (numeric)
Total number of infant deaths (numeric)
Number of male infant deaths (numeric)
Number of female infant deaths (numeric)
Infant deaths with unknown gender (numeric)
Infant mortality rate (numeric)
Male to female infant death ratio (numeric)
Proportion of infant deaths in total deaths (numeric)
Total number of stillbirths (numeric)
Number of male stillbirths (numeric)
Number of female stillbirths (numeric)
Stillbirths with unknown gender (numeric)
Stillbirth rate (numeric)
Male to female stillbirth ratio (numeric)
Number of firstborn children (numeric)
Number of secondborn children (numeric)
Number of thirdborn children (numeric)
Number of fourthborn children (numeric)
Number of fifthborn and above children (numeric)
Births before 28 gestational weeks (numeric)
Births between 28 and 31 gestational weeks (numeric)
Births between 32 and 36 gestational weeks (numeric)
Births between 37 and 41 gestational weeks (numeric)
Births after 42 gestational weeks (numeric)
Average age of mothers (numeric)
Average age of mothers for firstborns (numeric)
Average age of mothers for secondborns (numeric)
Average age of mothers for thirdborns (numeric)
Births to mothers under 19 years (numeric)
Births to mothers aged 20–24 (numeric)
Births to mothers aged 25–29 (numeric)
Births to mothers aged 30–34 (numeric)
Births to mothers aged 35–39 (numeric)
Births to mothers aged 40–44 (numeric)
Births to mothers over 45 years (numeric)
Average age of fathers (numeric)
Average age of fathers for firstborns (numeric)
Average age of fathers for secondborns (numeric)
Average age of fathers for thirdborns (numeric)
Number of legitimate children (numeric)
Number of illegitimate children (numeric)
The dataset name has been kept as 'japan_birth_stats_tbl_df' to maintain consistency with the naming conventions in the JapanAPIs package. The suffix 'tbl_df' indicates that this is a tibble data frame. The original content has not been modified in any way.
Data obtained from Kaggle: https://www.kaggle.com/datasets/webdevbadger/japan-birth-statistics
This dataset, japan_population_tbl_df, is a tibble containing Japan’s population over time and space. In this dataset you will find the raw population numbers for the prefectures, going as far back as the 1870s. The dataset preserves the original structure from its source on Kaggle.
data(japan_population_tbl_df)data(japan_population_tbl_df)
A tibble with 2,632 observations and 7 variables:
Name of the prefecture (character)
Year of the population observation (numeric)
Total population in that year (numeric)
Name of the prefectural capital (character)
Geographic region of the prefecture (character)
Estimated area of the prefecture in square kilometers (numeric)
Name of the island to which the prefecture belongs (character)
The dataset name has been kept as 'japan_population_tbl_df' to maintain consistency with the naming conventions in the JapanAPIs package. The suffix 'tbl_df' indicates that this is a tibble data frame. The original content has not been modified in any way.
Data obtained from Kaggle: https://www.kaggle.com/datasets/jd1325/japan-population-data
This dataset, japan_universities_tbl_df, is a tibble containing comprehensive data about 813 universities in Japan. It includes information such as the university name (in English and Japanese), location, type, year founded, number of faculties and departments, availability of graduate and remote programs, and difficulty and review metrics. The dataset preserves the original structure from its source on Kaggle.
data(japan_universities_tbl_df)data(japan_universities_tbl_df)
A tibble with 813 observations and 22 variables:
Index column (numeric)
University code (character)
University name in English (character)
University name in Japanese (character)
Type of university (e.g., Public, Private) (character)
Type of university in Japanese (character)
Full address of the university (character)
Postal code (character)
Contact phone number (character)
Name of the prefecture or region in English (character)
Name of the prefecture or region in Japanese (character)
Latitude coordinate (numeric)
Longitude coordinate (numeric)
Year the university was founded (character)
Number of faculties (numeric)
Number of departments (numeric)
Whether the university has graduate programs (logical)
Whether the university offers remote programs (logical)
Average user review rating (numeric)
Number of user reviews (numeric)
Standard deviation of difficulty ratings (numeric)
Relative difficulty rank (character)
The dataset name has been kept as 'japan_universities_tbl_df' to maintain consistency with the naming conventions in the JapanAPIs package. The suffix 'tbl_df' indicates that this is a tibble data frame. The original content has not been modified in any way.
Data obtained from Kaggle: https://www.kaggle.com/datasets/webdevbadger/japanese-universities
This package provides functions to access data from public RESTful APIs including 'Nager.Date', 'World Bank API', and 'REST Countries API', retrieving real-time or historical data related to Japan, such as holidays, economic indicators, international demographic and geopolitical indicators. Additionally, the package includes one of the largest curated collections of open datasets focused on Japan, covering topics such as natural disasters, economic production, vehicle industry, air quality, demographics, and administrative divisions.
JapanAPIs: Access Japanese Data via Public APIs and Curated Datasets
Access Japanese Data via Public APIs and Curated Datasets.
Maintainer: Renzo Caceres Rossi [email protected]
Useful links:
This dataset, japanese_whisky_tbl_df, is a tibble containing over 1,000 reviews of Japanese whisky. The dataset includes the bottle name, brand, review title, and full review content. The dataset preserves the original structure from its source on Kaggle.
data(japanese_whisky_tbl_df)data(japanese_whisky_tbl_df)
A tibble with 1,130 observations and 5 variables:
Index column (numeric)
Name of the whisky bottle (character)
Brand of the whisky (character)
Title of the review (character)
Full text content of the review (character)
The dataset name has been kept as 'japanese_whisky_tbl_df' to maintain consistency with the naming conventions in the JapanAPIs package. The suffix 'tbl_df' indicates that this is a tibble data frame. The original content has not been modified in any way.
Data obtained from Kaggle: https://www.kaggle.com/datasets/koki25ando/japanese-whisky-review
This dataset, JNcharacter_df, is a data frame containing a subset of responses from the Survey on the Japanese National Character. It includes demographic variables and responses related to values, opinions, and attitudes.
data(JNcharacter_df)data(JNcharacter_df)
A data frame with 85 observations and 10 variables:
Sex of the respondent (numeric)
Age of the respondent (numeric)
Political party preference (numeric)
Level of education (numeric)
Occupation category (numeric)
Religious identification: born again or not (numeric)
Perception of life as difficult (numeric)
Attitude toward pleasure (numeric)
Opinion on women working (numeric)
Importance of money (numeric)
The dataset name has been kept as 'JNcharacter_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the catdap package version 1.3.7
This dataset, jpn_climate_stations_tbl_df, is a tibble containing information about 157 climate observation stations located across various regions and prefectures in Japan. It includes details such as station name, location, altitude, coordinates, and city information.
data(jpn_climate_stations_tbl_df)data(jpn_climate_stations_tbl_df)
A tibble with 157 observations and 11 variables:
Geographical region where the station is located (character)
Prefecture name (character)
Station number (character)
Station name (character)
Altitude of the station (character)
Latitude coordinate (character)
Longitude coordinate (character)
Latitude direction, North or South (character)
Longitude direction, West or East (character)
Station name in Japanese phonetic script (character)
City name (character)
The dataset name has been kept as 'jpn_climate_stations_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble object. The original content has not been modified in any way.
Data taken from the clidatajp package version 0.5.2
This dataset, jpn_climate_tbl_df, is a tibble containing climate data from various observation stations across Japan. It includes monthly data on temperature, precipitation, snowfall, solar insolation, and metadata such as station name, location, altitude, and coordinates.
data(jpn_climate_tbl_df)data(jpn_climate_tbl_df)
A tibble with 3,768 observations and 14 variables:
Observation number (numeric)
Name of the weather station (character)
Month of observation (numeric)
Average temperature (numeric)
Monthly precipitation (numeric)
Monthly snowfall (numeric)
Monthly solar insolation (numeric)
Country name (character)
Measurement period or time range (character)
Altitude of the station (numeric)
Latitude coordinate (numeric)
Longitude coordinate (numeric)
Latitude direction, North or South (character)
Longitude direction, West or East (character)
The dataset name has been kept as 'jpn_climate_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble object. The original content has not been modified in any way.
Data taken from the clidatajp package version 0.5.2
This dataset, jpn_eq_miyagi_2003_df, is a data frame containing aftershock data from the earthquake of magnitude 6.2 that occurred on 26th July 2003 in northern Miyagi-Ken, Japan. It includes information on the time of occurrence, geographic coordinates, depth, and magnitude of each aftershock event.
data(jpn_eq_miyagi_2003_df)data(jpn_eq_miyagi_2003_df)
A data frame with 2305 observations and 5 variables:
Time of aftershock event (numeric)
Longitude coordinate (numeric)
Latitude coordinate (numeric)
Depth in kilometers (numeric)
Magnitude of the aftershock (numeric)
The dataset name has been kept as 'jpn_eq_miyagi_2003_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the mmpp package version 0.6
This dataset, jpn_gdp_cons_df, is a data frame containing information on Household Consumption (C) and Gross Domestic Product (GDP, denoted as Y) in Japan from 1978 to 2007. The data is useful for analyzing structural breaks under heteroskedasticity.
data(jpn_gdp_cons_df)data(jpn_gdp_cons_df)
A data frame with 30 observations and 3 variables:
Calendar year (integer)
Household Consumption (integer)
Gross Domestic Product (GDP) (integer)
The dataset name has been kept as 'jpn_gdp_cons_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the SupMZ package version 0.2.0
This dataset, jpn_manga_hafu_df, is a data frame containing information on half-caste manga characters. It includes attributes such as the year of publication, manga series, character name, gender, parental origin, and physical traits like eye and hair color.
data(jpn_manga_hafu_df)data(jpn_manga_hafu_df)
A data frame with 296 observations and 9 variables:
Year of publication (integer)
Manga series name (factor)
Character name (factor)
Gender of the character (factor)
Father's origin (factor)
Mother's origin (factor)
Eye color (factor)
Hair color (factor)
Additional notes about the character (factor)
The dataset name has been kept as 'jpn_manga_hafu_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the learningr package version 0.29.1
This dataset, jpn_prefectures_tbl_df, is a tibble representing Japan’s 47 prefectures arranged in a 7x7 grid layout. It includes prefectural codes, names in both romaji and kanji, regions, major islands, and grid coordinates.
data(jpn_prefectures_tbl_df)data(jpn_prefectures_tbl_df)
A tibble with 47 observations and 8 variables:
JIS code of the prefecture (character)
Prefectural name in romaji (character)
Geographical region of the prefecture (factor)
Major island the prefecture belongs to (character)
Prefectural name in kanji (character)
Region name in kanji (factor)
X coordinate for grid placement (integer)
Y coordinate for grid placement (integer)
The dataset name has been kept as 'jpn_prefectures_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble object. The original content has not been modified in any way.
Data taken from the tabularmaps package version 0.1.0
This dataset, jpn_suicides_tbl_df, is a tibble containing the number of suicides in Japan from 1978 to 2022 by sex and age group. The dataset includes suicide counts for males and females, overall suicide rates, and counts by specific age ranges. The dataset preserves the original structure from its source on Kaggle.
data(jpn_suicides_tbl_df)data(jpn_suicides_tbl_df)
A tibble with 45 observations and 14 variables:
Year of observation (numeric)
Number of male suicides (numeric)
Number of female suicides (numeric)
Overall suicide rate (numeric)
Suicide rate among males (numeric)
Suicide rate among females (numeric)
Number of suicides aged 0–19 (numeric)
Number of suicides aged 20–29 (numeric)
Number of suicides aged 30–39 (numeric)
Number of suicides aged 40–49 (numeric)
Number of suicides aged 50–59 (numeric)
Number of suicides aged 60 and above (numeric)
Number of suicides with unknown age (numeric)
Total number of suicides (numeric)
The dataset name has been kept as 'jpn_suicides_tbl_df' to maintain consistency with the naming conventions in the JapanAPIs package. The suffix 'tbl_df' indicates that this is a tibble data frame. The original content has not been modified in any way.
Data obtained from Kaggle: https://www.kaggle.com/datasets/krisztinboros/suicides-in-japan-by-sex-and-age-19782021
This dataset, jpn_us_cars_df, is a data frame containing information on 45 automobiles from the United States and Japan. It includes attributes such as model name, country of origin, mileage, and price.
data(jpn_us_cars_df)data(jpn_us_cars_df)
A data frame with 45 observations and 4 variables:
Car model (factor)
Country of origin (factor)
Mileage in miles per gallon (integer)
Price in US dollars (integer)
The dataset name has been kept as 'jpn_us_cars_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the fma package version 2.5
This dataset, jpn_usd_exchange_df, is a data frame containing weekly observations of the yen-dollar exchange rate from 1975 to 1989. It includes spot and forward rates, along with 30-day forward rates.
data(jpn_usd_exchange_df)data(jpn_usd_exchange_df)
A data frame with 778 observations and 4 variables:
Time index of the observation (integer)
Spot exchange rate (numeric)
Forward exchange rate (numeric)
30-day forward exchange rate (numeric)
The dataset name has been kept as 'jpn_usd_exchange_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the Ecdat package version 0.4-2
This dataset, jpn_vehicle_prod_ts, is a univariate time series containing the number of motor vehicles produced in Japan from 1947 to 1989. The figures are recorded annually and expressed in thousands.
data(jpn_vehicle_prod_ts)data(jpn_vehicle_prod_ts)
A univariate time series with 43 observations:
Yearly observations from 1947 to 1989
Motor vehicle production in Japan (in thousands)
The dataset name has been kept as 'jpn_vehicle_prod_ts' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'ts' indicates that the dataset is a time series object. The original content has not been modified in any way.
Data taken from the fma package version 2.5
This dataset, jpnprefs_tbl_df, is a tibble containing information about the 47 prefectures of Japan. It includes details such as prefectural names in kanji and romaji, corresponding JIS codes, the region each prefecture belongs to, and the major island associated with each.
data(jpnprefs_tbl_df)data(jpnprefs_tbl_df)
A tibble with 47 observations and 5 variables:
JIS code of the prefecture (character)
Prefectural name in kanji (character)
Prefectural name in romaji (character)
Geographical region of the prefecture (character)
Major island the prefecture belongs to (character)
The dataset name has been kept as 'jpnprefs_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble object. The original content has not been modified in any way.
Data taken from the zipangu package version 0.3.3
This dataset, 'kobe_quake_1995_ts', is a time series containing data related to the 1995 Kobe earthquake. The data are organized sequentially with a frequency of 1, and consist of 3,048 observations.
data(kobe_quake_1995_ts)data(kobe_quake_1995_ts)
A time series with 3,048 observations:
Time series data related to the 1995 Kobe earthquake (numeric)
The dataset name has been kept as 'kobe_quake_1995_ts' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'ts' indicates that the dataset is a time series object. The original content has not been modified in any way.
Data taken from the waveslim package version 1.8.5
This dataset, kojima_tweets_tbl_df, is a tibble containing 454 original tweets from Hideo Kojima, posted between November 17, 2019, and January 6, 2020. Retweets are excluded. The dataset includes tweet text, engagement metrics, tweet metadata, user handle information, geolocation data, hashtags, mentions, URLs, and media details. The dataset preserves the original structure from its source on Kaggle.
data(kojima_tweets_tbl_df)data(kojima_tweets_tbl_df)
A tibble with 454 observations and 41 variables:
Text content of the tweet (character)
Date when the tweet was created (character)
Number of retweets (numeric)
Number of likes (numeric)
Sum of retweets and likes (numeric)
Count of hashtags used (numeric)
Count of user mentions (numeric)
Count of URLs included (numeric)
Count of media elements attached (numeric)
Username who posted the tweet (character)
Whether it was a reply to another user (logical)
Type of post (e.g., tweet, reply, etc.) (character)
Type of media included (e.g., photo, video) (character)
Whether the tweet is a quote (character)
Whether the tweet is a retweet (character)
Language of the tweet content (character)
Country information if available (logical)
Geolocation as specified in the tweet (logical)
Full name of the location (logical)
Type of location (logical)
Hashtags used in the tweet (character)
User mentions in the tweet (character)
URLs present in the tweet (character)
Media URLs attached to the tweet (character)
Name associated with the Twitter account (character)
Handle of the Twitter account (character)
Date when the Twitter account was created (character)
Preferred language of the account (character)
Total number of tweets from the account (numeric)
Number of followers (numeric)
Number of accounts followed (numeric)
Total number of liked tweets by the account (numeric)
Number of lists the account is included in (numeric)
Location listed in the user profile (character)
Whether the account is protected (character)
Whether the account is verified (character)
URL included in the profile (character)
Profile description or bio (character)
URL of the profile image (character)
Age of the account in days (numeric)
Average tweets per day (numeric)
The dataset name has been kept as 'kojima_tweets_tbl_df' to maintain consistency with the naming conventions in the JapanAPIs package. The suffix 'tbl_df' indicates that this is a tibble data frame. The original content has not been modified in any way.
Data obtained from Kaggle: https://www.kaggle.com/datasets/luciesteel/kojima-tweets
This dataset, life_exp_japan_tbl_df, is a tibble containing life expectancy and related economic and social indicators for Japan's 47 prefectures as of 2020. It includes variables on healthcare infrastructure, education levels, income, public spending, and environmental indicators. The dataset preserves the original structure from its source on Kaggle.
data(life_exp_japan_tbl_df)data(life_exp_japan_tbl_df)
A tibble with 47 observations and 23 variables:
Name of the Japanese prefecture (character)
Average life expectancy (numeric)
Number of physicians per 100,000 people (numeric)
Number of junior colleges (numeric)
Number of universities (numeric)
Number of public hospitals (numeric)
Number of psychiatric hospitals (numeric)
Number of psychiatric hospital beds (numeric)
Number of nurses per 100,000 people (numeric)
Average weekly working hours (numeric)
Average annual salary (numeric)
Number of elementary schools (numeric)
Number of sports facilities (numeric)
Number of parks (numeric)
Percentage of forest area (numeric)
Per capita income (numeric)
Population density (people per km²) (numeric)
Total number of hospitals (numeric)
Total number of hospital beds (numeric)
Number of ambulances (numeric)
Health expenditure per capita (numeric)
Education expenditure per capita (numeric)
Welfare expenditure per capita (numeric)
The dataset name has been kept as 'life_exp_japan_tbl_df' to maintain consistency with the naming conventions in the JapanAPIs package. The suffix 'tbl_df' indicates that this is a tibble data frame. The original content has not been modified in any way.
Data obtained from Kaggle: https://www.kaggle.com/datasets/gianinamariapetrascu/japan-life-expectancy
This dataset, nikkei_stock_index_df, is a data frame containing the daily log returns in percent of the NIKKEI stock index for the period from 1984-01-04 to 2000-12-22.
data(nikkei_stock_index_df)data(nikkei_stock_index_df)
A data frame with 4,246 observations and 2 variables:
Date or index label (character)
Daily log return in percent (numeric)
The dataset name has been kept as 'nikkei_stock_index_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the tsgarch package version 1.0.3
This dataset, sake_ratings_df, is a data frame containing data from a study of Japanese rice wine (sake), used to investigate the relationship between two subjective ratings (taste and smell) and a number of physical measurements on 30 brands of sake.
data(sake_ratings_df)data(sake_ratings_df)
A data frame with 30 observations and 10 variables:
Subjective taste rating (numeric)
Subjective smell rating (numeric)
pH level (numeric)
Acidity measure 1 (numeric)
Acidity measure 2 (numeric)
Sake meter value (numeric)
Reducing sugar content (numeric)
Total sugar content (numeric)
Alcohol content (numeric)
Nitrogen content (numeric)
The dataset name has been kept as 'sake_ratings_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the heplots package version 1.7.0
This dataset, seishu_wine_df, is a data frame containing data from a study on wine. The study records the odor and taste of wines, with the aim of explaining these characteristics through various chemical properties such as pH concentration, alcohol content, total sugar, and others.
data(seishu_wine_df)data(seishu_wine_df)
A data frame with 30 observations and 10 variables:
Taste rating of the wine (numeric)
Odor rating of the wine (numeric)
pH concentration (numeric)
First acidity measurement (numeric)
Second acidity measurement (numeric)
Sake meter value (numeric)
Direct reducing sugar content (numeric)
Total sugar content (numeric)
Alcohol content (numeric)
Formyl nitrogen content (numeric)
The dataset name has been kept as 'seishu_wine_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the ACSWR package version 1.0
This dataset, shinkansen_stations_tbl_df, is a tibble containing information about 113 Shinkansen train stations across Japan. Each row represents a station and includes its name, the Shinkansen line it belongs to, the year it opened, the prefecture it is located in, the distance from Tokyo Station in kilometers, and the operating company. The dataset preserves the original structure from its source on Kaggle.
data(shinkansen_stations_tbl_df)data(shinkansen_stations_tbl_df)
A tibble with 113 observations and 6 variables:
Name of the Shinkansen station (character)
Name of the Shinkansen line (character)
Year the station opened (numeric)
Prefecture in which the station is located (character)
Distance from Tokyo Station in kilometers (numeric)
Operating company of the station (character)
The dataset name has been kept as 'shinkansen_stations_tbl_df' to maintain consistency with the naming conventions in the JapanAPIs package. The suffix 'tbl_df' indicates that this is a tibble data frame. The original content has not been modified in any way.
Data obtained from Kaggle: https://www.kaggle.com/datasets/japandata509/shinkansen-stations-in-japan
This dataset, tokaido_stations_tbl_df, is a tibble containing information about stations on the Tokaido Line operated by East Japan Railway. It includes each station's code and name.
data(tokaido_stations_tbl_df)data(tokaido_stations_tbl_df)
A tibble with 20 observations and 2 variables:
Station code (character)
Station name (character)
The dataset name has been kept as 'tokaido_stations_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble object. The original content has not been modified in any way.
Data taken from the ssrn package version 0.1.0
This dataset, tokyo_rainfall_vec, is a numeric vector containing Tokyo rainfall data from Kitagawa (1987), analysed also by Rue and Held (2005) and Fahrmeir and Tutz (2013). It includes daily rainfall measurements over a period of 366 days.
data(tokyo_rainfall_vec)data(tokyo_rainfall_vec)
A numeric vector with 366 observations:
Daily rainfall measurements (numeric values ranging from 0 to 2)
The dataset name has been kept as 'tokyo_rainfall_vec' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'vec' indicates that the dataset is a vector object. The original content has not been modified in any way.
Data taken from the gamlss.data package version 6.0-6
This dataset, tokyo_wards_tbl_df, is a tibble containing information on the 23 special wards of Tokyo. It includes ward numbers, names in both romaji and kanji, along with x and y coordinates for spatial layout.
data(tokyo_wards_tbl_df)data(tokyo_wards_tbl_df)
A tibble with 23 observations and 5 variables:
Ward number (character)
Ward name in romaji (character)
Ward name in kanji (character)
X coordinate for grid placement (numeric)
Y coordinate for grid placement (numeric)
The dataset name has been kept as 'tokyo_wards_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble object. The original content has not been modified in any way.
Data taken from the tabularmaps package version 0.1.0
This dataset, toyota_stock_prices_df, is a data frame containing the average monthly stock prices of Toyota from 1982 to 1998.
data(toyota_stock_prices_df)data(toyota_stock_prices_df)
A data frame with 255 observations and 2 variables:
Month number from the beginning of the time series (integer)
Average monthly stock price (numeric)
The dataset name has been kept as 'toyota_stock_prices_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the JapanAPIs package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Data taken from the MMAC package version 0.1.2
This function lists all datasets available in the 'JapanAPIs' package. If the 'JapanAPIs' package is not loaded, it stops and shows an error message. If no datasets are available, it returns a message and an empty vector.
view_datasets_JapanAPIs()view_datasets_JapanAPIs()
A character vector with the names of the available datasets. If no datasets are found, it returns an empty character vector.
if (requireNamespace("JapanAPIs", quietly = TRUE)) { library(JapanAPIs) view_datasets_JapanAPIs() }if (requireNamespace("JapanAPIs", quietly = TRUE)) { library(JapanAPIs) view_datasets_JapanAPIs() }