Showing 37 of total 37 results (show query)
btskinner
crosswalkr:Rename and Encode Data Frames Using External Crosswalk Files
A pair of functions for renaming and encoding data frames using external crosswalk files. It is especially useful when constructing master data sets from multiple smaller data sets that do not name or encode variables consistently across files. Based on similar commands in 'Stata'.
Maintained by Benjamin Skinner. Last updated 1 years ago.
28.6 match 9 stars 5.26 score 20 scriptsropengov
retroharmonize:Ex Post Survey Data Harmonization
Assist in reproducible retrospective (ex-post) harmonization of data, particularly individual level survey data, by providing tools for organizing metadata, standardizing the coding of variables, and variable names and value labels, including missing values, and documenting the data transformations, with the help of comprehensive s3 classes.
Maintained by Daniel Antal. Last updated 2 months ago.
15.2 match 10 stars 7.62 score 59 scriptseworx-org
iscoCrosswalks:Crosswalks Between Classifications of Occupations
Allows the user to perform approximate matching between the occupational classifications using concordances provided by the Institute for Structural Research and Faculty of Economics, University of Warsaw, <doi:10.1111/ecot.12145>. The crosswalks offer a complete step-by-step mapping of Standard Occupational Classification (2010) data to the International Standard Classification of Occupations (2008). We propose a mapping method based on the aforementioned research that converts measurements to the smallest possible unit of the target taxonomy, and then performs an aggregation/estimate to the requested degree Occupational Hierarchical level.
Maintained by Alexandros Kouretsis. Last updated 3 years ago.
21.6 match 7 stars 4.59 score 11 scriptsropensci
dataspice:Create Lightweight Schema.org Descriptions of Data
The goal of 'dataspice' is to make it easier for researchers to create basic, lightweight, and concise metadata files for their datasets. These basic files can then be used to make useful information available during analysis, create a helpful dataset "README" webpage, and produce more complex metadata formats to aid dataset discovery. Metadata fields are based on the 'Schema.org' and 'Ecological Metadata Language' standards.
Maintained by Bryce Mecum. Last updated 4 years ago.
datadatasetmetadataschema-orgunconfunconf18
12.6 match 162 stars 7.45 score 25 scriptspfizer-opensource
zippeR:Working with United States ZIP Code and ZIP Code Tabulation Area Data
Provides a set of functions for working with American postal codes, which are known as ZIP Codes. These include accessing ZIP Code to ZIP Code Tabulation Area (ZCTA) crosswalks, retrieving demographic data for ZCTAs, and tabulating demographic data for three-digit ZCTAs.
Maintained by Christopher Prener. Last updated 19 days ago.
13.0 match 11 stars 6.52 score 5 scripts 1 dependentsheli-xu
findSVI:Calculate Social Vulnerability Index for Communities
Developed by CDC/ATSDR (Centers for Disease Control and Prevention/ Agency for Toxic Substances and Disease Registry), Social Vulnerability Index (SVI) serves as a tool to assess the resilience of communities by taking into account socioeconomic and demographic factors. Provided with year(s), region(s) and a geographic level of interest, 'findSVI' retrieves required variables from US census data and calculates SVI for communities in the specified area based on CDC/ATSDR SVI documentation. Reference for the calculation methods: Flanagan BE, Gregory EW, Hallisey EJ, Heitgerd JL, Lewis B (2011) <doi:10.2202/1547-7355.1792>.
Maintained by Heli Xu. Last updated 1 months ago.
13.6 match 12 stars 5.68 score 16 scriptselipousson
mapbaltimore:Make maps for Baltimore City with open data
This package provides data from the Baltimore City, the state of Maryland, and other sources, functions to access additional data, and function to create and modify simple maps of Baltimore neighborhoods using sf and ggplot2.
Maintained by Eli Pousson. Last updated 4 months ago.
17.2 match 17 stars 3.85 score 14 scriptstonyfischetti
libbib:Various Utilities for Library Science/Assessment and Cataloging
Provides functions for validating and normalizing bibliographic codes such as ISBN, ISSN, and LCCN. Also includes functions to communicate with the WorldCat API, translate Call numbers (Library of Congress and Dewey Decimal) to their subject classifications or subclassifications, and provides various loadable data files such call number / subject crosswalks and code tables.
Maintained by Tony Fischetti. Last updated 2 years ago.
18.4 match 3.20 score 32 scriptswenjie2wang
touch:Tools of Utilization and Cost in Healthcare
R implementation of the software tools developed in the H-CUP (Healthcare Cost and Utilization Project) <https://www.hcup-us.ahrq.gov> and AHRQ (Agency for Healthcare Research and Quality) <https://www.ahrq.gov>. It currently contains functions for mapping ICD-9 codes to the AHRQ comorbidity measures and translating ICD-9 (resp. ICD-10) codes to ICD-10 (resp. ICD-9) codes based on GEM (General Equivalence Mappings) from CMS (Centers for Medicare and Medicaid Services).
Maintained by Wenjie Wang. Last updated 3 years ago.
ahrqcomorbiditycrosswalkhealthcareicd-10icd-9cpp
11.0 match 13 stars 4.58 score 29 scriptsbtskinner
duawranglr:Securely Wrangle Dataset According to Data Usage Agreement
Create shareable data sets from raw data files that contain protected elements. Relying on master crosswalk files that list restricted variables, package functions warn users about possible violations of data usage agreement and prevent writing protected elements.
Maintained by Benjamin Skinner. Last updated 4 years ago.
data-securitydata-usage-agreementdata-wrangling
9.2 match 9 stars 5.37 score 13 scriptsmarketbridge
zctaCrosswalk:Crosswalk Between 2020 Census ZIP Code Tabulation Areas (ZCTAs), States and Counties
Contains the US Census Bureau's 2020 ZCTA to County Relationship File, as well as convenience functions to translate between States, Counties and ZIP Code Tabulation Areas (ZCTAs).
Maintained by Ari Lamstein. Last updated 2 years ago.
8.1 match 5 stars 5.39 score 11 scripts 1 dependentsropensci
cffr:Generate Citation File Format ('cff') Metadata for R Packages
The Citation File Format version 1.2.0 <doi:10.5281/zenodo.5171937> is a human and machine readable file format which provides citation metadata for software. This package provides core utilities to generate and validate this metadata.
Maintained by Diego Hernangómez. Last updated 4 days ago.
attributioncitationcreditcitation-filescffmetadatacitation-file-formatropensci
4.2 match 26 stars 9.74 score 116 scripts 3 dependentspursuitofdatascience
tidyEmoji:Discovers Emoji from Text
Unicodes are not friendly to work with, and not all Unicodes are Emoji per se, making obtaining Emoji statistics a difficult task. This tool can help your experience of working with Emoji as smooth as possible, as it has the 'tidyverse' style.
Maintained by Youzhi Yu. Last updated 2 years ago.
7.3 match 2 stars 4.00 score 7 scriptselipousson
getACS:Help Wrangling American Community Survey Data from tidycensus
A package with helper functions for working with Census data downloaded with the tidycensus package.
Maintained by Eli Pousson. Last updated 1 months ago.
american-community-surveytidycensus
6.2 match 4 stars 4.68 score 10 scriptschoi-phd
PROsetta:Linking Patient-Reported Outcomes Measures
Perform scale linking to establish relationships between instruments that measure similar constructs according to the PROsetta Stone methodology, as in Choi, Schalet, Cook, & Cella (2014) <doi:10.1037/a0035768>.
Maintained by Seung W. Choi. Last updated 4 months ago.
5.3 match 1 stars 5.04 score 11 scriptsgavinrozzi
zipcodeR:Data & Functions for Working with US ZIP Codes
Make working with ZIP codes in R painless with an integrated dataset of U.S. ZIP codes and functions for working with them. Search ZIP codes by multiple geographies, including state, county, city & across time zones. Also included are functions for relating ZIP codes to Census data, geocoding & distance calculations.
Maintained by Gavin Rozzi. Last updated 1 years ago.
3.6 match 80 stars 7.31 score 176 scriptssumtxt
ags:Crosswalk Municipality and District Statistics in Germany
Construct time series for Germany's municipalities (Gemeinden) and districts (Kreise) using a annual crosswalk constructed by the Federal Office for Building and Regional Planning (BBSR).
Maintained by Moritz Marbach. Last updated 11 months ago.
5.4 match 8 stars 4.60 score 3 scriptscran
mwlaxeref:Cross-References Lake Identifiers Between Different Data Sets
Handy helper package for cross-referencing lake identifiers among different data sets in the Midwestern United States. There are multiple different state, regional, and federal agencies that have different identifiers on lakes. This package helps you to go between them.
Maintained by Paul Frater. Last updated 1 years ago.
8.5 match 2.00 scoreusaid-oha-si
tameDP:Import targets and PLHIV data from COP Target Setting Tool (formerly Data Pack)
Import PSNUxIM targets and PLHIV data from COP Data Pack. The purpose is to make the data tidy and more usable than their current structure in the Excel data packs.
Maintained by Aaron Chafetz. Last updated 1 years ago.
3.4 match 1 stars 4.92 score 46 scriptselipousson
getdata:Get Easy Access to Tabular and Spatial Data
Download and format spatial and non-spatial data with simple filtering by location.
Maintained by Eli Pousson. Last updated 5 months ago.
3.4 match 12 stars 4.46 score 18 scripts 3 dependentselipousson
filenamr:Make and Modify File Names and Metadata
Work with filenames and paths and read and write file metadata.
Maintained by Eli Pousson. Last updated 4 months ago.
3.6 match 3 stars 4.03 score 3 scripts 6 dependentspoissonconsulting
wqbc:Tidy Water Quality Data and Calculate Thresholds for British Columbia
Tidies water quality data and calculates water quality thresholds for British Columbia.
Maintained by Andy Teucher. Last updated 2 months ago.
3.3 match 4.23 score 34 scriptselbersb
segregation:Entropy-Based Segregation Indices
Computes segregation indices, including the Index of Dissimilarity, as well as the information-theoretic indices developed by Theil (1971) <isbn:978-0471858454>, namely the Mutual Information Index (M) and Theil's Information Index (H). The M, further described by Mora and Ruiz-Castillo (2011) <doi:10.1111/j.1467-9531.2011.01237.x> and Frankel and Volij (2011) <doi:10.1016/j.jet.2010.10.008>, is a measure of segregation that is highly decomposable. The package provides tools to decompose the index by units and groups (local segregation), and by within and between terms. The package also provides a method to decompose differences in segregation as described by Elbers (2021) <doi:10.1177/0049124121986204>. The package includes standard error estimation by bootstrapping, which also corrects for small sample bias. The package also contains functions for visualizing segregation patterns.
Maintained by Benjamin Elbers. Last updated 1 years ago.
entropysegregationstatisticscpp
2.0 match 36 stars 6.44 score 51 scriptsblakelanglais
ProAE:PRO-CTCAE Scoring, Analysis, and Graphical Tools
A collection of tools to facilitate standardized analysis and graphical procedures when using the National Cancer Institute’s Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE) and other PRO measurements.
Maintained by Blake Langlais. Last updated 5 months ago.
3.6 match 3.48 score 9 scriptsjamgreen
lehdr:Grab Longitudinal Employer-Household Dynamics (LEHD) Flat Files
Designed to query Longitudinal Employer-Household Dynamics (LEHD) workplace/residential association and origin-destination flat files and optionally aggregate Census block-level data to block group, tract, county, or state. Data comes from the LODES FTP server <https://lehd.ces.census.gov/data/lodes/LODES8/>.
Maintained by Jamaal Green. Last updated 4 months ago.
1.7 match 62 stars 7.05 score 90 scriptsadamlilith
omnibus:Helper Tools for Managing Data, Dates, Missing Values, and Text
An assortment of helper functions for managing data (e.g., rotating values in matrices by a user-defined angle, switching from row- to column-indexing), dates (e.g., intuiting year from messy date strings), handling missing values (e.g., removing elements/rows across multiple vectors or matrices if any have an NA), text (e.g., flushing reports to the console in real-time); and combining data frames with different schema (copying, filling, or concatenating columns or applying functions before combining).
Maintained by Adam B. Smith. Last updated 6 months ago.
count-decimalsleap-yearmerge-listsmissing-valuesrotate-matrixsampling
1.7 match 4 stars 5.83 score 54 scripts 3 dependentsbldavies
uk2us:Convert Words Between UK and US English
Functions for converting between UK and US spellings of English words.
Maintained by Benjamin Davies. Last updated 4 years ago.
3.6 match 2.70 score 1 scriptselipousson
baltimoredata:Access Updated and Historic Data on Baltimore City, Maryland
This package provides access to a small number of spatial and nonspatial datasets related to Baltimore City, Maryland.
Maintained by Eli Pousson. Last updated 8 months ago.
3.8 match 2.40 score 2 scriptsrmi-pacta
pacta.data.preparation:Prepare Data for PACTA for Investors
This package provides tools to prepare input datasets to be run in the PACTA_analysis tool.
Maintained by CJ Yetman. Last updated 5 months ago.
climate-changepactapactaversesustainable-finance
1.7 match 1 stars 4.36 score 11 scripts 1 dependentselipousson
marylandedu:Maryland State Department of Education Data
The marylandedu package provides access to selected datasets from the Maryland State Department of Education (MSDE) that have been standardized to better support year-by-year analysis and visualization.
Maintained by Eli Pousson. Last updated 1 years ago.
3.5 match 1 stars 1.70 scorechristopherkenny
cvap:Citizen Voting Age Population
Works with the Citizen Voting Age Population special tabulation from the US Census Bureau <https://www.census.gov/programs-surveys/decennial-census/about/voting-rights/cvap.html>. Provides tools to download and process raw data. Also provides a downloading interface to processed data. Implements a very basic approach to estimate block level citizen voting age population from block group data.
Maintained by Christopher T. Kenny. Last updated 12 months ago.
1.8 match 2 stars 3.30 score 7 scriptsomtarful
harmonydata:R Library for 'Harmony'
'Harmony' is a tool using AI which allows you to compare items from questionnaires and identify similar content. You can try 'Harmony' at <https://harmonydata.ac.uk/app/> and you can read our blog at <https://harmonydata.ac.uk/blog/> or at <https://fastdatascience.com/how-does-harmony-work/>. Documentation at <https://harmonydata.ac.uk/harmony-r-released/>.
Maintained by Omar Hassoun. Last updated 8 days ago.
1.9 match 1.20 score 16 scripts