• About
  • Documentation

  • More Universes
  • Recent Updates
  • Leader board

  • All repositories
  • All packages
  • All articles
  • All datasets
  • All system Libraries
ironholds
  • Builds
  • Packages
  • Articles
  • Datasets
  • Contribution
  • Badges
  • API
  • Feed

Links toironholds

urltools - Vectorised Tools for URL Handling and Parsing

A toolkit for all URL-handling needs, including encoding and decoding, parsing, parameter extraction and modification. All functions are designed to be both fast and entirely vectorised. It is intended to be useful for people dealing with web-related datasets, such as server-side logs, although may be useful for other situations involving large sets of URLs.

Last updated

access-logsdata-importurlcpp

13.62 score 134 stars 288 dependents 1.1k scripts 44k downloads

triebeard - 'Radix' Trees in 'Rcpp'

'Radix trees', or 'tries', are key-value data structures optimised for efficient lookups, similar in purpose to hash tables. 'triebeard' provides an implementation of 'radix trees' for use in R programming and in developing packages with 'Rcpp'.

Last updated

data-structruesradix-trietriecpp

10.53 score 33 stars 293 dependents 15 scripts 39k downloads

WikipediR - A MediaWiki API Wrapper

A wrapper for the MediaWiki API, aimed particularly at the Wikimedia 'production' wikis, such as Wikipedia. It can be used to retrieve page text, information about users or the history of pages, and elements of the category tree.

Last updated

api-clientapi-wrappermediawiki

9.57 score 71 stars 16 dependents 95 scripts 7.6k downloads

humaniformat - A Parser for Human Names

Human names are complicated and nonstandard things. Humaniformat, which is based on Anthony Ettinger's 'humanparser' project <https://github.com/chovy/humanparser> provides functions for parsing human names, making a best-guess attempt to distinguish sub-components such as prefixes, suffixes, middle names and salutations.

Last updated

namesparsercpp

7.81 score 55 stars 11 dependents 53 scripts 1.3k downloads

webreadr - Tools for Reading Formatted Access Log Files

Read and tidy various common forms of web request log, including the Common and Combined Web Log formats and various Amazon access log types.

Last updated

access-logscpp

6.20 score 53 stars 12 scripts 232 downloads

piton - Parsing Expression Grammars in Rcpp

A wrapper around the 'Parsing Expression Grammar Template Library', a C++11 library for generating Parsing Expression Grammars, that makes it accessible within Rcpp. With this, developers can implement their own grammars and easily expose them in R packages.

Last updated

parsing-engineparsing-expression-grammarcpp

5.75 score 16 stars 16 dependents 4 scripts 1.5k downloads

primes - Fast Functions for Prime Numbers

Fast functions for dealing with prime numbers, such as testing whether a number is prime and generating a sequence prime numbers. Additional functions include finding prime factors and Ruth-Aaron pairs, finding next and previous prime numbers in the series, finding or estimating the nth prime, estimating the number of primes less than or equal to an arbitrary number, computing primorials, prime k-tuples (e.g., twin primes), finding the greatest common divisor and smallest (least) common multiple, testing whether two numbers are coprime, and computing Euler's totient function. Most functions are vectorized for speed and convenience.

Last updated

cpp

5.65 score 10 stars 4 dependents 65 scripts 1.1k downloads

reconstructr - Session Reconstruction and Analysis

Functions to reconstruct sessions from web log or other user trace data and calculate various metrics around them, producing tabular, output that is compatible with 'dplyr' or 'data.table' centered processes.

Last updated

log-analysissession-reconstructioncpp

5.64 score 28 stars 31 scripts 244 downloads

batman - Convert categorical representations of logicals to actual logicals

Survey systems and other third-party data sources commonly use non- standard representations of logical values when it comes to qualitative data - "Yes", "No" and "N/A", say. batman is a package designed to seamlessly convert these into logicals. It is highly localised, and contains equivalents to boolean values in languages including German, French, Spanish, Italian, Turkish, Chinese and Polish.

Last updated

5.34 score 11 stars 80 scripts 197 downloads

olctools - Open Location Code Handling in R

'Open Location Codes' (https://openlocationcode.com/) are a Google- created standard for identifying geographic locations. olctools provides utilities for validating, encoding and decoding entries that follow this standard.

Last updated

cpp

5.23 score 13 stars 13 scripts 181 downloads

exif - Read EXIF Metadata from JPEGs

Extracts Exchangeable Image File Format (EXIF) metadata, such as camera make and model, ISO speed and the date-time the picture was taken on, from JPEG images. Incorporates the 'easyexif' <https://github.com/mayanklahiri/easyexif> library.

Last updated

cpp

3.74 score 11 stars 8 scripts 176 downloads

rdian - Client Library for The Guardian

A client library for 'The Guardian' (https://www.guardian.com/) and their API, this package allows users to search for Guardian articles and retrieve both the content and metadata.

Last updated

3.40 score 5 stars 6 scripts 195 downloads

muckrock - Data on Freedom of Information Act Requests

A data package containing public domain information on requests made by the 'MuckRock' (https://www.muckrock.com/) project under the United States Freedom of Information Act.

Last updated

2.70 score 1 scripts 166 downloads