urltools - Vectorised Tools for URL Handling and Parsing
A toolkit for all URL-handling needs, including encoding and decoding, parsing, parameter extraction and modification. All functions are designed to be both fast and entirely vectorised. It is intended to be useful for people dealing with web-related datasets, such as server-side logs, although may be useful for other situations involving large sets of URLs.
Last updated
access-logsdata-importurlcpp
13.62 score 134 stars 288 dependents 1.1k scripts 44k downloadstriebeard - 'Radix' Trees in 'Rcpp'
'Radix trees', or 'tries', are key-value data structures optimised for efficient lookups, similar in purpose to hash tables. 'triebeard' provides an implementation of 'radix trees' for use in R programming and in developing packages with 'Rcpp'.
Last updated
data-structruesradix-trietriecpp
10.53 score 33 stars 293 dependents 15 scripts 39k downloadsWikipediR - A MediaWiki API Wrapper
A wrapper for the MediaWiki API, aimed particularly at the Wikimedia 'production' wikis, such as Wikipedia. It can be used to retrieve page text, information about users or the history of pages, and elements of the category tree.
Last updated
api-clientapi-wrappermediawiki
9.57 score 71 stars 16 dependents 95 scripts 7.6k downloadshumaniformat - A Parser for Human Names
Human names are complicated and nonstandard things. Humaniformat, which is based on Anthony Ettinger's 'humanparser' project <https://github.com/chovy/humanparser> provides functions for parsing human names, making a best-guess attempt to distinguish sub-components such as prefixes, suffixes, middle names and salutations.
Last updated
namesparsercpp
7.81 score 55 stars 11 dependents 53 scripts 1.3k downloadswebreadr - Tools for Reading Formatted Access Log Files
Read and tidy various common forms of web request log, including the Common and Combined Web Log formats and various Amazon access log types.
Last updated
access-logscpp
6.20 score 53 stars 12 scripts 232 downloadspiton - Parsing Expression Grammars in Rcpp
A wrapper around the 'Parsing Expression Grammar Template Library', a C++11 library for generating Parsing Expression Grammars, that makes it accessible within Rcpp. With this, developers can implement their own grammars and easily expose them in R packages.
Last updated
parsing-engineparsing-expression-grammarcpp
5.75 score 16 stars 16 dependents 4 scripts 1.5k downloadsreconstructr - Session Reconstruction and Analysis
Functions to reconstruct sessions from web log or other user trace data and calculate various metrics around them, producing tabular, output that is compatible with 'dplyr' or 'data.table' centered processes.
Last updated
log-analysissession-reconstructioncpp
5.64 score 28 stars 31 scripts 244 downloadsbatman - Convert categorical representations of logicals to actual logicals
Survey systems and other third-party data sources commonly use non- standard representations of logical values when it comes to qualitative data - "Yes", "No" and "N/A", say. batman is a package designed to seamlessly convert these into logicals. It is highly localised, and contains equivalents to boolean values in languages including German, French, Spanish, Italian, Turkish, Chinese and Polish.
Last updated
5.34 score 11 stars 80 scripts 197 downloadsolctools - Open Location Code Handling in R
'Open Location Codes' (https://openlocationcode.com/) are a Google- created standard for identifying geographic locations. olctools provides utilities for validating, encoding and decoding entries that follow this standard.
Last updated
cpp
5.23 score 13 stars 13 scripts 181 downloadsexif - Read EXIF Metadata from JPEGs
Extracts Exchangeable Image File Format (EXIF) metadata, such as camera make and model, ISO speed and the date-time the picture was taken on, from JPEG images. Incorporates the 'easyexif' <https://github.com/mayanklahiri/easyexif> library.
Last updated
cpp
3.74 score 11 stars 8 scripts 176 downloadsrdian - Client Library for The Guardian
A client library for 'The Guardian' (https://www.guardian.com/) and their API, this package allows users to search for Guardian articles and retrieve both the content and metadata.
Last updated
3.40 score 5 stars 6 scripts 195 downloads