WikipediR - A MediaWiki API Wrapper
A wrapper for the MediaWiki API, aimed particularly at the Wikimedia 'production' wikis, such as Wikipedia. It can be used to retrieve page text, information about users or the history of pages, and elements of the category tree.
Last updated 3 months ago
api-client, api-wrapper, mediawiki
65 stars 5.19 score 9 dependencies 37 dependents
pageviews - An API Client for Wikimedia Traffic Data
Pageview data from the 'Wikimedia' sites, such as 'Wikipedia' <https://www.wikipedia.org/>, from entire projects to per-article levels of granularity, through the new RESTful API and data source <https://wikimedia.org/api/rest_v1/?doc>.
Last updated 4 months ago
mediawiki, pageview, pageview-data, wikimedia, wikipedia
23 stars 2.36 score 8 dependencies
triebeard - 'Radix' Trees in 'Rcpp'
'Radix trees', or 'tries', are key-value data structures optimised for efficient lookups, similar in purpose to hash tables. 'triebeard' provides an implementation of 'radix trees' for use in R programming and in developing packages with 'Rcpp'.
Last updated 1 year ago
data-structrues, radix-trie, trie
32 stars 9.63 score 1 dependency 385 dependents
reconstructr - Session Reconstruction and Analysis
Functions to reconstruct sessions from web log or other user trace data and calculate various metrics around them, producing tabular output that is compatible with 'dplyr'- or 'data.table'-centered processes.
Last updated 2 years ago
log-analysis, session-reconstruction
29 stars 2.43 score 4 dependencies
piton - Parsing Expression Grammars in Rcpp
A wrapper around the 'Parsing Expression Grammar Template Library', a C++11 library for generating Parsing Expression Grammars, that makes it accessible within Rcpp. With this, developers can implement their own grammars and easily expose them in R packages.
Last updated 4 years ago
parsing-engine, parsing-expression-grammar
16 stars 2.83 score 1 dependency 8 dependents
webreadr - Tools for Reading Formatted Access Log Files
Read and tidy various common forms of web request log, including the Common and Combined Web Log formats and various Amazon access log types.
Last updated 4 years ago
access-logs
50 stars 3.19 score 26 dependencies
urltools - Vectorised Tools for URL Handling and Parsing
A toolkit for all URL-handling needs, including encoding and decoding, parsing, parameter extraction and modification. All functions are designed to be both fast and entirely vectorised. It is intended to be useful for people dealing with web-related datasets, such as server-side logs, although it may also be useful in other situations involving large sets of URLs.
Last updated 4 years ago
access-logs, data-import, url
131 stars 9.84 score 2 dependencies 382 dependents
exif -
Last updated 6 years ago
humaniformat - A Parser for Human Names
Human names are complicated and nonstandard things. Humaniformat, which is based on Anthony Ettinger's 'humanparser' project <https://github.com/chovy/humanparser>, provides functions for parsing human names, making a best-guess attempt to distinguish sub-components such as prefixes, suffixes, middle names and salutations.
Last updated 8 years ago
names, parser
53 stars 3.66 score 1 dependency 7 dependents
batman - Convert categorical representations of logicals to actual logicals
Survey systems and other third-party data sources commonly use non-standard representations of logical values when it comes to qualitative data - "Yes", "No" and "N/A", say. batman is a package designed to seamlessly convert these into logicals. It is highly localised, and contains equivalents to boolean values in languages including German, French, Spanish, Italian, Turkish, Chinese and Polish.
Last updated 8 years ago
11 stars 1.52 score 0 dependencies
olctools - Open Location Code Handling in R
'Open Location Codes' (https://openlocationcode.com/) are a Google-created standard for identifying geographic locations. olctools provides utilities for validating, encoding and decoding entries that follow this standard.
Last updated 8 years ago
13 stars 1.76 score 1 dependency
rdian - Client Library for The Guardian
A client library for 'The Guardian' (https://www.guardian.com/) and its API. This package allows users to search for Guardian articles and retrieve both their content and metadata.
Last updated 8 years ago
5 stars 1.17 score 8 dependencies