Package 'paperwizard'

Title: Scrape News Sites using 'readability.js'
Description: Uses Mozilla's 'readability.js' to scrape text from websites.
Authors: David Schoch [aut, cre]
Maintainer: David Schoch <[email protected]>
License: MIT + file LICENSE
Version: 0.2.0.9000
Built: 2024-10-29 13:24:24 UTC
Source: https://github.com/schochastics/paperwizard

Help Index


Scrape using Readability.js

Description

Scrapes text from websites using Mozilla's Readability.js.

Usage

pw_deliver(x, type = c("static", "dynamic"))

Arguments

x

Either a character vector of URLs or a data.frame returned by paperboy::pb_collect().

type

Either "static" or "dynamic"; determines how the articles are scraped.

Value

A tibble similar to the output of paperboy::pb_deliver().
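
Examples

The following is a minimal sketch of the two input forms described under Arguments, not an example taken from the package itself. The URL is a placeholder, and the calls are guarded so that nothing is scraped unless the session is interactive and the packages are installed. It assumes the npm dependencies have already been installed.

```r
# Placeholder URL (an assumption for illustration); replace with real article links.
urls <- c("https://example.com/news/article-1")

# Scraping needs network access, so only run it in an interactive session
# where paperwizard is actually installed.
if (interactive() && requireNamespace("paperwizard", quietly = TRUE)) {
  # Form 1: pass a vector of URLs directly.
  articles <- paperwizard::pw_deliver(urls, type = "static")
  print(articles)

  # Form 2: pass the data.frame returned by paperboy::pb_collect().
  if (requireNamespace("paperboy", quietly = TRUE)) {
    collected <- paperboy::pb_collect(urls)
    articles <- paperwizard::pw_deliver(collected, type = "dynamic")
    print(articles)
  }
}
```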


Run NPM install to install dependencies

Description

Runs npm install to install dependencies.

Usage

pw_npm_install()

Value

An installed library. The function is called for its side effect of installing the npm dependencies.
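
Examples

A minimal sketch of a one-time setup step. Checking for an npm executable with Sys.which() is an assumption added here for illustration; the package may perform its own checks. The install call is guarded so that it only runs in an interactive session where paperwizard is available.

```r
# Look for an npm executable on the PATH (base R; returns "" if not found).
npm_path <- unname(Sys.which("npm"))
has_npm <- nzchar(npm_path)

if (!has_npm) {
  message("npm was not found on the PATH; install Node.js first.")
}

# Install the dependencies once, before the first call to pw_deliver().
if (interactive() && has_npm && requireNamespace("paperwizard", quietly = TRUE)) {
  paperwizard::pw_npm_install()
}
```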


Summary of delivered articles

Description

Summary of delivered articles

Usage

pw_report(x, n = 100)

Arguments

x

Result from pw_deliver().

n

Integer cutoff below which articles are considered too short (default 100).

Value

Nothing. Called for its side effects.
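
Examples

A minimal sketch of summarising a scrape, assuming pw_deliver() succeeds on the given input. The URLs are placeholders and the cutoff of 250 is an arbitrary value chosen to show the n argument; neither comes from the package documentation.

```r
# Placeholder URLs (assumptions for illustration).
urls <- c(
  "https://example.com/news/article-1",
  "https://example.com/news/article-2"
)

# Guarded so that nothing is scraped outside an interactive session.
if (interactive() && requireNamespace("paperwizard", quietly = TRUE)) {
  articles <- paperwizard::pw_deliver(urls)

  # Summary with the default cutoff (n = 100).
  paperwizard::pw_report(articles)

  # Summary with a higher cutoff for "too short" articles.
  paperwizard::pw_report(articles, n = 250)
}
```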