Convert a html document to simple plain texts by removing all html tags. This package utilizes regular expressions to strip off html tags. It also offers gettxt() and browse() function, which enables you to get or browse texts at a certain web page.
Version: | 2.1.1 |
Depends: | R (≥ 3.0.0) |
Published: | 2017-10-19 |
Author: | Sangchul Park [aut, cre] |
Maintainer: | Sangchul Park <mail at sangchul.com> |
BugReports: | https://github.com/sangchulpark/htm2txt/issues |
License: | GPL-2 | GPL-3 [expanded from: GPL (≥ 2)] |
URL: | https://github.com/sangchulpark |
NeedsCompilation: | no |
In views: | WebTechnologies |
CRAN checks: | htm2txt results |
Reference manual: | htm2txt.pdf |
Package source: | htm2txt_2.1.1.tar.gz |
Windows binaries: | r-devel: htm2txt_2.1.1.zip, r-release: htm2txt_2.1.1.zip, r-oldrel: htm2txt_2.1.1.zip |
macOS binaries: | r-release: htm2txt_2.1.1.tgz, r-oldrel: htm2txt_2.1.1.tgz |
Old sources: | htm2txt archive |
Please use the canonical form https://CRAN.R-project.org/package=htm2txt to link to this page.