Browse Source

Windows!

master
hrbrmstr 8 years ago
parent
commit
c77bda8538
No known key found for this signature in database GPG Key ID: 1D7529BE14E2BBA9
  1. 2
      DESCRIPTION
  2. 8
      README.Rmd
  3. 7
      README.md

2
DESCRIPTION

@ -2,7 +2,7 @@ Package: htmltidy
Title: Clean Up Gnarly HTML/XML Title: Clean Up Gnarly HTML/XML
Version: 0.2.0.9000 Version: 0.2.0.9000
Authors@R: c( Authors@R: c(
person("Bob", "Rudis", email = "bob@rudis.net", role = c("aut", "cre")), person("Bob", "Rudis", email = "bob@rud.is", role = c("aut", "cre")),
person("Dave", "Dave", email = "dsr@w3.org", role = c("ctb", "aut"), person("Dave", "Dave", email = "dsr@w3.org", role = c("ctb", "aut"),
comment="HTML Tidy library") comment="HTML Tidy library")
) )

8
README.Rmd

@ -21,18 +21,12 @@ knitr::opts_chunk$set(
Inspired by [this SO question](http://stackoverflow.com/questions/37061873/identify-a-weblink-in-bold-in-r) and because there's a great deal of cruddy HTML out there that needs fixing to use properly when scraping data. Inspired by [this SO question](http://stackoverflow.com/questions/37061873/identify-a-weblink-in-bold-in-r) and because there's a great deal of cruddy HTML out there that needs fixing to use properly when scraping data.
It relies on a locally included version of [`libtidy`](http://www.html-tidy.org/) and presently is super-basic (no way to set options and pretty much only does HTML) It relies on a locally included version of [`libtidy`](http://www.html-tidy.org/) and works on macOS, Linux & Windows.
This works enough for me to use in a pinch. It should be straightforward (but tedious) to:
- enable passing options in a `list` (IN PROGRESS)
- Getting it to work on Windows (UNTESTED)
The following functions are implemented: The following functions are implemented:
- `tidy_html` : Clean up gnarly HTML/XML - `tidy_html` : Clean up gnarly HTML/XML
### Installation ### Installation
```{r eval=FALSE} ```{r eval=FALSE}

7
README.md

@ -6,12 +6,7 @@
Inspired by [this SO question](http://stackoverflow.com/questions/37061873/identify-a-weblink-in-bold-in-r) and because there's a great deal of cruddy HTML out there that needs fixing to use properly when scraping data. Inspired by [this SO question](http://stackoverflow.com/questions/37061873/identify-a-weblink-in-bold-in-r) and because there's a great deal of cruddy HTML out there that needs fixing to use properly when scraping data.
It relies on a locally included version of [`libtidy`](http://www.html-tidy.org/) and presently is super-basic (no way to set options and pretty much only does HTML) It relies on a locally included version of [`libtidy`](http://www.html-tidy.org/) and works on macOS, Linux & Windows.
This works enough for me to use in a pinch. It should be straightforward (but tedious) to:
- enable passing options in a `list`
- Getting it to work on Windows.
The following functions are implemented: The following functions are implemented:

Loading…
Cancel
Save