hrbrmstr
8 years ago
No known key found for this signature in database
GPG Key ID: 1D7529BE14E2BBA9
4 changed files with 41 additions and 5 deletions
- R/htmltidy-package.r
- README.Rmd
- README.md
- man/htmltidy.Rd
@@ -1,8 +1,12 @@
-#' Clean up gnarly HTML/XML
+#' Clean Up Gnarly HTML/XML
 #'
+#' HTML and XML documents can be beautiful and pristine. They can also be
+#' wretched, evil, malformed hellspawn. Now, you can tidy up that HTML and XML before
+#' processing it with your favorite angle-bracket parsing tools.
+#'
 #' @name htmltidy
 #' @docType package
-#' @author Bob Rudis (@@hrbrmstr)
+#' @author Bob Rudis (bob@@rud.is)
 #' @useDynLib htmltidy
 #' @importFrom Rcpp sourceCpp
 NULL
@@ -32,6 +32,21 @@ The following functions are implemented:
 
 - `tidy_html` : Clean up gnarly HTML/XML
 
+### TODO
+
+Fix:
+
+```{text}
+* checking compiled code ... WARNING
+File ‘htmltidy/libs/htmltidy.so’:
+Found ‘___stderrp’, possibly from ‘stderr’ (C)
+Objects: ‘alloc.o’, ‘streamio.o’, ‘tidylib.o’
+Found ‘___stdoutp’, possibly from ‘stdout’ (C)
+Objects: ‘sprtf.o’, ‘tidylib.o’
+Found ‘_exit’, possibly from ‘exit’ (C)
+Objects: ‘alloc.o’, ‘sprtf.o’
+```
+
 ### Installation
 
 ```{r eval=FALSE}
@@ -17,6 +17,21 @@ The following functions are implemented:
 
 - `tidy_html` : Clean up gnarly HTML/XML
 
+### TODO
+
+Fix:
+
+``` text
+* checking compiled code ... WARNING
+File ‘htmltidy/libs/htmltidy.so’:
+Found ‘___stderrp’, possibly from ‘stderr’ (C)
+Objects: ‘alloc.o’, ‘streamio.o’, ‘tidylib.o’
+Found ‘___stdoutp’, possibly from ‘stdout’ (C)
+Objects: ‘sprtf.o’, ‘tidylib.o’
+Found ‘_exit’, possibly from ‘exit’ (C)
+Objects: ‘alloc.o’, ‘sprtf.o’
+```
+
 ### Installation
 
 ``` r
@@ -4,11 +4,13 @@
 \name{htmltidy}
 \alias{htmltidy}
 \alias{htmltidy-package}
-\title{Clean up gnarly HTML/XML}
+\title{Clean Up Gnarly HTML/XML}
 \description{
-Clean up gnarly HTML/XML
+HTML and XML documents can be beautiful and pristine. They can also be
+wretched, evil, malformed hellspawn. Now, you can tidy up that HTML and XML before
+processing it with your favorite angle-bracket parsing tools.
 }
 \author{
-Bob Rudis (@hrbrmstr)
+Bob Rudis (bob@rud.is)
 }
 
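Note on the README TODO: the `R CMD check` warning quoted there flags direct use of `stderr`, `stdout` and `exit` in the compiled objects. One common way to address that kind of warning is to route diagnostics and fatal errors through small wrappers that call R's own `REprintf()` and `Rf_error()`. The following is only a minimal sketch of that pattern, not the fix this package actually adopted. The names `USING_R`, `tidy_report`, `tidy_fail` and `tidy_last_msg` are hypothetical and do not come from the commit or from the bundled tidy sources.

```c
#include <assert.h>
#include <stdarg.h>
#include <stdio.h>
#include <string.h>

#ifdef USING_R   /* hypothetical build flag: define it when compiling inside the package */
#include <R.h>   /* declares REprintf() and Rf_error() */
#endif

/* Most recent diagnostic, kept in a buffer so callers can inspect it. */
char tidy_last_msg[256];

/* Hypothetical stand-in for fprintf(stderr, ...) calls in vendored C code. */
void tidy_report(const char *fmt, ...)
{
    va_list ap;

    assert(fmt != NULL);
    va_start(ap, fmt);
    vsnprintf(tidy_last_msg, sizeof tidy_last_msg, fmt, ap);
    va_end(ap);
#ifdef USING_R
    REprintf("%s", tidy_last_msg);  /* goes to R's error stream, not C stderr */
#endif
}

/* Hypothetical stand-in for exit(): records the failure and hands control
 * back to the caller instead of terminating the whole process. */
int tidy_fail(const char *what)
{
    tidy_report("tidy: %s\n", what);
#ifdef USING_R
    Rf_error("%s", tidy_last_msg);  /* raises an R error; does not return */
#endif
    return -1;
}
```

Outside R (with `USING_R` undefined) the wrappers only record the message and return a status code, so the sketch compiles without touching `stderr`, `stdout` or `exit` at all. Whether the flagged objects (`alloc.o`, `streamio.o`, `sprtf.o`, `tidylib.o`) can be patched this way depends on how the vendored sources are structured, which this commit does not show.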