---
output: rmarkdown::github_document
---

<!-- README.md is generated from README.Rmd. Please edit that file -->

```{r, echo = FALSE}
# Default knitr chunk options used when rendering this README
knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  message = FALSE,
  warning = FALSE,
  error = FALSE,
  fig.path = "README-"
)
```

`htmltidy` — Clean up gnarly HTML/XML

Inspired by this SO question, and by the great deal of cruddy HTML out there that needs fixing before it can be used properly when scraping data.

It relies on a locally included version of `libtidy` and is presently super-basic: there is no way to set options, and it pretty much only handles HTML.

This works well enough for me to use in a pinch. It should be straightforward (but tedious) to:

- enable passing options in a `list` (IN PROGRESS)
- get it working on Windows (UNTESTED)

The following functions are implemented:

- `tidy_html` : Clean up gnarly HTML/XML

### Installation

```{r eval=FALSE}
```

```{r echo=FALSE}
```

### Usage


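```{r}
library(htmltidy)

# current version
packageVersion("htmltidy")

# tidy a malformed fragment and print the cleaned-up markup
cat(tidy_html("<b><p><a href=''>google &gt</a></p></b>"))
```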
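
Since the motivation is scraping, here is a minimal sketch of handing the tidied markup to a parser. It assumes the `xml2` package is installed and that `tidy_html()` returns a single character string, as the `cat()` call above suggests. The variable names are illustrative only.

```{r}
library(htmltidy)
library(xml2)

# A malformed fragment: block element inside inline, unterminated entity
messy <- "<b><p><a href=''>google &gt</a></p></b>"

# Clean it up; the result is assumed to be a character string
clean <- tidy_html(messy)

# Parse the tidied markup with the HTML parser and pull out the link text
doc <- read_html(clean)
links <- xml_text(xml_find_all(doc, "//a"))
links
```

`read_html()` is used here rather than `read_xml()` so that the XPath query does not need to deal with any namespace that the tidied output may declare.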

### Code of Conduct

Please note that this project is released with a Contributor Code of Conduct.
By participating in this project you agree to abide by its terms.