Tools to Scrape Dynamic Web Content via the ‘HtmlUnit’ Java Library
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 

24 lines
1.1 KiB

#' Tools to Scrape Dynamic Web Content via the 'HtmlUnit' Java Library
#'
#' `HtmlUnit` (<http://htmlunit.sourceforge.net/>) is _a "'GUI'-Less
#' browser for 'Java' programs". It models 'HTML' documents and provides an 'API'
#' that allows one to invoke pages, fill out forms, click links and more just like
#' one does in a "normal" browser. The library has fairly good and constantly
#' improving 'JavaScript' support and is able to work even with quite complex 'AJAX'
#' libraries, simulating 'Chrome', 'Firefox' or 'Internet Explorer' depending on
#' the configuration used. It is typically used for testing purposes or to retrieve
#' information from web sites._
#'
#' Tools are provided to work with this library at a higher level than provided by
#' the exposed 'Java' libraries in the [`htmlunitjars`](https://gitlab.com/hrbrmstr/htmlunitjars)
#' package.
#'
#' - URL: <https://gitlab.com/hrbrmstr/htmlunit>
#' - BugReports: <https://gitlab.com/hrbrmstr/htmlunit/issues>
#'
#' @md
#' @name htmlunit
#' @docType package
#' @author Bob Rudis (bob@@rud.is)
#' @import rvest htmlunitjars rJava xml2
NULL