Tools to Work with the 'Splash' JavaScript Rendering Service in R
Nie możesz wybrać więcej, niż 25 tematów Tematy muszą się zaczynać od litery lub cyfry, mogą zawierać myślniki ('-') i mogą mieć do 35 znaków.

83 wiersze
3.2 KiB

7 lat temu
% Generated by roxygen2: do not edit by hand
% Please edit documentation in R/render-html.R
7 lat temu
\name{render_html}
\alias{render_html}
\title{Return the HTML of the javascript-rendered page.}
\usage{
render_html(splash_obj = splash_local, url, base_url, timeout = 30,
resource_timeout, wait = 0, proxy, js, js_src, filters,
allowed_domains, allowed_content_types, forbidden_content_types,
viewport = "1024x768", images, headers, body, http_method, save_args,
load_args, raw_html = FALSE)
7 lat temu
}
\arguments{
\item{splash_obj}{Object created by a call to \code{\link[=splash]{splash()}}}
7 lat temu
\item{url}{The URL to render (required)}
7 lat temu
\item{base_url}{The base url to render the page with.}
7 lat temu
7 lat temu
\item{timeout}{A timeout (in seconds) for the render (defaults to 30). Without
reconfiguring the startup parameters of the Splash server (not this package)
the maximum allowed value for the timeout is 60 seconds.}
7 lat temu
\item{resource_timeout}{A timeout (in seconds) for individual network requests.}
\item{wait}{Time (in seconds) to wait for updates after page is loaded (defaults to 0).}
7 lat temu
\item{proxy}{Proxy profile name or proxy URL.}
7 lat temu
7 lat temu
\item{js}{Javascript profile name.}
7 lat temu
7 lat temu
\item{js_src}{JavaScript code to be executed in page context.}
7 lat temu
7 lat temu
\item{filters}{Comma-separated list of request filter names.}
7 lat temu
7 lat temu
\item{allowed_domains}{Comma-separated list of allowed domain names. If present, Splash
won’t load anything neither from domains not in this list nor from subdomains of
domains not in this list.}
7 lat temu
7 lat temu
\item{allowed_content_types}{Comma-separated list of allowed content types. If present,
Splash will abort any request if the response’s content type doesn’t match any of
the content types in this list. Wildcards are supported.}
7 lat temu
7 lat temu
\item{forbidden_content_types}{Comma-separated list of forbidden content types. If
present, Splash will abort any request if the response’s content type matches
any of the content types in this list. Wildcards are supported.}
7 lat temu
7 lat temu
\item{viewport}{View width and height (in pixels) of the browser viewport to render the
web page. Format is “<width>x<height>”, e.g. 800x600. Default value is "full".}
7 lat temu
7 lat temu
\item{images}{Whether to download images.}
7 lat temu
7 lat temu
\item{headers}{HTTP headers to set for the first outgoing request.}
7 lat temu
7 lat temu
\item{body}{Body of HTTP POST request to be sent if method is POST.}
7 lat temu
7 lat temu
\item{http_method}{HTTP method of outgoing Splash request.}
7 lat temu
7 lat temu
\item{save_args}{A list of argument names to put in cache.}
7 lat temu
7 lat temu
\item{load_args}{Parameter values to load from cache}
\item{raw_html}{if \code{TRUE} then return a character vector vs an XML document. Only valid for \code{render_html}}
}
\value{
An XML document. Note that this is processed by \code{\link[xml2:read_html]{xml2::read_html()}} so it will not be
7 lat temu
the pristine, raw, rendered HTML from the site. Use \code{raw_html=TRUE} if you do not want it
to be processed first by \code{xml2}. If you choose \code{raw_html=TRUE} you'll get back a
7 lat temu
character vector.
7 lat temu
}
\description{
Similar (i.e. a dynamic equivalent) to \code{rvest::read_html}.
7 lat temu
}
7 lat temu
\references{
\href{http://splash.readthedocs.io/en/stable/index.html}{Splash docs}
}
7 lat temu
\seealso{
Other splash_renderers: \code{\link{execute_lua}},
\code{\link{render_har}}, \code{\link{render_jpeg}},
\code{\link{render_json}}, \code{\link{render_png}}
7 lat temu
}
6 lat temu
\concept{splash_renderers}