Tools to Work with the 'Splash' JavaScript Rendering Service in R
Você não pode selecionar mais de 25 tópicos Os tópicos devem começar com uma letra ou um número, podem incluir traços ('-') e podem ter até 35 caracteres.

83 linhas
3.2 KiB

7 anos atrás
% Generated by roxygen2: do not edit by hand
5 anos atrás
% Please edit documentation in R/render-html.R
7 anos atrás
\name{render_html}
\alias{render_html}
\title{Return the HTML of the javascript-rendered page.}
\usage{
render_html(splash_obj = splash_local, url, base_url, timeout = 30,
resource_timeout, wait = 0, proxy, js, js_src, filters,
allowed_domains, allowed_content_types, forbidden_content_types,
viewport = "1024x768", images, headers, body, http_method, save_args,
load_args, raw_html = FALSE)
7 anos atrás
}
\arguments{
\item{splash_obj}{Object created by a call to \code{\link[=splash]{splash()}}}
7 anos atrás
\item{url}{The URL to render (required)}
7 anos atrás
\item{base_url}{The base url to render the page with.}
7 anos atrás
7 anos atrás
\item{timeout}{A timeout (in seconds) for the render (defaults to 30). Without
reconfiguring the startup parameters of the Splash server (not this package)
the maximum allowed value for the timeout is 60 seconds.}
7 anos atrás
\item{resource_timeout}{A timeout (in seconds) for individual network requests.}
\item{wait}{Time (in seconds) to wait for updates after page is loaded (defaults to 0).}
7 anos atrás
\item{proxy}{Proxy profile name or proxy URL.}
7 anos atrás
7 anos atrás
\item{js}{Javascript profile name.}
7 anos atrás
7 anos atrás
\item{js_src}{JavaScript code to be executed in page context.}
7 anos atrás
7 anos atrás
\item{filters}{Comma-separated list of request filter names.}
7 anos atrás
7 anos atrás
\item{allowed_domains}{Comma-separated list of allowed domain names. If present, Splash
won’t load anything neither from domains not in this list nor from subdomains of
domains not in this list.}
7 anos atrás
7 anos atrás
\item{allowed_content_types}{Comma-separated list of allowed content types. If present,
Splash will abort any request if the response’s content type doesn’t match any of
the content types in this list. Wildcards are supported.}
7 anos atrás
7 anos atrás
\item{forbidden_content_types}{Comma-separated list of forbidden content types. If
present, Splash will abort any request if the response’s content type matches
any of the content types in this list. Wildcards are supported.}
7 anos atrás
7 anos atrás
\item{viewport}{View width and height (in pixels) of the browser viewport to render the
web page. Format is “<width>x<height>”, e.g. 800x600. Default value is "full".}
7 anos atrás
7 anos atrás
\item{images}{Whether to download images.}
7 anos atrás
7 anos atrás
\item{headers}{HTTP headers to set for the first outgoing request.}
7 anos atrás
7 anos atrás
\item{body}{Body of HTTP POST request to be sent if method is POST.}
7 anos atrás
7 anos atrás
\item{http_method}{HTTP method of outgoing Splash request.}
7 anos atrás
7 anos atrás
\item{save_args}{A list of argument names to put in cache.}
7 anos atrás
7 anos atrás
\item{load_args}{Parameter values to load from cache}
\item{raw_html}{if \code{TRUE} then return a character vector vs an XML document. Only valid for \code{render_html}}
}
\value{
An XML document. Note that this is processed by \code{\link[xml2:read_html]{xml2::read_html()}} so it will not be
7 anos atrás
the pristine, raw, rendered HTML from the site. Use \code{raw_html=TRUE} if you do not want it
to be processed first by \code{xml2}. If you choose \code{raw_html=TRUE} you'll get back a
7 anos atrás
character vector.
7 anos atrás
}
\description{
Similar (i.e. a dynamic equivalent) to \code{rvest::read_html}.
7 anos atrás
}
7 anos atrás
\references{
\href{http://splash.readthedocs.io/en/stable/index.html}{Splash docs}
}
7 anos atrás
\seealso{
Other splash_renderers: \code{\link{execute_lua}},
\code{\link{render_har}}, \code{\link{render_jpeg}},
\code{\link{render_json}}, \code{\link{render_png}}
7 anos atrás
}
6 anos atrás
\concept{splash_renderers}