Browse Source

code of conduct

master
boB Rudis 7 years ago
parent
commit
4e92c64ff8
  1. 28
      README.md
  2. BIN
      img/cap.jpg
  3. BIN
      img/cap.png

28
README.md

@ -20,7 +20,7 @@ All you need for this package to work is a running Splash instance. You provide
### About Splash ### About Splash
> 'Splash' <https://github.com/scrapinghub/splash> is a javascript rendering service. It’s a lightweight web browser with an 'HTTP' API, implemented in Python using 'Twistedand 'QT' and provides some of the core functionality of the 'RSelenium' or 'seleniumPipes' R packages but with a Java-free footprint. The (twisted) 'QT' reactor is used to make the sever fully asynchronous allowing to take advantage of 'webkit' concurrency via QT main loop. Some of Splash features include the ability to process multiple webpages in parallel; retrieving HTML results and/or take screenshots; disabling images or use Adblock Plus rules to make rendering faster; executing custom JavaScript in page context; getting detailed rendering info in HAR format. > 'Splash' <https://github.com/scrapinghub/splash> is a javascript rendering service. It’s a lightweight web browser with an 'HTTP' API, implemented in Python using 'Twisted'and 'QT' and provides some of the core functionality of the 'RSelenium' or 'seleniumPipes' R packages but with a Java-free footprint. The (twisted) 'QT' reactor is used to make the sever fully asynchronous allowing to take advantage of 'webkit' concurrency via QT main loop. Some of Splash features include the ability to process multiple webpages in parallel; retrieving HTML results and/or take screenshots; disabling images or use Adblock Plus rules to make rendering faster; executing custom JavaScript in page context; getting detailed rendering info in HAR format.
The following functions are implemented: The following functions are implemented:
@ -73,7 +73,7 @@ splash("splash", 8050L) %>%
splash_active() splash_active()
``` ```
## Status of splash instance on [http://splash:8050]: ok. Max RSS: 397705216 ## Status of splash instance on [http://splash:8050]: ok. Max RSS: 402407424
``` r ``` r
splash("splash", 8050L) %>% splash("splash", 8050L) %>%
@ -83,13 +83,13 @@ splash("splash", 8050L) %>%
## List of 7 ## List of 7
## $ active : list() ## $ active : list()
## $ argcache: int 0 ## $ argcache: int 0
## $ fds : int 18 ## $ fds : int 17
## $ leaks :List of 4 ## $ leaks :List of 4
## ..$ Deferred : int 50 ## ..$ Deferred : int 50
## ..$ LuaRuntime: int 1 ## ..$ LuaRuntime: int 1
## ..$ QTimer : int 1 ## ..$ QTimer : int 1
## ..$ Request : int 1 ## ..$ Request : int 1
## $ maxrss : int 388384 ## $ maxrss : int 392976
## $ qsize : int 0 ## $ qsize : int 0
## $ url : chr "http://splash:8050" ## $ url : chr "http://splash:8050"
## - attr(*, "class")= chr [1:2] "splash_debug" "list" ## - attr(*, "class")= chr [1:2] "splash_debug" "list"
@ -105,7 +105,7 @@ splash("splash", 8050L) %>%
## {xml_document} ## {xml_document}
## <html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en" dir="ltr"> ## <html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en" dir="ltr">
## [1] <head>\n<script src="http://widget-cdn.rpxnow.com/manifest/login?version=1.114.1_widgets_244" type="text/javascri ... ## [1] <head>\n<script src="http://widget-cdn.rpxnow.com/manifest/login?version=1.114.1_widgets_244" type="text/javascri ...
## [2] <body>\n<iframe src="http://tpc.googlesyndication.com/safeframe/1-0-5/html/container.html" style="visibility: hid ... ## [2] <body>\n<iframe src="http://tpc.googlesyndication.com/safeframe/1-0-6/html/container.html" style="visibility: hid ...
``` r ``` r
read_html("http://marvel.com/universe/Captain_America_(Steve_Rogers)") read_html("http://marvel.com/universe/Captain_America_(Steve_Rogers)")
@ -136,10 +136,10 @@ print(har)
## --------HAR PAGES-------- ## --------HAR PAGES--------
## Page id: 1 , Page title: Poynter – A global leader in journalism. Strengthening democracy. ## Page id: 1 , Page title: Poynter – A global leader in journalism. Strengthening democracy.
## --------HAR ENTRIES-------- ## --------HAR ENTRIES--------
## Number of entries: 53 ## Number of entries: 55
## REQUESTS: ## REQUESTS:
## Page: 1 ## Page: 1
## Number of entries: 53 ## Number of entries: 55
## - http://www.poynter.org/ ## - http://www.poynter.org/
## - http://www.poynter.org/wp-content/plugins/easy-author-image/css/easy-author-image.css?ver=2016_06_24.1 ## - http://www.poynter.org/wp-content/plugins/easy-author-image/css/easy-author-image.css?ver=2016_06_24.1
## - http://maxcdn.bootstrapcdn.com/bootstrap/3.3.5/css/bootstrap.min.css?ver=2016_06_24.1 ## - http://maxcdn.bootstrapcdn.com/bootstrap/3.3.5/css/bootstrap.min.css?ver=2016_06_24.1
@ -147,10 +147,10 @@ print(har)
## - http://maxcdn.bootstrapcdn.com/font-awesome/4.4.0/css/font-awesome.min.css?ver=2016_06_24.1 ## - http://maxcdn.bootstrapcdn.com/font-awesome/4.4.0/css/font-awesome.min.css?ver=2016_06_24.1
## ........ ## ........
## - http://ntvcld-a.akamaihd.net/image/upload/w_286,h_161,c_fill,g_auto,f_auto/assets/C6B95A2AECA04462AC9FCD7C9802256... ## - http://ntvcld-a.akamaihd.net/image/upload/w_286,h_161,c_fill,g_auto,f_auto/assets/C6B95A2AECA04462AC9FCD7C9802256...
## - data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAMgAAACgCAYAAABJ/yOpAAAAGXRFWHRTb2Z0d2FyZQBBZG9iZSBJbWFnZVJlYWR5ccl... ## - http://srv-2017-02-05-03.pixel.parsely.com/plogger/?rand=1486264735645&idsite=poynter.org&url=http%3A%2F%2Fwww.po...
## - http://srv-2017-02-05-03.pixel.parsely.com/plogger/?rand=1486264555280&idsite=poynter.org&url=http%3A%2F%2Fwww.po... ## - https://tpc.googlesyndication.com/simgad/15471443418029360623
## - https://tpc.googlesyndication.com/simgad/6842796207574772216 ## - https://securepubads.g.doubleclick.net/pcs/view?xai=AKAOjsu3mzkIuC8SYIGCp5136h6q7AtaZDrZ109tKADwc544iipyqEmWMxVMC...
## - https://securepubads.g.doubleclick.net/pcs/view?xai=AKAOjst8b0fIBntQlYitsxLm3ttuvJblqMbtaye6mD7k4t5HZngXYMdHF5fmc... ## - data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAMgAAACgCAYAAABJ/yOpAAAAGXRFWHRTb2Z0d2FyZQBBZG9iZSBJbWFnZVJlYWR5ccl...
You can use [`HARtools::HARviewer`](https://github.com/johndharrison/HARtools/blob/master/R/HARviewer.R) — which this pkg import/exports — to get view the HAR in an interactive HTML widget. You can use [`HARtools::HARviewer`](https://github.com/johndharrison/HARtools/blob/master/R/HARviewer.R) — which this pkg import/exports — to get view the HAR in an interactive HTML widget.
@ -179,7 +179,7 @@ library(testthat)
date() date()
``` ```
## [1] "Sat Feb 4 22:16:00 2017" ## [1] "Sat Feb 4 22:19:00 2017"
``` r ``` r
test_dir("tests/") test_dir("tests/")
@ -189,3 +189,7 @@ test_dir("tests/")
## OK: 0 SKIPPED: 0 FAILED: 0 ## OK: 0 SKIPPED: 0 FAILED: 0
## ##
## DONE =================================================================================================================== ## DONE ===================================================================================================================
### Code of Conduct
Please note that this project is released with a [Contributor Code of Conduct](CONDUCT.md). By participating in this project you agree to abide by its terms.

BIN
img/cap.jpg

Binary file not shown.

Before

Width:  |  Height:  |  Size: 120 KiB

After

Width:  |  Height:  |  Size: 123 KiB

BIN
img/cap.png

Binary file not shown.

Before

Width:  |  Height:  |  Size: 505 KiB

After

Width:  |  Height:  |  Size: 433 KiB

Loading…
Cancel
Save