Interacting with Web Data using R: Web APIs and Web Scraping (including RSelenium)

Description: 

Presented April 22, 2019.

Scientists who use data to gain insight for their operational or research interests often need to extract data from web pages or APIs from time to time. While this process can be completed manually, it can take orders of magnitude longer to complete without automation/scripting techniques, especially if a task becomes routine and must be executed on a recurring basis. This webinar will demonstrate working with an API from R to extract information from healthdata.gov. We will also demonstrate scraping static web content using the rvest package, and also how to scrape static content by driving a web browser using RSelenium. Real time demos navigating the websites we scrape will be given, and resources for learning how to navigate a website’s structure (document object model, DOM) using CSS and Xpath will be provided.

Presenter

Spencer George Lourens, Indiana University 

Primary Topic Areas: 
Original Publication Year: 
2019
Event/Publication Date: 
April, 2019

April 25, 2019

Contact Us

INTERNATIONAL SOCIETY FOR
DISEASE SURVEILLANCE

288 Grove Street, Box 203
Braintree, MA 02184
(617) 779 0880
Email:syndromic@syndromic.org

This Knowledge Repository is made possible through the activities of the Centers for Disease Control and Prevention Cooperative Agreement/Grant #1 NU500E000098-01, National Surveillance Program Community of Practice (NSSP-CoP): Strengthening Health Surveillance Capabilities Nationwide, which is in the interest of public health.

Site created by Fusani Applications