(警告,新手,慢慢学习R)
嗨,您好,
我正在尝试使用R从网站自动下载数据.该网站使用的是sharepoint,在询问(R download from aspx in https getting website instead of CSV)之后有人向我指出了RSelenium.
我需要的是从这样的地址下载csv文件:
https://www.paoilandgasreporting.state.pa.us/publicreports/Modules/Production/ProductionByCountyExport.aspx?UNCONVENTIONAL_ONLY=false&INC_HOME_USE_WELLS=true&INC_NON_PRODUCING_WELLS=true&PERIOD=15AUGU&COUNTY=ALLEGHENY
但在我需要接受协议之前(我点击“我正在使用RSelenium”)代码在这里:
# Using RSelenium to save file
##Installing the package if needed
install.packages("RSelenium")
##Activating
library("RSelenium")
checkForServer()
startServer()
#I had to start the server manually!
remDr <- remoteDriver()
remDr
remDr$open()
#open website and accepting conditions
remDr$navigate("https://www.paoilandgasreporting.state.pa.us/publicreports/Modules/Welcome/Agreement.aspx")
AgreeButton<-remDr$findElement(using = 'id', value="MainContent_AgreeButton")
AgreeButton$highlightElement()
AgreeButton$clickElement()
remDr$navigate("https://www.paoilandgasreporting.state.pa.us/publicreports/Modules/Production/ProductionByCountyExport.aspx?UNCONVENTIONAL_ONLY=false&INC_HOME_USE_WELLS=true&INC_NON_PRODUCING_WELLS=true&PERIOD=15AUGU&COUNTY=ALLEGHENY")
我的问题是:
我无法在RSelenium中找到“保存链接为”的命令
我想我需要找到这种类型的东西:
CSVurl<-remDr$navigate ("https://www.paoilandgasreporting.state.pa.us/publicreports/Modules/Production/ProductionByCountyExport.aspx?UNCONVENTIONAL_ONLY=false&INC_HOME_USE_WELLS=true&INC_NON_PRODUCING_WELLS=true&PERIOD=15AUGU&COUNTY=ALLEGHENY")remDr$navigate("https://www.paoilandgasreporting.state.pa.us/publicreports/Modules/Production/ProductionByCountyExport.aspx?UNCONVENTIONAL_ONLY=false&INC_HOME_USE_WELLS=true&INC_NON_PRODUCING_WELLS=true&PERIOD=15AUGU&COUNTY=ALLEGHENY")
CSVurl$saveLinkAs(fileName)
这存在吗?
在R中有更好的方法吗?
提前致谢
最佳答案
`# Using RSelenium to save file
##Installing the package if needed
##Activating
library(RSelenium)
checkForServer()
startServer()
#I had to start the server manually!
cprof<-makeFirefoxProfile(list(
"browser.helperApps.neverAsk.saveToDisk"='text/plain, application/vnd.ms-excel, text/csv, text/comma-separated-values, application/octet-stream',
"browser.helperApps.neverAsk.openFile"='text/plain, application/vnd.ms-excel, text/csv, text/comma-separated-values, application/octet-stream'
))
remDr <- remoteDriver(extraCapabilities=cprof)
remDr$open()
#open website and accepting conditions
remDr$navigate("https://www.paoilandgasreporting.state.pa.us/publicreports/Modules/Welcome/Agreement.aspx")
AgreeButton<-remDr$findElement(using = 'id', value="MainContent_AgreeButton")
AgreeButton$highlightElement()
AgreeButton$clickElement()
remDr$navigate("https://www.paoilandgasreporting.state.pa.us/publicreports/Modules/Production/ProductionByCountyExport.aspx?UNCONVENTIONAL_ONLY=false&INC_HOME_USE_WELLS=true&INC_NON_PRODUCING_WELLS=true&PERIOD=15AUGU&COUNTY=ALLEGHENY")`
要访问该文件,您必须搜索firefox的默认下载文件夹.
如果您收到错误消息,指出R无法创建cprof或无法压缩内容,那么您可能需要安装RTools.
从here起
检查您已安装的R的确切版本.
希望这可以帮助.