Anyone web scrape with R? Looking for good resources
Anyone have any good web scrape resources, tutorials, examples for a total illiterate R newbie? Sports related material would be preferred.
What are your thoughts on web scraping with R?
Waterstpub87
SBR MVP
09-09-09
4102
#2
I've never thought of R as being particularly good for webscraping. I see more stuff referencing python.
Comment
A4K
SBR Hall of Famer
10-08-12
5243
#3
Just a word of advice.....
Pay someone on Fiverr.com or post an ad on CraigsList to do the scraping and sorting for you. I did it this way and it cost me $50. The results were far better than I could have managed on my own.
Comment
jtoler
BARRELED IN @ SBR!
12-17-13
30967
#4
R as opposed to S and T? Definitely R.
Comment
xbalto
SBR High Roller
10-14-10
106
#5
R sucks for this, use Beautiful Soup in python and dump to a csv for R
Comment
HeeeHAWWWW
SBR Hall of Famer
06-13-08
5487
#6
Originally posted by A4K
Pay someone on Fiverr.com or post an ad on CraigsList to do the scraping and sorting for you. I did it this way and it cost me $50. The results were far better than I could have managed on my own.
That's probably a really good suggestion. Scraping is a massive pita - get some Indian phd to do it for peanuts.
Comment
matthewmsturgeon
SBR Rookie
08-24-17
11
#7
Originally posted by A4K
Just a word of advice.....
Pay someone on Fiverr.com or post an ad on CraigsList to do the scraping and sorting for you. I did it this way and it cost me $50. The results were far better than I could have managed on my own.
This is intriguing to me. How would you go about finding someone to do this and to describing what you are after. I looked briefly on fiverr, but I didn't want to create an account. Do you just post as request for what you are after and wait for someone to respond. Do you describe the site you want to scrape and indicate that you want it dumped at some interval (or run manually) into a spreadsheet or My SQL DB or something like that? I'm not trying to get you to explain everything that you did, I just feel like this is a concept worthy of more discussion if you have the time. Thanks.
Comment
Bluehorseshoe
SBR Posting Legend
07-13-06
14998
#8
Originally posted by A4K
Just a word of advice.....
Pay someone on Fiverr.com or post an ad on CraigsList to do the scraping and sorting for you. I did it this way and it cost me $50. The results were far better than I could have managed on my own.
I'm looking for someone to scrape sportsbooks for me if anyone is interested.
Comment
Bsims
SBR Wise Guy
02-03-09
827
#9
I still write code in Basic and cannot directly fetch most web pages. For these, I display the page I want and save the page to a file. I then have code that parses the html and builds line files. I then process these.
Another technique that I've used after displaying the page desired is to right click and Select All, then copy to the clipboard. The version of Basic that I use is capable of retrieving this clipboard data. Then I can build the line files I need.
The downside of these approaches is that they are static. It requires my intervention. I cannot simulate an API type application.
Comment
hubie69
SBR Hall of Famer
09-16-10
7329
#10
Originally posted by xbalto
R sucks for this, use Beautiful Soup in python and dump to a csv for R
This. Exactly this. Beautiful Soup makes life super easy. I use BS4 with a bash wrapper and insert it into a mysql DB.