Anyone web scrape with R? Looking for good resources

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • oilcountry99
    SBR Wise Guy
    • 08-29-10
    • 707

    #1
    Anyone web scrape with R? Looking for good resources
    Anyone have any good web scrape resources, tutorials, examples for a total illiterate R newbie? Sports related material would be preferred.

    What are your thoughts on web scraping with R?
  • Waterstpub87
    SBR MVP
    • 09-09-09
    • 4102

    #2
    I've never thought of R as being particularly good for webscraping. I see more stuff referencing python.
    Comment
    • A4K
      SBR Hall of Famer
      • 10-08-12
      • 5243

      #3
      Just a word of advice.....

      Pay someone on Fiverr.com or post an ad on CraigsList to do the scraping and sorting for you. I did it this way and it cost me $50. The results were far better than I could have managed on my own.
      Comment
      • jtoler
        BARRELED IN @ SBR!
        • 12-17-13
        • 30967

        #4
        R as opposed to S and T? Definitely R.
        Comment
        • xbalto
          SBR High Roller
          • 10-14-10
          • 106

          #5
          R sucks for this, use Beautiful Soup in python and dump to a csv for R
          Comment
          • HeeeHAWWWW
            SBR Hall of Famer
            • 06-13-08
            • 5487

            #6
            Originally posted by A4K
            Pay someone on Fiverr.com or post an ad on CraigsList to do the scraping and sorting for you. I did it this way and it cost me $50. The results were far better than I could have managed on my own.
            That's probably a really good suggestion. Scraping is a massive pita - get some Indian phd to do it for peanuts.
            Comment
            • matthewmsturgeon
              SBR Rookie
              • 08-24-17
              • 11

              #7
              Originally posted by A4K
              Just a word of advice.....

              Pay someone on Fiverr.com or post an ad on CraigsList to do the scraping and sorting for you. I did it this way and it cost me $50. The results were far better than I could have managed on my own.

              This is intriguing to me. How would you go about finding someone to do this and to describing what you are after. I looked briefly on fiverr, but I didn't want to create an account. Do you just post as request for what you are after and wait for someone to respond. Do you describe the site you want to scrape and indicate that you want it dumped at some interval (or run manually) into a spreadsheet or My SQL DB or something like that? I'm not trying to get you to explain everything that you did, I just feel like this is a concept worthy of more discussion if you have the time. Thanks.
              Comment
              • Bluehorseshoe
                SBR Posting Legend
                • 07-13-06
                • 14998

                #8
                Originally posted by A4K
                Just a word of advice.....

                Pay someone on Fiverr.com or post an ad on CraigsList to do the scraping and sorting for you. I did it this way and it cost me $50. The results were far better than I could have managed on my own.
                I'm looking for someone to scrape sportsbooks for me if anyone is interested.
                Comment
                • Bsims
                  SBR Wise Guy
                  • 02-03-09
                  • 827

                  #9
                  I still write code in Basic and cannot directly fetch most web pages. For these, I display the page I want and save the page to a file. I then have code that parses the html and builds line files. I then process these.

                  Another technique that I've used after displaying the page desired is to right click and Select All, then copy to the clipboard. The version of Basic that I use is capable of retrieving this clipboard data. Then I can build the line files I need.

                  The downside of these approaches is that they are static. It requires my intervention. I cannot simulate an API type application.
                  Comment
                  • hubie69
                    SBR Hall of Famer
                    • 09-16-10
                    • 7329

                    #10
                    Originally posted by xbalto
                    R sucks for this, use Beautiful Soup in python and dump to a csv for R
                    This. Exactly this. Beautiful Soup makes life super easy. I use BS4 with a bash wrapper and insert it into a mysql DB.
                    Comment
                    SBR Contests
                    Collapse
                    Top-Rated US Sportsbooks
                    Collapse
                    Working...