So with more and more news suggesting that GameFAQs is about to get fully Fandom’d, I figured I would grab some of the more useful guides for when I want to play a “retro” game or just 100% a LAD. From some quick research, it looks like the txt guides are more than covered, but the HTML ones are still kind of in a void.

So before I write my own set of scripts, I figured I would check if I am just not aware of something that already does this.

  • vildis@lemmy.dbzer0.com
    1 year ago

    For simple HTML websites using wget -p -k example.com should work.

    For more complex sites, maybe try ArchiveBox.

    • Puzzle_Sluts_4Ever@lemmy.worldOP
      1 year ago

      A few years back, CBS sold CNET (CNET, GameSpot, GameFAQs, Giant Bomb, and probably one or two others) to Red Ventures, who then sold basically everything but CNET proper to Fandom. If people aren’t aware of what Fandom is, just go to basically any video game wiki and see how many pop-ups you need to close just to see some misinformation.

      Anywho, Fandom have a decent record of killing every property they buy in the interest of monetization. And they can do that because they buy EVERYTHING. And as of a few days ago, one of the longstanding admins at GameFAQs announced they were stepping down. Which… suggests Fandom realized they own GameFAQs and are likely about to start gutting it to add as many ads and autoplay Twitch pages as possible.

    • Puzzle_Sluts_4Ever@lemmy.worldOP
      1 year ago

      Looked at that. Seems like it would have worked back when ?single=1 didn’t break all images. But since the guides are broken up into multiple pages, the automated scraping tends to lose its mind because it will try to get the entire site rather than a subset of pages.

      Thanks though
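
For anyone who does end up writing their own scripts, the "subset of pages" problem above mostly comes down to filtering links before following them. Below is a minimal sketch of that filter using only the Python standard library. It assumes every page of a guide lives under one common URL prefix, which is a guess about the site layout and not something confirmed here. The names LinkCollector and guide_links and the example.com URLs are made up for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlsplit


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def guide_links(page_url, html, guide_prefix):
    """Return absolute, de-duplicated links in `html` that stay under `guide_prefix`.

    `guide_prefix` is a hypothetical "root" URL for one guide; links to
    other hosts or to other parts of the same site are dropped.
    """
    parser = LinkCollector()
    parser.feed(html)

    prefix = urlsplit(guide_prefix)
    prefix_path = prefix.path.rstrip("/")

    kept = []
    for href in parser.hrefs:
        # Resolve relative links against the page they were found on,
        # then drop any #fragment so the same page is not listed twice.
        absolute = urldefrag(urljoin(page_url, href)).url
        parts = urlsplit(absolute)
        same_host = parts.netloc == prefix.netloc
        inside_guide = (
            parts.path == prefix_path
            or parts.path.startswith(prefix_path + "/")
        )
        if same_host and inside_guide and absolute not in kept:
            kept.append(absolute)
    return kept
```

A crawler would call guide_links on each page it downloads and only queue the URLs it returns, so it never wanders off into the rest of the site. Fetching the pages and saving images are left out of this sketch.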

  • SkullHex2@lemmy.ml
    1 year ago

    I only know of HTTrack. It’s fairly old and needs some tweaks to work. Let me know if you find anything else.