[00:05] *** drumstick has quit IRC (Ping timeout: 255 seconds) [00:06] *** drumstick has joined #archiveteam-bs [00:11] *** drumstick has quit IRC (Ping timeout: 255 seconds) [00:16] *** drumstick has joined #archiveteam-bs [00:22] *** C4K3 has quit IRC (leaving) [00:22] *** C4K3 has joined #archiveteam-bs [00:36] *** BlueMaxim has joined #archiveteam-bs [00:38] Are there any tools out there that can use info in json files to archive things? [00:46] *** zhongfu has joined #archiveteam-bs [00:48] schbirid, https://twitter.com/Doctor_Cupcakes/status/921876712631230464 [00:51] JAA, NeoGAF needs to be added to archivebot if it comes back up, right now it's 'Our apologies for the temporary inconvenience. NeoGAF is currently down for scheduled maintenance. Please be patient while the site is down.' So I imagine they are scrubbing it :/ [01:11] *** schbirid has quit IRC (Ping timeout: 255 seconds) [01:22] *** schbirid has joined #archiveteam-bs [01:42] hook54321: apparently you can, but it's only the latest version that's supported now [03:14] *** qw3rty14 has joined #archiveteam-bs [03:19] *** qw3rty13 has quit IRC (Ping timeout: 600 seconds) [03:32] *** Soni has quit IRC (Ping timeout: 264 seconds) [03:44] *** drumstick has quit IRC (Read error: Operation timed out) [03:45] *** drumstick has joined #archiveteam-bs [03:54] odemg: NeoGAF is several magnitudes of order bigger than SPUF [03:54] warrior is going to be absolutely necessary to get all of it. [04:02] *** pizzaiolo has quit IRC (Remote host closed the connection) [04:02] wp494, how large is it? [04:03] their numbers say 120M posts spread across 832K threads [04:04] SPUF had 13.8M posts across ~1.3M threads [04:05] btw I could've sworn wikipedia had a list of largest vbulletin forums, did they toss it [04:07] *** Sk1d has quit IRC (Ping timeout: 186 seconds) [04:07] *** Mateon1 has quit IRC (Ping timeout: 250 seconds) [04:13] *** Sk1d has joined #archiveteam-bs [04:27] *** BlueMaxim has quit IRC (Quit: Leaving) [04:40] *** ScruffyB has joined #archiveteam-bs [04:41] *** Stilett0 has joined #archiveteam-bs [05:51] *** fie has quit IRC (Ping timeout: 246 seconds) [05:51] *** BlueMaxim has joined #archiveteam-bs [05:59] *** Mateon1 has joined #archiveteam-bs [06:34] *** ZexaronS has quit IRC (Ping timeout: 255 seconds) [07:19] *** midas has quit IRC (Read error: Operation timed out) [07:20] *** midas has joined #archiveteam-bs [08:03] *** Stilett0 has quit IRC () [08:04] *** ZexaronS has joined #archiveteam-bs [08:12] *** ZexaronS- has joined #archiveteam-bs [08:15] *** ZexaronS has quit IRC (Ping timeout: 260 seconds) [08:44] *** ZexaronS- has quit IRC (Quit: Leaving) [08:52] *** ZexaronS has joined #archiveteam-bs [09:03] *** jtn2 has quit IRC (Ping timeout: 250 seconds) [09:03] *** jtn2 has joined #archiveteam-bs [10:32] *** BlueMaxim has quit IRC (Read error: Connection reset by peer) [11:01] *** Soni has joined #archiveteam-bs [11:33] *** vitzli has joined #archiveteam-bs [11:35] *** pizzaiolo has joined #archiveteam-bs [11:47] *** drumstick has quit IRC (Read error: Operation timed out) [12:29] wired is so full of infinite url traps i am giving up on my mirror [12:30] 2G log for a 500MB warc.gz, yup [15:40] *** Stilett0 has joined #archiveteam-bs [15:49] dashcloud, wp494: https://twitter.com/CatTheUndying/status/921848303138037761 [15:50] Also, try going to the neogaf.com [15:50] "Our apologies for the temporary inconvenience. NeoGAF is currently down for scheduled maintenance. Please be patient while the site is down." [15:54] Even mail.neogaf.com won't load, I'm guessing the site is dead. [15:55] This is useful. https://twitter.com/NeoGAFNewThread [17:23] if someone wants to continue a wired.com wpull of "Total disk usage: 362.9GiB Apparent size: 357.6GiB Items: 5244140", shout within the next 2 hours. it is a horrible mess of redundant url sinkholes so i stopped [17:25] such as "14.5GiB /google_internet_balloons", "10.7GiB /stories-about-girls-part-2" or "10.6GiB /westeroscraft-game-thrones-minecraft" [17:38] *** vitzli has quit IRC (Quit: Leaving) [17:59] schbirid: please hold [17:59] joepie91: https://www.youtube.com/watch?v=6g4dkBF5anU [18:01] schbirid: hehe, exactly [18:01] schbirid: actually, let me PM [18:01] PerMission granted [18:17] *** jschwart has joined #archiveteam-bs [18:23] joepie91: I'm moving from Amersfoort to near Eindhoven, is any of that close to you? [18:23] jschwart: Did you want to upload your CDs to the Internet Archive, or were you planning to send them off to someone to handle for you? [18:24] *** wabu has quit IRC (Read error: Operation timed out) [18:25] *** odemg has quit IRC (Read error: Operation timed out) [18:27] dashcloud: it will probably be easier if someone takes them over [18:28] otherwise I will probably have to throw them away at some point [18:29] if you want them to be available immediately, but don't want to store them, you can always scan+upload them, then donate the CDs to a local thrift store (otherwise, you can just pack up everything and send it to the Internet Archive) [18:30] jschwart: I'm in Dordrecht [18:30] but yeah, if international shipping is a possibility, then that's probably preferable as SketchCow is currently better equipped to handle this than I am :P [18:32] alright, I am still sorting the discs now [18:33] maybe it would be useful if I try to make some kind of list of the discs? [18:33] I do not have a scanner myself here [18:33] that's always a good idea, even if just to make sure nothing gets lost in transit [18:47] *** odemg has joined #archiveteam-bs [19:11] jschwart: About how many disks do you have? [20:02] Somebody2: around 50 I guess [20:03] could be >100 though, I'm not really sure [20:04] dutch versions of games it seems and it seems promotional discs were populair when I was in high school [20:04] *** C4K3 has quit IRC (leaving) [20:05] *** C4K3 has joined #archiveteam-bs [20:13] *** jschwart has quit IRC (Quit: Konversation terminated!) [20:14] *** icedice has joined #archiveteam-bs [20:34] *** fie has joined #archiveteam-bs [20:58] *** ZexaronS- has joined #archiveteam-bs [21:00] *** schbirid has quit IRC (Quit: Leaving) [21:01] *** ZexaronS has quit IRC (Ping timeout: 260 seconds) [21:12] *** Aerochrom has joined #archiveteam-bs [21:52] *** kristian_ has joined #archiveteam-bs [21:55] *** ZexaronS- has quit IRC (Quit: Leaving) [22:05] *** yuitimoth has quit IRC (Remote host closed the connection) [22:05] *** yuitimoth has joined #archiveteam-bs [22:32] *** drumstick has joined #archiveteam-bs [22:36] *** Stilett0 has quit IRC (Ping timeout: 260 seconds) [22:53] *** ZexaronS has joined #archiveteam-bs [23:00] *** kristian_ has quit IRC (Quit: Leaving) [23:09] Anyone around to give me a hand with the tracker please? Trying to requeue some NewsGrabber items and I'm just getting an Internal Server Error [23:15] *** BlueMaxim has joined #archiveteam-bs [23:40] Aerochrom: if you wanted to archive sites manually, small things can be thrown into the #archivebot channel, and you can have them archived there. If you have a larger site or want to do it yourself, wpull is generally the recommended tool now- it creates archives using the WARC format, which is what the Internet Archive uses behind the Wayback Machine. [23:49] *** Stilett0 has joined #archiveteam-bs