#archiveteam-bs 2017-10-22,Sun

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)

WhoWhatWhen
***drumstick has quit IRC (Ping timeout: 255 seconds)
drumstick has joined #archiveteam-bs
[00:05]
drumstick has quit IRC (Ping timeout: 255 seconds) [00:11]
drumstick has joined #archiveteam-bs [00:16]
C4K3 has quit IRC (leaving)
C4K3 has joined #archiveteam-bs
[00:22]
BlueMaxim has joined #archiveteam-bs [00:36]
mundusAre there any tools out there that can use info in json files to archive things? [00:38]
***zhongfu has joined #archiveteam-bs [00:46]
odemgschbirid, https://twitter.com/Doctor_Cupcakes/status/921876712631230464
JAA, NeoGAF needs to be added to archivebot if it comes back up, right now it's 'Our apologies for the temporary inconvenience. NeoGAF is currently down for scheduled maintenance. Please be patient while the site is down.' So I imagine they are scrubbing it :/
[00:48]
..... (idle for 20mn)
***schbirid has quit IRC (Ping timeout: 255 seconds) [01:11]
schbirid has joined #archiveteam-bs [01:22]
..... (idle for 20mn)
dashcloudhook54321: apparently you can, but it's only the latest version that's supported now [01:42]
................... (idle for 1h32mn)
***qw3rty14 has joined #archiveteam-bs [03:14]
qw3rty13 has quit IRC (Ping timeout: 600 seconds) [03:19]
Soni has quit IRC (Ping timeout: 264 seconds) [03:32]
drumstick has quit IRC (Read error: Operation timed out)
drumstick has joined #archiveteam-bs
[03:44]
wp494odemg: NeoGAF is several magnitudes of order bigger than SPUF
warrior is going to be absolutely necessary to get all of it.
[03:54]
***pizzaiolo has quit IRC (Remote host closed the connection) [04:02]
odemgwp494, how large is it? [04:02]
wp494their numbers say 120M posts spread across 832K threads
SPUF had 13.8M posts across ~1.3M threads
btw I could've sworn wikipedia had a list of largest vbulletin forums, did they toss it
[04:03]
***Sk1d has quit IRC (Ping timeout: 186 seconds)
Mateon1 has quit IRC (Ping timeout: 250 seconds)
[04:07]
Sk1d has joined #archiveteam-bs [04:13]
BlueMaxim has quit IRC (Quit: Leaving) [04:27]
ScruffyB has joined #archiveteam-bs
Stilett0 has joined #archiveteam-bs
[04:40]
............... (idle for 1h10mn)
fie has quit IRC (Ping timeout: 246 seconds)
BlueMaxim has joined #archiveteam-bs
[05:51]
Mateon1 has joined #archiveteam-bs [05:59]
........ (idle for 35mn)
ZexaronS has quit IRC (Ping timeout: 255 seconds) [06:34]
.......... (idle for 45mn)
midas has quit IRC (Read error: Operation timed out)
midas has joined #archiveteam-bs
[07:19]
......... (idle for 43mn)
Stilett0 has quit IRC ()
ZexaronS has joined #archiveteam-bs
[08:03]
ZexaronS- has joined #archiveteam-bs
ZexaronS has quit IRC (Ping timeout: 260 seconds)
[08:12]
...... (idle for 29mn)
ZexaronS- has quit IRC (Quit: Leaving) [08:44]
ZexaronS has joined #archiveteam-bs [08:52]
jtn2 has quit IRC (Ping timeout: 250 seconds)
jtn2 has joined #archiveteam-bs
[09:03]
.................. (idle for 1h29mn)
BlueMaxim has quit IRC (Read error: Connection reset by peer) [10:32]
...... (idle for 29mn)
Soni has joined #archiveteam-bs [11:01]
....... (idle for 32mn)
vitzli has joined #archiveteam-bs
pizzaiolo has joined #archiveteam-bs
[11:33]
drumstick has quit IRC (Read error: Operation timed out) [11:47]
......... (idle for 42mn)
schbiridwired is so full of infinite url traps i am giving up on my mirror
2G log for a 500MB warc.gz, yup
[12:29]
....................................... (idle for 3h10mn)
***Stilett0 has joined #archiveteam-bs [15:40]
hook54321dashcloud, wp494: https://twitter.com/CatTheUndying/status/921848303138037761
Also, try going to the neogaf.com
"Our apologies for the temporary inconvenience. NeoGAF is currently down for scheduled maintenance. Please be patient while the site is down."
Even mail.neogaf.com won't load, I'm guessing the site is dead.
This is useful. https://twitter.com/NeoGAFNewThread
[15:49]
.................. (idle for 1h28mn)
schbiridif someone wants to continue a wired.com wpull of "Total disk usage: 362.9GiB Apparent size: 357.6GiB Items: 5244140", shout within the next 2 hours. it is a horrible mess of redundant url sinkholes so i stopped
such as "14.5GiB /google_internet_balloons", "10.7GiB /stories-about-girls-part-2" or "10.6GiB /westeroscraft-game-thrones-minecraft"
[17:23]
***vitzli has quit IRC (Quit: Leaving) [17:38]
..... (idle for 21mn)
joepie91schbirid: please hold [17:59]
schbiridjoepie91: https://www.youtube.com/watch?v=6g4dkBF5anU [17:59]
joepie91schbirid: hehe, exactly
schbirid: actually, let me PM
[18:01]
schbiridPerMission granted [18:01]
.... (idle for 16mn)
***jschwart has joined #archiveteam-bs [18:17]
jschwartjoepie91: I'm moving from Amersfoort to near Eindhoven, is any of that close to you? [18:23]
dashcloudjschwart: Did you want to upload your CDs to the Internet Archive, or were you planning to send them off to someone to handle for you? [18:23]
***wabu has quit IRC (Read error: Operation timed out)
odemg has quit IRC (Read error: Operation timed out)
[18:24]
jschwartdashcloud: it will probably be easier if someone takes them over
otherwise I will probably have to throw them away at some point
[18:27]
dashcloudif you want them to be available immediately, but don't want to store them, you can always scan+upload them, then donate the CDs to a local thrift store (otherwise, you can just pack up everything and send it to the Internet Archive) [18:29]
joepie91jschwart: I'm in Dordrecht
but yeah, if international shipping is a possibility, then that's probably preferable as SketchCow is currently better equipped to handle this than I am :P
[18:30]
jschwartalright, I am still sorting the discs now
maybe it would be useful if I try to make some kind of list of the discs?
I do not have a scanner myself here
[18:32]
joepie91that's always a good idea, even if just to make sure nothing gets lost in transit [18:33]
***odemg has joined #archiveteam-bs [18:47]
..... (idle for 24mn)
Somebody2jschwart: About how many disks do you have? [19:11]
........... (idle for 51mn)
jschwartSomebody2: around 50 I guess
could be >100 though, I'm not really sure
dutch versions of games it seems and it seems promotional discs were populair when I was in high school
[20:02]
***C4K3 has quit IRC (leaving)
C4K3 has joined #archiveteam-bs
[20:04]
jschwart has quit IRC (Quit: Konversation terminated!)
icedice has joined #archiveteam-bs
[20:13]
..... (idle for 20mn)
fie has joined #archiveteam-bs [20:34]
..... (idle for 24mn)
ZexaronS- has joined #archiveteam-bs
schbirid has quit IRC (Quit: Leaving)
ZexaronS has quit IRC (Ping timeout: 260 seconds)
[20:58]
Aerochrom has joined #archiveteam-bs [21:12]
......... (idle for 40mn)
kristian_ has joined #archiveteam-bs
ZexaronS- has quit IRC (Quit: Leaving)
[21:52]
yuitimoth has quit IRC (Remote host closed the connection)
yuitimoth has joined #archiveteam-bs
[22:05]
...... (idle for 27mn)
drumstick has joined #archiveteam-bs
Stilett0 has quit IRC (Ping timeout: 260 seconds)
[22:32]
.... (idle for 17mn)
ZexaronS has joined #archiveteam-bs [22:53]
kristian_ has quit IRC (Quit: Leaving) [23:00]
HCross2Anyone around to give me a hand with the tracker please? Trying to requeue some NewsGrabber items and I'm just getting an Internal Server Error [23:09]
***BlueMaxim has joined #archiveteam-bs [23:15]
...... (idle for 25mn)
dashcloudAerochrom: if you wanted to archive sites manually, small things can be thrown into the #archivebot channel, and you can have them archived there. If you have a larger site or want to do it yourself, wpull is generally the recommended tool now- it creates archives using the WARC format, which is what the Internet Archive uses behind the Wayback Machine. [23:40]
***Stilett0 has joined #archiveteam-bs [23:49]

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)