[00:02] *** Mateon1 has quit IRC (Read error: Operation timed out) [00:02] *** Mateon1 has joined #archiveteam-bs [00:33] *** C4K3_ has joined #archiveteam-bs [00:33] *** C4K3 has quit IRC (Read error: Connection reset by peer) [00:38] *** Fletcher has quit IRC (Remote host closed the connection) [00:46] *** dashcloud has quit IRC (Read error: Operation timed out) [00:55] *** dashcloud has joined #archiveteam-bs [01:04] *** underscor has quit IRC (No Ping reply in 180 seconds.) [01:04] *** atomicthu has quit IRC (No Ping reply in 180 seconds.) [01:05] *** underscor has joined #archiveteam-bs [01:09] *** nyany has quit IRC (Remote host closed the connection) [01:10] *** nyany has joined #archiveteam-bs [01:21] *** atomicthu has joined #archiveteam-bs [01:55] *** Famicoman has quit IRC (Remote host closed the connection) [01:57] *** Famicoman has joined #archiveteam-bs [01:59] *** nyany has quit IRC (Remote host closed the connection) [01:59] *** nyany has joined #archiveteam-bs [02:10] *** qw3rty114 has quit IRC (Ping timeout: 600 seconds) [02:31] *** zyphlar_ has joined #archiveteam-bs [02:33] *** qw3rty114 has joined #archiveteam-bs [02:41] *** godane has quit IRC (Ping timeout: 252 seconds) [02:55] *** godane has joined #archiveteam-bs [03:05] bmcginty: afaik no (?), but we will get on it [03:32] *** wp494_ has joined #archiveteam-bs [03:37] *** wp494 has quit IRC (Read error: Operation timed out) [03:48] *** qw3rty115 has joined #archiveteam-bs [03:52] *** qw3rty114 has quit IRC (Read error: Operation timed out) [04:08] *** nyaomi has quit IRC (Ping timeout: 268 seconds) [04:10] *** odemg has quit IRC (Read error: Operation timed out) [04:15] *** odemg has joined #archiveteam-bs [04:41] *** zyphlar_ has quit IRC (Quit: Connection closed for inactivity) [04:42] *** wp494_ is now known as wp494 [04:45] *** nyaomi has joined #archiveteam-bs [04:54] *** nyaomi has quit IRC (Ping timeout: 244 seconds) [05:08] *** nyaomi has joined #archiveteam-bs [05:32] *** Lord_Nigh has quit IRC (Read error: Operation timed out) [05:32] *** dashcloud has quit IRC (Read error: Operation timed out) [05:35] *** dashcloud has joined #archiveteam-bs [05:36] *** Lord_Nigh has joined #archiveteam-bs [05:38] *** Lord_Nigh has quit IRC (Read error: Operation timed out) [05:41] *** nyaomi has quit IRC (Read error: Connection reset by peer) [05:44] *** nyaomi has joined #archiveteam-bs [05:46] *** Lord_Nigh has joined #archiveteam-bs [06:02] *** dashcloud has quit IRC (Read error: Operation timed out) [06:04] *** dashcloud has joined #archiveteam-bs [06:47] *** Mayonaise has quit IRC (Read error: Operation timed out) [06:58] *** Mayonaise has joined #archiveteam-bs [08:06] SketchCow: tons of cds isos in community texts: https://archive.org/details/@claunia [09:03] *** MrRadar_ has joined #archiveteam-bs [09:04] *** MrRadar has quit IRC (Read error: Operation timed out) [10:01] *** SilSte has quit IRC (Remote host closed the connection) [10:02] *** SilSte has joined #archiveteam-bs [10:07] *** RichardG has quit IRC (Ping timeout: 268 seconds) [10:11] *** dashcloud has quit IRC (Ping timeout: 260 seconds) [10:13] *** dashcloud has joined #archiveteam-bs [11:32] *** BlueMax has quit IRC (Read error: Connection reset by peer) [12:00] Sanqui: Want me to archive their patreon? [12:03] *** godane has quit IRC (Read error: Operation timed out) [12:12] eientei95: With log-in access? Please do [12:19] Sanqui: Would a JSON export of the posts be good enough? COntains links to the images and attachments [12:20] Can those be accessed without a 403? [12:22] I believe so, let me check [12:23] Hm. nope [12:24] Then it's up to you to archive the whole thing [13:05] *** fie has quit IRC (Leaving) [13:09] *** Lord_Nigh has quit IRC (Read error: Operation timed out) [13:11] *** Lord_Nigh has joined #archiveteam-bs [13:37] *** RichardG has joined #archiveteam-bs [13:40] *** Lord_Nigh has quit IRC (Ping timeout: 268 seconds) [13:47] *** Lord_Nigh has joined #archiveteam-bs [14:43] What's the best program to make a image of a CD containing Mac OS 9-era files? [14:43] Ideally cross platform - or suggestions for both Windows and Mac [14:44] (I'm in contact with someone who has a CD of Mac files, but I don't know their setup). [15:37] *** Darkstar has quit IRC (Ping timeout: 506 seconds) [15:42] *** Darkstar has joined #archiveteam-bs [15:43] *** Boppen has quit IRC (Quit: Nettalk6 - www.ntalk.de) [15:59] *** lindalap_ has joined #archiveteam-bs [15:59] *** lindalap has quit IRC (Read error: Connection reset by peer) [16:00] *** lindalap_ is now known as lindalap [16:10] *** schbirid has joined #archiveteam-bs [16:13] *** C4K3_ has quit IRC (leaving) [16:14] *** C4K3 has joined #archiveteam-bs [16:22] *** SilSte has quit IRC (Ping timeout: 633 seconds) [16:22] *** Fusl has quit IRC (K-Lined) [16:23] *** Fusl has joined #archiveteam-bs [16:32] So, trying to grab the GIGA forums. I can't get any faster than ~150 threads per minute from one machine for some reason, even at very high concurrency (it's not CPU-limited, the process is only using 50-60 % of one core). And that's not fast enough; it'd take 6 full days to scan all ~1.3 million thread IDs, and that would only cover the first page of each thread. I'll try to split it up into multiple [16:33] betamax: bin/cue, and then iso [16:33] parallel processes to see if that is any faster. [16:33] (If we end up doing a warrior project, this is obviously obsolete.) [16:34] why bother scanning? just -m -reject and go [16:36] They don't list all threads in the indices, it seems. [16:36] For example, the frontpage says there are 1684 threads in "GIGA.DE allgemein", but when you go into those forums, it only shows three threads. [16:36] &daysprune=-1 [16:37] Anzeige-Eigenschaften -> Alter -> von Anfang an [16:37] Yeah, right. [16:37] Hmm [16:38] *** atrocity has quit IRC (Read error: Operation timed out) [16:40] Setting up the ignores gets more complex though with a real recursive grab. [16:40] With the scanning strategy, I can just let it grab only showthread.php and the actual thread links and don't need to worry about infinite tunnels etc. [16:42] I might grab the indices like that though for browsability. [16:59] astrid: I've found that the person has a PC, what's the best (free) Windows software for making bin / cue files [17:00] (I tried ImgBurn, but that doesn't recognise the disc) [17:00] i haven't worked with ripping discs much so i can't give you any useful pointers, sorry [17:01] i might suggest http://cue.tools/wiki/CUETools [17:19] *** atrocity has joined #archiveteam-bs [17:31] betamax: https://github.com/saramibreak/DiscImageCreator [17:43] *** bithippo has joined #archiveteam-bs [17:57] As expected, I get much higher throughput with parallel processes (at the cost of retrieving some images, stylesheets etc. several times). Six processes give me ~750 threads per minute. Let's see how long it takes until GIGA bans me. [18:03] nice [18:07] Oh, just realised that I'll probably grab some threads multiple times. Hmm. [18:08] In the worst case, I might actually be grabbing *everything* multiple times. [18:12] *** Darkstar has quit IRC (Ping timeout: 480 seconds) [18:20] *** DFJustin has quit IRC (Ping timeout: 260 seconds) [18:23] *** Darkstar has joined #archiveteam-bs [18:37] *** DFJustin has joined #archiveteam-bs [19:24] *** bithippo has quit IRC (Textual IRC Client: www.textualapp.com) [19:46] *** jschwart has joined #archiveteam-bs [19:52] *** Pixi has quit IRC (Ping timeout: 255 seconds) [20:02] *** Pixi has joined #archiveteam-bs [20:13] *** SilSte has joined #archiveteam-bs [20:24] Ew, the wiki doesn't support SVG. :-( [20:31] *** schbirid has quit IRC (Quit: Leaving) [20:41] Still doing almost exactly 750 threads per minute (749.3 over the past half hour). :-) [20:42] *** jschwart has quit IRC (Quit: Konversation terminated!) [20:45] I'm quite surprised that they haven't banned me yet, to be honest. 120 concurrent connections from a single IP... [20:55] JAA: did you find any limits? [20:57] arkiver: Nada. Only my server. [20:58] *** dashcloud has quit IRC (Ping timeout: 252 seconds) [21:02] *** dashcloud has joined #archiveteam-bs [21:22] JAA: awesome, sounds good [21:47] Found a broken thread though: http://forum.giga.de/showthread.php?t=104517 [21:47] The only error so far. No timeouts, connection closures, whatever. [21:50] *** BlueMax has joined #archiveteam-bs [21:59] *** Mateon1 has quit IRC (Ping timeout: 268 seconds) [21:59] *** Mateon1 has joined #archiveteam-bs [22:11] *** godane has joined #archiveteam-bs [22:31] *** Atom has quit IRC (Read error: Connection reset by peer) [22:39] *** m007a83_ has joined #archiveteam-bs [22:41] *** m007a83 has quit IRC (Ping timeout: 252 seconds) [22:53] *** Mayonaise has quit IRC (Read error: Operation timed out) [22:58] *** Mayonaise has joined #archiveteam-bs [23:02] *** RichardG has quit IRC (Read error: Connection reset by peer) [23:04] *** RichardG has joined #archiveteam-bs [23:29] SketchCow: any update with Mank? [23:30] *** dashcloud has quit IRC (Read error: Operation timed out) [23:30] *** dashcloud has joined #archiveteam-bs [23:52] *** Atom has joined #archiveteam-bs