#archiveteam-bs 2018-04-11,Wed

↑back Search

Time Nickname Message
00:02 🔗 Mateon1 has quit IRC (Read error: Operation timed out)
00:02 🔗 Mateon1 has joined #archiveteam-bs
00:33 🔗 C4K3_ has joined #archiveteam-bs
00:33 🔗 C4K3 has quit IRC (Read error: Connection reset by peer)
00:38 🔗 Fletcher has quit IRC (Remote host closed the connection)
00:46 🔗 dashcloud has quit IRC (Read error: Operation timed out)
00:55 🔗 dashcloud has joined #archiveteam-bs
01:04 🔗 underscor has quit IRC (No Ping reply in 180 seconds.)
01:04 🔗 atomicthu has quit IRC (No Ping reply in 180 seconds.)
01:05 🔗 underscor has joined #archiveteam-bs
01:09 🔗 nyany has quit IRC (Remote host closed the connection)
01:10 🔗 nyany has joined #archiveteam-bs
01:21 🔗 atomicthu has joined #archiveteam-bs
01:55 🔗 Famicoman has quit IRC (Remote host closed the connection)
01:57 🔗 Famicoman has joined #archiveteam-bs
01:59 🔗 nyany has quit IRC (Remote host closed the connection)
01:59 🔗 nyany has joined #archiveteam-bs
02:10 🔗 qw3rty114 has quit IRC (Ping timeout: 600 seconds)
02:31 🔗 zyphlar_ has joined #archiveteam-bs
02:33 🔗 qw3rty114 has joined #archiveteam-bs
02:41 🔗 godane has quit IRC (Ping timeout: 252 seconds)
02:55 🔗 godane has joined #archiveteam-bs
03:05 🔗 arkiver bmcginty: afaik no (?), but we will get on it
03:32 🔗 wp494_ has joined #archiveteam-bs
03:37 🔗 wp494 has quit IRC (Read error: Operation timed out)
03:48 🔗 qw3rty115 has joined #archiveteam-bs
03:52 🔗 qw3rty114 has quit IRC (Read error: Operation timed out)
04:08 🔗 nyaomi has quit IRC (Ping timeout: 268 seconds)
04:10 🔗 odemg has quit IRC (Read error: Operation timed out)
04:15 🔗 odemg has joined #archiveteam-bs
04:41 🔗 zyphlar_ has quit IRC (Quit: Connection closed for inactivity)
04:42 🔗 wp494_ is now known as wp494
04:45 🔗 nyaomi has joined #archiveteam-bs
04:54 🔗 nyaomi has quit IRC (Ping timeout: 244 seconds)
05:08 🔗 nyaomi has joined #archiveteam-bs
05:32 🔗 Lord_Nigh has quit IRC (Read error: Operation timed out)
05:32 🔗 dashcloud has quit IRC (Read error: Operation timed out)
05:35 🔗 dashcloud has joined #archiveteam-bs
05:36 🔗 Lord_Nigh has joined #archiveteam-bs
05:38 🔗 Lord_Nigh has quit IRC (Read error: Operation timed out)
05:41 🔗 nyaomi has quit IRC (Read error: Connection reset by peer)
05:44 🔗 nyaomi has joined #archiveteam-bs
05:46 🔗 Lord_Nigh has joined #archiveteam-bs
06:02 🔗 dashcloud has quit IRC (Read error: Operation timed out)
06:04 🔗 dashcloud has joined #archiveteam-bs
06:47 🔗 Mayonaise has quit IRC (Read error: Operation timed out)
06:58 🔗 Mayonaise has joined #archiveteam-bs
08:06 🔗 godane SketchCow: tons of cds isos in community texts: https://archive.org/details/@claunia
09:03 🔗 MrRadar_ has joined #archiveteam-bs
09:04 🔗 MrRadar has quit IRC (Read error: Operation timed out)
10:01 🔗 SilSte has quit IRC (Remote host closed the connection)
10:02 🔗 SilSte has joined #archiveteam-bs
10:07 🔗 RichardG has quit IRC (Ping timeout: 268 seconds)
10:11 🔗 dashcloud has quit IRC (Ping timeout: 260 seconds)
10:13 🔗 dashcloud has joined #archiveteam-bs
11:32 🔗 BlueMax has quit IRC (Read error: Connection reset by peer)
12:00 🔗 eientei95 Sanqui: Want me to archive their patreon?
12:03 🔗 godane has quit IRC (Read error: Operation timed out)
12:12 🔗 Sanqui eientei95: With log-in access? Please do
12:19 🔗 eientei95 Sanqui: Would a JSON export of the posts be good enough? COntains links to the images and attachments
12:20 🔗 Sanqui Can those be accessed without a 403?
12:22 🔗 eientei95 I believe so, let me check
12:23 🔗 eientei95 Hm. nope
12:24 🔗 Sanqui Then it's up to you to archive the whole thing
13:05 🔗 fie has quit IRC (Leaving)
13:09 🔗 Lord_Nigh has quit IRC (Read error: Operation timed out)
13:11 🔗 Lord_Nigh has joined #archiveteam-bs
13:37 🔗 RichardG has joined #archiveteam-bs
13:40 🔗 Lord_Nigh has quit IRC (Ping timeout: 268 seconds)
13:47 🔗 Lord_Nigh has joined #archiveteam-bs
14:43 🔗 betamax What's the best program to make a image of a CD containing Mac OS 9-era files?
14:43 🔗 betamax Ideally cross platform - or suggestions for both Windows and Mac
14:44 🔗 betamax (I'm in contact with someone who has a CD of Mac files, but I don't know their setup).
15:37 🔗 Darkstar has quit IRC (Ping timeout: 506 seconds)
15:42 🔗 Darkstar has joined #archiveteam-bs
15:43 🔗 Boppen has quit IRC (Quit: Nettalk6 - www.ntalk.de)
15:59 🔗 lindalap_ has joined #archiveteam-bs
15:59 🔗 lindalap has quit IRC (Read error: Connection reset by peer)
16:00 🔗 lindalap_ is now known as lindalap
16:10 🔗 schbirid has joined #archiveteam-bs
16:13 🔗 C4K3_ has quit IRC (leaving)
16:14 🔗 C4K3 has joined #archiveteam-bs
16:22 🔗 SilSte has quit IRC (Ping timeout: 633 seconds)
16:22 🔗 Fusl has quit IRC (K-Lined)
16:23 🔗 Fusl has joined #archiveteam-bs
16:32 🔗 JAA So, trying to grab the GIGA forums. I can't get any faster than ~150 threads per minute from one machine for some reason, even at very high concurrency (it's not CPU-limited, the process is only using 50-60 % of one core). And that's not fast enough; it'd take 6 full days to scan all ~1.3 million thread IDs, and that would only cover the first page of each thread. I'll try to split it up into multiple
16:33 🔗 astrid betamax: bin/cue, and then iso
16:33 🔗 JAA parallel processes to see if that is any faster.
16:33 🔗 JAA (If we end up doing a warrior project, this is obviously obsolete.)
16:34 🔗 schbirid why bother scanning? just -m -reject and go
16:36 🔗 JAA They don't list all threads in the indices, it seems.
16:36 🔗 JAA For example, the frontpage says there are 1684 threads in "GIGA.DE allgemein", but when you go into those forums, it only shows three threads.
16:36 🔗 schbirid &daysprune=-1
16:37 🔗 schbirid Anzeige-Eigenschaften -> Alter -> von Anfang an
16:37 🔗 JAA Yeah, right.
16:37 🔗 JAA Hmm
16:38 🔗 atrocity has quit IRC (Read error: Operation timed out)
16:40 🔗 JAA Setting up the ignores gets more complex though with a real recursive grab.
16:40 🔗 JAA With the scanning strategy, I can just let it grab only showthread.php and the actual thread links and don't need to worry about infinite tunnels etc.
16:42 🔗 JAA I might grab the indices like that though for browsability.
16:59 🔗 betamax astrid: I've found that the person has a PC, what's the best (free) Windows software for making bin / cue files
17:00 🔗 betamax (I tried ImgBurn, but that doesn't recognise the disc)
17:00 🔗 astrid i haven't worked with ripping discs much so i can't give you any useful pointers, sorry
17:01 🔗 astrid i might suggest http://cue.tools/wiki/CUETools
17:19 🔗 atrocity has joined #archiveteam-bs
17:31 🔗 schbirid betamax: https://github.com/saramibreak/DiscImageCreator
17:43 🔗 bithippo has joined #archiveteam-bs
17:57 🔗 JAA As expected, I get much higher throughput with parallel processes (at the cost of retrieving some images, stylesheets etc. several times). Six processes give me ~750 threads per minute. Let's see how long it takes until GIGA bans me.
18:03 🔗 schbirid nice
18:07 🔗 JAA Oh, just realised that I'll probably grab some threads multiple times. Hmm.
18:08 🔗 JAA In the worst case, I might actually be grabbing *everything* multiple times.
18:12 🔗 Darkstar has quit IRC (Ping timeout: 480 seconds)
18:20 🔗 DFJustin has quit IRC (Ping timeout: 260 seconds)
18:23 🔗 Darkstar has joined #archiveteam-bs
18:37 🔗 DFJustin has joined #archiveteam-bs
19:24 🔗 bithippo has quit IRC (Textual IRC Client: www.textualapp.com)
19:46 🔗 jschwart has joined #archiveteam-bs
19:52 🔗 Pixi has quit IRC (Ping timeout: 255 seconds)
20:02 🔗 Pixi has joined #archiveteam-bs
20:13 🔗 SilSte has joined #archiveteam-bs
20:24 🔗 JAA Ew, the wiki doesn't support SVG. :-(
20:31 🔗 schbirid has quit IRC (Quit: Leaving)
20:41 🔗 JAA Still doing almost exactly 750 threads per minute (749.3 over the past half hour). :-)
20:42 🔗 jschwart has quit IRC (Quit: Konversation terminated!)
20:45 🔗 JAA I'm quite surprised that they haven't banned me yet, to be honest. 120 concurrent connections from a single IP...
20:55 🔗 arkiver JAA: did you find any limits?
20:57 🔗 JAA arkiver: Nada. Only my server.
20:58 🔗 dashcloud has quit IRC (Ping timeout: 252 seconds)
21:02 🔗 dashcloud has joined #archiveteam-bs
21:22 🔗 arkiver JAA: awesome, sounds good
21:47 🔗 JAA Found a broken thread though: http://forum.giga.de/showthread.php?t=104517
21:47 🔗 JAA The only error so far. No timeouts, connection closures, whatever.
21:50 🔗 BlueMax has joined #archiveteam-bs
21:59 🔗 Mateon1 has quit IRC (Ping timeout: 268 seconds)
21:59 🔗 Mateon1 has joined #archiveteam-bs
22:11 🔗 godane has joined #archiveteam-bs
22:31 🔗 Atom has quit IRC (Read error: Connection reset by peer)
22:39 🔗 m007a83_ has joined #archiveteam-bs
22:41 🔗 m007a83 has quit IRC (Ping timeout: 252 seconds)
22:53 🔗 Mayonaise has quit IRC (Read error: Operation timed out)
22:58 🔗 Mayonaise has joined #archiveteam-bs
23:02 🔗 RichardG has quit IRC (Read error: Connection reset by peer)
23:04 🔗 RichardG has joined #archiveteam-bs
23:29 🔗 godane SketchCow: any update with Mank?
23:30 🔗 dashcloud has quit IRC (Read error: Operation timed out)
23:30 🔗 dashcloud has joined #archiveteam-bs
23:52 🔗 Atom has joined #archiveteam-bs

irclogger-viewer