[00:00] doomtay. shut up. [00:01] *** Start has quit IRC (Quit: Disconnected.) [00:03] *** Start has joined #archiveteam-bs [00:44] here's a neat magnet link it's for the this American life podcast archive of episodes from 1995-2007 [00:44] magnet:?xt=urn:btih:5e31b76cd01ff9426ca2bec078c712ff20e17af6&dn=This%20American%20Life%20-%20Complete%20Volume%201995-2007%20-%20Episodes%201-342&tr=udp%3A%2F%2Ftracker.publicbt.com%2Fannounce&tr=udp%3A%2F%2Fglotorrents.pw%3A6969%2Fannounce&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80%2Fannounce&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337%2Fannounce [00:51] *** Stiletto has quit IRC () [01:15] *** schbirid2 has joined #archiveteam-bs [01:18] *** schbirid has quit IRC (Ping timeout: 244 seconds) [01:18] *** Stiletto has joined #archiveteam-bs [02:00] *** Stiletto has quit IRC (Read error: Operation timed out) [02:01] *** Stiletto has joined #archiveteam-bs [02:01] *** dashcloud has quit IRC (Read error: Operation timed out) [02:02] *** dashcloud has joined #archiveteam-bs [02:24] *** Stiletto has quit IRC (Read error: Operation timed out) [02:25] *** Stiletto has joined #archiveteam-bs [02:49] *** SketchCow has quit IRC (Read error: Connection reset by peer) [02:55] *** SketchCow has joined #archiveteam-bs [02:55] *** midas sets mode: +o SketchCow [02:55] *** swebb sets mode: +o SketchCow [03:43] *** DoomTay has quit IRC (Quit: Page closed) [03:58] *** kristian_ has quit IRC (Leaving) [04:06] *** godane has quit IRC (Leaving.) [04:29] *** Sk1d has quit IRC (Ping timeout: 194 seconds) [04:35] *** Sk1d has joined #archiveteam-bs [04:37] *** Frogging sets mode: +o arkiver [04:52] *** Meroje has quit IRC (Quit: bye!) [04:53] *** Meroje has joined #archiveteam-bs [05:35] *** Fusl has quit IRC (Contact: http://hallowe.lt/) [05:39] nice, whoever was in #internetarchive as "obama" sent me like 40 queries [05:40] some boys don't like the +b [05:41] under 40 different handles and CTCP VERSIONed me twice [05:53] *** godane has joined #archiveteam-bs [05:56] Nice yipdw - was holding my tongue back on him myself [06:10] i'm uploading 25gb of The Doug Urbanski Show [06:10] i'm hoping IA can handle me uploading since one item is still waiting to be derive [06:28] DoomTay, Frogging : It's a genealogy library. [06:33] *** fusl has joined #archiveteam-bs [06:42] *** Honno has joined #archiveteam-bs [07:13] *** Start_ has joined #archiveteam-bs [07:13] *** Start has quit IRC (Read error: Connection reset by peer) [07:15] *** sep332 has quit IRC (Quit: konversation out) [07:24] *** dashcloud has quit IRC (Read error: Operation timed out) [07:28] *** dashcloud has joined #archiveteam-bs [07:34] *** atrocity has quit IRC (Read error: Connection reset by peer) [07:35] *** atrocity has joined #archiveteam-bs [07:54] *** dashcloud has quit IRC (Read error: Operation timed out) [07:57] *** dashcloud has joined #archiveteam-bs [07:58] *** BartoCH has quit IRC (Ping timeout: 260 seconds) [08:00] *** BartoCH has joined #archiveteam-bs [08:10] *** BlueMaxim has quit IRC (Read error: Operation timed out) [08:14] *** Start_ is now known as Start [08:23] *** wp494 has quit IRC (Read error: Connection reset by peer) [08:23] *** wp494 has joined #archiveteam-bs [09:03] *** tomwsmf has quit IRC (Read error: Operation timed out) [09:57] *** VADemon has joined #archiveteam-bs [10:07] *** BlueMaxim has joined #archiveteam-bs [10:28] *** yakfish has quit IRC (Ping timeout: 246 seconds) [10:43] *** yakfish has joined #archiveteam-bs [11:30] *** VADemon has quit IRC (Read error: Operation timed out) [11:53] hook54321: after you start the grab-site crawl in the crawl folder there are different files, (those are for the settings) if chnage any of these wpull will pickup the settings [11:54] there should be one for ignoreregexs [11:54] one per line [13:45] *** BlueMaxim has quit IRC (Quit: Leaving) [14:03] *** sep332 has joined #archiveteam-bs [15:11] *** DoomTay has joined #archiveteam-bs [15:19] *** goekesmi has left [15:25] yipdw: could we have a test endpoint on FOS to try and sort these speed issues please? [15:26] HCross2: an iperf endpoint, or something else [15:27] rsync [15:27] note that fos is currently doing a bunch of disk work so you're going to get that interference [15:28] :) [15:28] as a result your measurements are going to be noisy [15:28] Every time I try to clean up FOS I end up with 20% more used disk space [15:28] Yeah, I see. It's just that from the EU OVH it's painful and they want to test. [15:29] theey actually interested :O [15:35] so the doug urbanski show is mostly uploaded [15:37] *** dashcloud has quit IRC (Read error: Operation timed out) [15:42] *** dashcloud has joined #archiveteam-bs [15:45] rip Krautchan [16:22] *** kristian_ has joined #archiveteam-bs [16:55] *** DoomTay has quit IRC (Quit: Page closed) [17:21] *** atrocity has quit IRC (Read error: Connection reset by peer) [17:39] is it possible that pastebin.com expires/removes old pastes even if they didn't have an expiry set? there are a lot of broken pastebin links in my IRC logs and I could swear they never had an expiry [17:46] *** robink has quit IRC (Ping timeout: 501 seconds) [17:47] there's plenty of ways to remove stuff from pastebin.com [17:47] logged-in users can delete their pastes, abuse reports, DMCA reports [17:49] *** DoomTay has joined #archiveteam-bs [17:50] *** brayden__ has joined #archiveteam-bs [17:50] *** swebb sets mode: +o brayden__ [17:54] *** brayden_ has quit IRC (Read error: Operation timed out) [17:57] yes, pastebin.com may remove stuff without notice [17:57] but I would not highest on the likelyhood [17:57] i guess people removing them happens more frequently than I'd expect [17:57] it's just random stuff like code snippets my friend sent me last year, I wouldn't expect him to have deleted it manually or anything [18:01] yipdw, getting a chroot failed error :/ [18:05] kpfa archives are now up to 2016-08-09 [18:07] *** robink has joined #archiveteam-bs [18:08] *** RichardG has quit IRC (Read error: Operation timed out) [18:16] *** DoomTay has quit IRC (Quit: Page closed) [18:25] *** DoomTay has joined #archiveteam-bs [18:27] So http://4publicpurity.org/ is now a blank page [18:28] correct [18:31] At least it was saved, even if there was really little to save [18:39] *** VADemon has joined #archiveteam-bs [18:41] *** DoomTay has quit IRC (Quit: Page closed) [18:53] *** dashcloud has quit IRC (Read error: Operation timed out) [18:56] *** dashcloud has joined #archiveteam-bs [19:32] *** DoomTay has joined #archiveteam-bs [19:36] *** dashcloud has quit IRC (Read error: Operation timed out) [19:44] *** tomwsmf has joined #archiveteam-bs [19:45] *** dashcloud has joined #archiveteam-bs [19:51] *** Lord_Nigh has quit IRC (ZNC - http://znc.in) [20:07] SketchCow: more Nintendo Power issues: https://www.reddit.com/r/DataHoarder/comments/4wzzsv/a_few_more_issues_of_nintendo_power/ [20:07] yeah I just saw that, grabbed it [20:08] they are smaller then the other releases of Nintendo Power [20:08] i also have issue 171 and 180 [20:11] *** Lord_Nigh has joined #archiveteam-bs [20:49] All set for Nintendo power at the moment thank youuuuuuuuuuuuuuuuuuuuu [20:53] ok [20:56] Let's hope it doesn't go dark again [22:04] *** dashcloud has quit IRC (Read error: Connection reset by peer) [22:07] *** dashcloud has joined #archiveteam-bs [22:20] Frogging: We could set something up that automatically archives pastebin (and possibly some other sites?) links that are posted in IRC. [22:24] *** Honno has quit IRC (Read error: Operation timed out) [22:26] *** BartoCH has quit IRC (Ping timeout: 260 seconds) [22:27] *** BartoCH has joined #archiveteam-bs [22:51] *** BartoCH has quit IRC (Ping timeout: 260 seconds) [22:51] *** DoomTay has quit IRC (Ping timeout: 268 seconds) [23:08] *** BartoCH has joined #archiveteam-bs [23:11] *** DoomTay has joined #archiveteam-bs [23:20] If I have a reprint of a book that was originally published in 1906, is it safe to scan it and put it on archive.org? [23:21] yes [23:21] I'd OCR it so we can have it in pure text [23:22] DoomTay: Like, in addition to the the scans, or just OCR? [23:22] Either might work [23:22] Not sure which archive.org would accept [23:23] Yeah, both [23:24] What's the best OCR software? Are there any good free ones? [23:25] That I can't help you with [23:30] if you've got a good, clean scan, Internet Archive will OCR it for you as part of the process [23:31] they'll ocr anyway, but if it's a garbage scan you won't get much out of it [23:35] How good is their OCR? [23:38] they're kind of in the business of digitizing books so I think it'd be good [23:38] quite [23:58] *** BartoCH has quit IRC (Ping timeout: 260 seconds) [23:58] *** BartoCH has joined #archiveteam-bs