[01:36] attrition.org is done at 791mb warc.gz SketchCow [01:46] InitHello, you around? [01:46] no [01:46] I mean yes [01:47] I got the depositfiles files downloading [01:47] excellent. I have a 20MB warc lying around [01:47] yeah I got plowshare on a loop pulling the files in [01:48] It takes a while as it sits through the times and all for you [01:49] sits through the timeout phases that these dumb file hosting places have [01:50] almost tempting to get a gold membership ... until one realizes that's exactly why they have them, [01:50] 266 mod files [01:57] 5 minute wait between files now, lame-o [03:22] Is there a status page that lists the current timeout status for uploading files? Is there a best time of day for uploads or just set it and forget it? [03:36] attrition is uploaded http://archive.org/details/attrition.org [04:03] omf_: is it forcing captcha? [04:03] the depositfiles are, I captcha each download [04:04] between that and the wait times this is going to take forever. [04:04] If anyone has a captcha solver program [04:05] http://www.youtube.com/watch?v=ROrpKx3aIjA&feature=youtu.be -> IA [04:06] omf_: for recaptcha? good luck [04:09] I am wondering if entering 260 captchas is worth the $11 for a one month membership and the ability to download all these files in 5 minutes [07:15] <[1]in8mal> geetz [08:04] omf_: Thanks [08:07] Time to kill more spam accounts [08:09] Killer in the niiiiiiight [08:15] omf_: you could always just wire up deathbycaptcha or something [08:21] Going in for the kill - how much can I kill in one hour? [08:48] http://www.archiveteam.org/index.php?title=Special%3AListUsers&username=X&group=&limit=50 [08:48] cleeeean [08:49] 3 letters out of 26 [08:49] (and chinese characters) [14:13] Hello! [15:16] ha ha, the beast is slowing down - 5 hours later, only six more spam pages added. [15:16] Take that! [15:29] ;) [15:37] Hopefully the source of the spam'll get knocked out or just give up. If i might ask, how long has this been a problem? i'm still kinda new here as far as actual wiki involvement goes. [15:38] Well, the thing is, I added a rather significant hurdle. [15:38] Guy has to come on here. [15:38] If the guy comes on here, does it, we're done [15:38] I just change the word [15:38] In theory, person has to keep coming back [15:39] are you sure it's one source and not bots? [15:40] also why not add a captcha in addition to the secret word? [15:40] captchas suck though [15:42] You have it [15:42] And they don't work [15:42] Here's the thing [15:42] You were asked 5 questions [15:42] One of five. [15:42] So over time, they got all the questions. [15:42] And I think someone does this, goes around, feeding this stuff [15:42] Then the bots kicked in [15:43] captchas are broken - you can be paid bitcoins to just put in captcha answers all day. [15:44] PROGRESS ON RE-GRABBING THE TEXTFILES DUMP: 1%, NO SEEDS. [15:44] 159,367k/11gb [15:44] *11.7gb [15:49] are you using http://archive.org/download/textfiles-dot-com-2011/textfiles-dot-com-2011_archive.torrent [15:51] yeah [15:52] *sorry for the delayed response, i was taking dishes from my lunch (half a patty melt left over from last night) down. [15:52] note every time there's a change to the item the previous torrent is invalidated, so you might try re-grabbing the torrent file [15:52] Why not grab the archive.org copy? [15:52] It's the same one? [15:52] i am using the textfiles one. [15:52] Sorry, misread here. [15:52] ah, np [15:52] I guess I should ask what you're trying to do [15:52] also there seem to be bogus files on the item like textfiles-dot-com-2011_files.torrent and textfiles-dot-com-2011_meta.torrent [15:53] yeah, i'm using _archivew [15:53] *_archive [15:54] already have a copy of the extracted files on my external, but i'm trying to populate a separate drive specifically for archive stuff [15:55] (also, the copy on my external is compromised -- my AV went a little ballistic when it found all the source code to MS-DOS era viruses/etc. in it and i don't have the original .7zs anymore) [22:34] woo gitdigger project has hit over 600k repos cloned today