#archiveteam 2013-04-18,Thu

↑back Search

Time Nickname Message
01:36 🔗 omf_ attrition.org is done at 791mb warc.gz SketchCow
01:46 🔗 omf_ InitHello, you around?
01:46 🔗 InitHello no
01:46 🔗 InitHello I mean yes
01:47 🔗 omf_ I got the depositfiles files downloading
01:47 🔗 InitHello excellent. I have a 20MB warc lying around
01:47 🔗 omf_ yeah I got plowshare on a loop pulling the files in
01:48 🔗 omf_ It takes a while as it sits through the times and all for you
01:49 🔗 omf_ sits through the timeout phases that these dumb file hosting places have
01:50 🔗 InitHello almost tempting to get a gold membership ... until one realizes that's exactly why they have them,
01:50 🔗 omf_ 266 mod files
01:57 🔗 omf_ 5 minute wait between files now, lame-o
03:22 🔗 omf_ Is there a status page that lists the current timeout status for uploading files? Is there a best time of day for uploads or just set it and forget it?
03:36 🔗 omf_ attrition is uploaded http://archive.org/details/attrition.org
04:03 🔗 balrog omf_: is it forcing captcha?
04:03 🔗 omf_ the depositfiles are, I captcha each download
04:04 🔗 omf_ between that and the wait times this is going to take forever.
04:04 🔗 omf_ If anyone has a captcha solver program
04:05 🔗 balrog http://www.youtube.com/watch?v=ROrpKx3aIjA&feature=youtu.be -> IA
04:06 🔗 balrog omf_: for recaptcha? good luck
04:09 🔗 omf_ I am wondering if entering 260 captchas is worth the $11 for a one month membership and the ability to download all these files in 5 minutes
07:15 🔗 [1]in8mal geetz
08:04 🔗 SketchCow omf_: Thanks
08:07 🔗 SketchCow Time to kill more spam accounts
08:09 🔗 SketchCow Killer in the niiiiiiight
08:15 🔗 kanzure_ omf_: you could always just wire up deathbycaptcha or something
08:21 🔗 SketchCow Going in for the kill - how much can I kill in one hour?
08:48 🔗 SketchCow http://www.archiveteam.org/index.php?title=Special%3AListUsers&username=X&group=&limit=50
08:48 🔗 SketchCow cleeeean
08:49 🔗 SketchCow 3 letters out of 26
08:49 🔗 SketchCow (and chinese characters)
14:13 🔗 Mister_Ar Hello!
15:16 🔗 SketchCow ha ha, the beast is slowing down - 5 hours later, only six more spam pages added.
15:16 🔗 SketchCow Take that!
15:29 🔗 Smiley ;)
15:37 🔗 MrArgent Hopefully the source of the spam'll get knocked out or just give up. If i might ask, how long has this been a problem? i'm still kinda new here as far as actual wiki involvement goes.
15:38 🔗 SketchCow Well, the thing is, I added a rather significant hurdle.
15:38 🔗 SketchCow Guy has to come on here.
15:38 🔗 SketchCow If the guy comes on here, does it, we're done
15:38 🔗 SketchCow I just change the word
15:38 🔗 SketchCow In theory, person has to keep coming back
15:39 🔗 balrog are you sure it's one source and not bots?
15:40 🔗 balrog also why not add a captcha in addition to the secret word?
15:40 🔗 balrog captchas suck though
15:42 🔗 SketchCow You have it
15:42 🔗 SketchCow And they don't work
15:42 🔗 SketchCow Here's the thing
15:42 🔗 SketchCow You were asked 5 questions
15:42 🔗 SketchCow One of five.
15:42 🔗 SketchCow So over time, they got all the questions.
15:42 🔗 SketchCow And I think someone does this, goes around, feeding this stuff
15:42 🔗 SketchCow Then the bots kicked in
15:43 🔗 Smiley captchas are broken - you can be paid bitcoins to just put in captcha answers all day.
15:44 🔗 MrArgent PROGRESS ON RE-GRABBING THE TEXTFILES DUMP: 1%, NO SEEDS.
15:44 🔗 MrArgent 159,367k/11gb
15:44 🔗 MrArgent *11.7gb
15:49 🔗 DFJustin are you using http://archive.org/download/textfiles-dot-com-2011/textfiles-dot-com-2011_archive.torrent
15:51 🔗 MrArgent yeah
15:52 🔗 MrArgent *sorry for the delayed response, i was taking dishes from my lunch (half a patty melt left over from last night) down.
15:52 🔗 DFJustin note every time there's a change to the item the previous torrent is invalidated, so you might try re-grabbing the torrent file
15:52 🔗 SketchCow Why not grab the archive.org copy?
15:52 🔗 SketchCow It's the same one?
15:52 🔗 MrArgent i am using the textfiles one.
15:52 🔗 SketchCow Sorry, misread here.
15:52 🔗 MrArgent ah, np
15:52 🔗 SketchCow I guess I should ask what you're trying to do
15:52 🔗 DFJustin also there seem to be bogus files on the item like textfiles-dot-com-2011_files.torrent and textfiles-dot-com-2011_meta.torrent
15:53 🔗 MrArgent yeah, i'm using _archivew
15:53 🔗 MrArgent *_archive
15:54 🔗 MrArgent already have a copy of the extracted files on my external, but i'm trying to populate a separate drive specifically for archive stuff
15:55 🔗 MrArgent (also, the copy on my external is compromised -- my AV went a little ballistic when it found all the source code to MS-DOS era viruses/etc. in it and i don't have the original .7zs anymore)
22:34 🔗 WiK woo gitdigger project has hit over 600k repos cloned today

irclogger-viewer