Time |
Nickname |
Message |
01:36
🔗
|
omf_ |
attrition.org is done at 791mb warc.gz SketchCow |
01:46
🔗
|
omf_ |
InitHello, you around? |
01:46
🔗
|
InitHello |
no |
01:46
🔗
|
InitHello |
I mean yes |
01:47
🔗
|
omf_ |
I got the depositfiles files downloading |
01:47
🔗
|
InitHello |
excellent. I have a 20MB warc lying around |
01:47
🔗
|
omf_ |
yeah I got plowshare on a loop pulling the files in |
01:48
🔗
|
omf_ |
It takes a while as it sits through the times and all for you |
01:49
🔗
|
omf_ |
sits through the timeout phases that these dumb file hosting places have |
01:50
🔗
|
InitHello |
almost tempting to get a gold membership ... until one realizes that's exactly why they have them, |
01:50
🔗
|
omf_ |
266 mod files |
01:57
🔗
|
omf_ |
5 minute wait between files now, lame-o |
03:22
🔗
|
omf_ |
Is there a status page that lists the current timeout status for uploading files? Is there a best time of day for uploads or just set it and forget it? |
03:36
🔗
|
omf_ |
attrition is uploaded http://archive.org/details/attrition.org |
04:03
🔗
|
balrog |
omf_: is it forcing captcha? |
04:03
🔗
|
omf_ |
the depositfiles are, I captcha each download |
04:04
🔗
|
omf_ |
between that and the wait times this is going to take forever. |
04:04
🔗
|
omf_ |
If anyone has a captcha solver program |
04:05
🔗
|
balrog |
http://www.youtube.com/watch?v=ROrpKx3aIjA&feature=youtu.be -> IA |
04:06
🔗
|
balrog |
omf_: for recaptcha? good luck |
04:09
🔗
|
omf_ |
I am wondering if entering 260 captchas is worth the $11 for a one month membership and the ability to download all these files in 5 minutes |
07:15
🔗
|
[1]in8mal |
geetz |
08:04
🔗
|
SketchCow |
omf_: Thanks |
08:07
🔗
|
SketchCow |
Time to kill more spam accounts |
08:09
🔗
|
SketchCow |
Killer in the niiiiiiight |
08:15
🔗
|
kanzure_ |
omf_: you could always just wire up deathbycaptcha or something |
08:21
🔗
|
SketchCow |
Going in for the kill - how much can I kill in one hour? |
08:48
🔗
|
SketchCow |
http://www.archiveteam.org/index.php?title=Special%3AListUsers&username=X&group=&limit=50 |
08:48
🔗
|
SketchCow |
cleeeean |
08:49
🔗
|
SketchCow |
3 letters out of 26 |
08:49
🔗
|
SketchCow |
(and chinese characters) |
14:13
🔗
|
Mister_Ar |
Hello! |
15:16
🔗
|
SketchCow |
ha ha, the beast is slowing down - 5 hours later, only six more spam pages added. |
15:16
🔗
|
SketchCow |
Take that! |
15:29
🔗
|
Smiley |
;) |
15:37
🔗
|
MrArgent |
Hopefully the source of the spam'll get knocked out or just give up. If i might ask, how long has this been a problem? i'm still kinda new here as far as actual wiki involvement goes. |
15:38
🔗
|
SketchCow |
Well, the thing is, I added a rather significant hurdle. |
15:38
🔗
|
SketchCow |
Guy has to come on here. |
15:38
🔗
|
SketchCow |
If the guy comes on here, does it, we're done |
15:38
🔗
|
SketchCow |
I just change the word |
15:38
🔗
|
SketchCow |
In theory, person has to keep coming back |
15:39
🔗
|
balrog |
are you sure it's one source and not bots? |
15:40
🔗
|
balrog |
also why not add a captcha in addition to the secret word? |
15:40
🔗
|
balrog |
captchas suck though |
15:42
🔗
|
SketchCow |
You have it |
15:42
🔗
|
SketchCow |
And they don't work |
15:42
🔗
|
SketchCow |
Here's the thing |
15:42
🔗
|
SketchCow |
You were asked 5 questions |
15:42
🔗
|
SketchCow |
One of five. |
15:42
🔗
|
SketchCow |
So over time, they got all the questions. |
15:42
🔗
|
SketchCow |
And I think someone does this, goes around, feeding this stuff |
15:42
🔗
|
SketchCow |
Then the bots kicked in |
15:43
🔗
|
Smiley |
captchas are broken - you can be paid bitcoins to just put in captcha answers all day. |
15:44
🔗
|
MrArgent |
PROGRESS ON RE-GRABBING THE TEXTFILES DUMP: 1%, NO SEEDS. |
15:44
🔗
|
MrArgent |
159,367k/11gb |
15:44
🔗
|
MrArgent |
*11.7gb |
15:49
🔗
|
DFJustin |
are you using http://archive.org/download/textfiles-dot-com-2011/textfiles-dot-com-2011_archive.torrent |
15:51
🔗
|
MrArgent |
yeah |
15:52
🔗
|
MrArgent |
*sorry for the delayed response, i was taking dishes from my lunch (half a patty melt left over from last night) down. |
15:52
🔗
|
DFJustin |
note every time there's a change to the item the previous torrent is invalidated, so you might try re-grabbing the torrent file |
15:52
🔗
|
SketchCow |
Why not grab the archive.org copy? |
15:52
🔗
|
SketchCow |
It's the same one? |
15:52
🔗
|
MrArgent |
i am using the textfiles one. |
15:52
🔗
|
SketchCow |
Sorry, misread here. |
15:52
🔗
|
MrArgent |
ah, np |
15:52
🔗
|
SketchCow |
I guess I should ask what you're trying to do |
15:52
🔗
|
DFJustin |
also there seem to be bogus files on the item like textfiles-dot-com-2011_files.torrent and textfiles-dot-com-2011_meta.torrent |
15:53
🔗
|
MrArgent |
yeah, i'm using _archivew |
15:53
🔗
|
MrArgent |
*_archive |
15:54
🔗
|
MrArgent |
already have a copy of the extracted files on my external, but i'm trying to populate a separate drive specifically for archive stuff |
15:55
🔗
|
MrArgent |
(also, the copy on my external is compromised -- my AV went a little ballistic when it found all the source code to MS-DOS era viruses/etc. in it and i don't have the original .7zs anymore) |
22:34
🔗
|
WiK |
woo gitdigger project has hit over 600k repos cloned today |