#archiveteam 2014-10-23,Thu

↑back Search

Time Nickname Message
00:13 🔗 jscottsux Oh yeah, and fuck archive.org. Only losers add to that service. That service should be under my command. I'd love for your leader, Jason Scott, to get tortured in many pornographic ways. Or "rape torture". I'd take the front seat to watching. DFJustin, Ivan, and XMC will also have terrible genatalia torture.
01:39 🔗 DFJustin hey I finally rate
05:03 🔗 Atluxity well that seemed like a charming guy
05:03 🔗 Atluxity I must surely have some good points
05:03 🔗 Atluxity *he
05:03 🔗 Atluxity damnit....
07:10 🔗 Smiley SketchCow: point me where metadata is needdd...
09:20 🔗 signius I am getting "bash: ./get-wget-lua.sh: Permission denied" when i try an clone the halo-grab
09:23 🔗 antomatic signius: try chmod +x get-wget-lua.sh
09:24 🔗 antomatic then try again
09:24 🔗 antomatic (happens sometimes, it's not just you)
09:25 🔗 signius antomatic, lol cheers, i was just changing the permissions manually as you said that :)
09:27 🔗 signius sigh ok no items available for qwiki or halo
11:52 🔗 dashcloud Smiley: you'll want to email /msg SketchCow with your email address so he can give you access to the arcade collection
12:20 🔗 schbirid i'll write something to archive mapillary.com in winter, their "2048px" images would be around 2TB from a quick glance.
12:36 🔗 midas just 2TB?
13:22 🔗 schbirid midas: ~5 million photos of ~500 kilobytes each
14:15 🔗 midas damn
14:58 🔗 staavmixe Did I miss why archiving TwitPic has stopped? They will shut down the 25th right?
14:59 🔗 midas twitpic is completed if im not mistaken
15:00 🔗 staavmixe Oh cool
15:00 🔗 staavmixe Expected more than 10 million pictures to be honest
15:01 🔗 balrog aren't they actively blocking us?
15:06 🔗 Compresse I've been running my crawlers on high concurrency and haven't seen any blocking, balrog
15:06 🔗 Compresse Only some 503 errors
15:06 🔗 balrog Compresse: they were earlier
15:06 🔗 Compresse Yes, I did join late.
15:07 🔗 staavmixe But if I look at they leaderboard now, nothing is happening
15:07 🔗 staavmixe Although it says 0 to go so I guess @balrog is right
15:07 🔗 Rotab interesting, apparently i got banned from twitpic.com again.. was unbanned for a while :D
15:55 🔗 SketchCow Twitpic blocked us
15:55 🔗 SketchCow We got a set of it.
15:56 🔗 SketchCow Hundreds of millions will disappear, hundreds of millions gotten.
15:56 🔗 SketchCow I am speaking at an archiving conference on Sunday.
15:56 🔗 SketchCow 90% of my talk will be twitpic
16:14 🔗 schbirid i hope there will it be variations of the words ('twit','pic','shit','twat','pig','tit','quit')
16:20 🔗 Nemo_bis Do more IPs help?
16:24 🔗 midas SketchCow: there should be a 'jason gets mad at people on stage' page on the wiki
16:25 🔗 godane SketchCow: i hope there is a video recording of that talk
17:42 🔗 SketchCow So, today's day is me basically pushing in descriptions for 1,130 collections on Internet Archive
17:49 🔗 godane uploaded: https://archive.org/details/aol-file-protocol-4400-3501-to-3600
18:05 🔗 DFJustin a lot of text files in this batch (and in previous batches)
18:08 🔗 DFJustin would be nice to see a textfiles.com update someday
19:46 🔗 joepie91 [21:44] <SketchCow> [19:56:17] 90% of my talk will be twitpic
19:46 🔗 joepie91 this means "90% of my talk will be verbally violating Twitpic and anybody associated with the decision to block us", yes?
19:47 🔗 joepie91 also, midas: that's the "public speaking" page, I think :)
20:02 🔗 signius project code is out of date & needs to be upgraded
20:02 🔗 signius is there a simple way to update the code other than a git clone & recompiling ?
20:02 🔗 aaaaaaaaa git pull
20:03 🔗 DFJustin any chance of badmouthing imageshack as well
20:05 🔗 bzc6p_ I see there was some lack of information about twitpic. Let me summarize what I know.
20:06 🔗 bzc6p_ Some days ago twitpic hid the pictures, so now we're getting only the HTML with some of the comments.
20:06 🔗 bzc6p_ But that's far not done. We've scraped about ~40% of them, there are ~12 million items left (12m*36 pages), they are loaded in ~1.5 million batches, because of limited memory of the tracker. (We won't finish till 25th October.)
20:06 🔗 bzc6p_ Actually it was a memory issue and that the admin was away, why it was stopped for several hours today.
20:06 🔗 bzc6p_ Regarding the actual pictures, ~500 million was downloaded by Kenshin some weeks ago. There are about ~300 millions more, which we have no access to.
20:06 🔗 bzc6p_ IRC channel #quitpic.
20:06 🔗 bzc6p_ End of report
20:07 🔗 joepie91 bzc6p_: you missed the part where Twitpic are dicks
20:07 🔗 joepie91 :)
20:07 🔗 bzc6p_ I thought everyone knows that :)
20:07 🔗 antomatic I seriously propose we change the wiki password to 'noaheverettisanobshiner'
20:08 🔗 antomatic or possibly knobshiner, for accuracy
20:08 🔗 yipdw yahoosucks is more punchy
20:08 🔗 joepie91 everettsucks
20:08 🔗 signius ok i just ran "su -c "cd /home/archiveteam; git clone https://github.com/ArchiveTeam/twitpic-grab.git; cd twitpic-grab; ./get-wget-lua.sh" archiveteam" & its still telling me the project code is out of date when i run it
20:09 🔗 ersi sunshineanddollars
20:09 🔗 joepie91 that sounds like a misconfigured tracker?
20:10 🔗 Kazzy signius: paste output of 'pwd' here
20:10 🔗 bzc6p_ So, for newcomers: export sucks; Noah Everett is radio silent, except when misleading with shutdown or won't-shut-down notices; several people offered him to pay the bandwidth cost to backup everything but Noah disappeared
20:11 🔗 antomatic Of course we'll all have to change our tune when Noah's picture is on the front page of CNN, handing over the hard discs to the library of congress or something.
20:12 🔗 joepie91 antomatic: I am okay with eating my words if that is the tradeoff
20:12 🔗 Kazzy signius: you here?
20:12 🔗 antomatic joepie91: I think we can be confident that this will never happen, sadly
20:13 🔗 signius Kazzy, yeah sorry just got multiple screen im switching between
20:13 🔗 joepie91 :(
20:13 🔗 bzc6p_ Well, I wouldn't be surprised on anything after this all... but paranoia is one of our basic traits and this time we have a reson for that
20:13 🔗 Kazzy paste output of 'pwd' here
20:14 🔗 signius /home/archiveteam
20:14 🔗 signius pwd
20:14 🔗 Kazzy cd twitpic-grab
20:14 🔗 Kazzy run-pipeline etc
20:16 🔗 SketchCow hahaha, blocked by Ello
20:19 🔗 SketchCow Ha ha, he just declared war on me.
20:19 🔗 joepie91 SketchCow: ?
20:19 🔗 signius Kazzy, http://fpaste.org/
20:19 🔗 SketchCow Ello guy.
20:19 🔗 antomatic Why would they block you after you were so nice to them? ):
20:19 🔗 SketchCow I think we consider archiving them.
20:19 🔗 antomatic er, :)
20:20 🔗 SketchCow Started a project channel: #oodbye
20:20 🔗 joepie91 SketchCow: something publicly readable?
20:20 🔗 joepie91 I have a spare bucket of popcorn on my desk
20:20 🔗 SketchCow His twitter is #cacheflowe
20:21 🔗 SketchCow But he's blocked me. I can't retweet him anymore, or anything.
20:21 🔗 chronomex awww
20:22 🔗 antomatic cacheflowe: @textfiles If you spoke to me like a respectful adult, instead of starting the conversation calling us a "Roach Motel" it would be different
20:23 🔗 SketchCow Tattoo on chest
20:23 🔗 ersi Haha!
20:24 🔗 SketchCow It's really late in the day to be shocked by my tactics in here.
20:24 🔗 SketchCow (For the record)
20:25 🔗 SketchCow https://twitter.com/textfiles/status/525377284958855170 is my tweet of the day
20:25 🔗 antomatic Devil's advocate - and purely out of interest (I'm not taking either position) - is he right?
20:26 🔗 chronomex is he right about what?
20:27 🔗 bzc6p SketchCow: from channel #quitpic
20:27 🔗 bzc6p <chfoo> i wonder who's behind https://twitter.com/TwitPicSupport/with_replies
20:27 🔗 bzc6p <chfoo> if someone could figure that out maybe there would be way negotiate
20:27 🔗 SketchCow I saw
20:27 🔗 bzc6p Ok, sorry then. Just wanted to make sure.
20:27 🔗 antomatic Is he right in being bruised that the conversation apparently started with disrespect which made him discinclined to co-operate with something which he might otherwise have been receptive to?
20:28 🔗 SketchCow I am sometimes distracted.
20:28 🔗 antomatic Devil's advocate.
20:28 🔗 * antomatic dons fireproof suit.
20:28 🔗 SketchCow You know what I hate?
20:28 🔗 SketchCow Advocatingfor the Devil
20:28 🔗 SketchCow He's the fuckin' Devil for a reason
20:28 🔗 ersi antomatic: L o L
20:28 🔗 antomatic Did he run over your dog or something?
20:28 🔗 chronomex worse
20:28 🔗 SketchCow Oh yeah, I always forget this about you.
20:28 🔗 chronomex my cat
20:28 🔗 ersi If someone gets a bit butthurt over being called on destroying data, so be it
20:29 🔗 SketchCow Here is what goes on.
20:29 🔗 SketchCow I always confuse you with arkiver.
20:29 🔗 ersi As long as someone gets some saving going on, fuck it all
20:29 🔗 SketchCow Similar names, sleep patters.
20:29 🔗 SketchCow I like arkiver.
20:29 🔗 antomatic I think I'm offended. :)
20:29 🔗 chronomex now that you mention it, i also confuse antomatic and arkiver all the time
20:29 🔗 midas im not sure if he really blocked you, twitter is fucking broken
20:30 🔗 signius ok delete the /home/archiveteam/twitpic-grab directory & re cloned it and its working now so fuck knows
20:30 🔗 SketchCow So I always forgive arkiver that, for some reason, he occasionally looks left and right as Archiveteam does another bombing raid and goes "uh, guys? Aren't we... aren't we being a little rough here?"
20:30 🔗 chronomex yeah twitter is down for me
20:30 🔗 antomatic I have been here at least twice as long as arkiver! :)
20:30 🔗 SketchCow Arkiver is better to deal with.
20:30 🔗 schbirid #######bbbbbbbssssssssss
20:30 🔗 SketchCow QUICK GET THE DEFIBRILLATOR FOR SCHBIRID
20:30 🔗 antomatic I'm genuinely surprised to even be on anyone's radar, though. And as I say, no judgement, I was just interested. No biggie.
20:30 🔗 SketchCow Also he is right
20:32 🔗 bzc6p signius: "git clone" must be run from outside, "git pull" from inside the directory
20:33 🔗 signius bzc6p, yeah i was inside when trying to do a git pull but it was bitching about the branch
20:33 🔗 signius i got it sorted now though
20:33 🔗 bzc6p However, for me those commands are a bit complicated. I know those are in the guide, but I don't create a dedicated user
20:36 🔗 SketchCow The commands are a bit complicated because that's why we made the vm.
20:39 🔗 signius ok "git pull && ./get-wget-lau.sh" worked fine on the other box so something got its panties in a bunch on the first box its all running fine now, but thanks for the help
20:41 🔗 midas ello got $5.5mil, man we could archive the internet with that.
20:45 🔗 SketchCow I love his "we haven't had time"
20:46 🔗 SketchCow No, I'm just watching him shove his own foot in his mouth and every time someone says something he points at it and says "mmmmggjgjggmmm foootph"
20:46 🔗 midas ill wait for the heartattack when he finds the bandwidth bill
20:47 🔗 midas did noaheverett ever responded to your tweets?
20:47 🔗 SketchCow http://i.imgur.com/CrsoJbL.gifv
20:47 🔗 SketchCow FOS currently
20:48 🔗 midas mmm internets
21:20 🔗 joepie91 SketchCow: you are linking teh evils
21:20 🔗 joepie91 :)
21:22 🔗 joepie91 [22:29] <chronomex> now that you mention it, i also confuse antomatic and arkiver all the time
21:22 🔗 joepie91 wat
21:22 🔗 * joepie91 is not sure what the similarities, and only confuses arkiver with arkhive for tabcomplete reasons
21:24 🔗 antomatic I wonder if I should change my name
21:24 🔗 joepie91 arkomatic?
21:24 🔗 Kazzy not_arkiver
21:24 🔗 antomatic something with a Z, perhaps. Or an X. Xes are cool.
21:24 🔗 ersi change it to arkivermatic
21:24 🔗 antomatic :)
21:24 🔗 joepie91 I think arkomatic would suffice
21:25 🔗 joepie91 Kazzy: haha, implying efnet would allow such a horrendously long nickname
21:25 🔗 joepie91 :)
21:25 🔗 Kazzy i don't even understand why there's a limit of 9 chars, damn
21:26 🔗 ersi because EFNet is Oldnet
21:26 🔗 aaaaaaaaa ~arkiver
21:27 🔗 ersi aaaarkiver
21:33 🔗 chronomex woop woop woop off-topic siren
21:59 🔗 TFGBD_ So Archive.org will take shovelware/shareware CDs?
22:00 🔗 chronomex yes
22:01 🔗 TFGBD_ Even if they are still copyrighted
22:01 🔗 TFGBD_ Though, I guess some of these compilers may be out of business
22:01 🔗 chronomex it'll be fine
22:01 🔗 TFGBD_ I noticed there was a full copy of Corel DRAW 1 in one of your archives
22:01 🔗 TFGBD_ How does Archive.org see "abandonware" like that
22:02 🔗 chronomex it'll be fine
22:02 🔗 TFGBD_ I mean it's technically warez, isn't it?
22:02 🔗 schbirid TFGBD_: the right holders can file DMCA complaints
22:02 🔗 TFGBD_ That sucks
22:02 🔗 TFGBD_ Just like YouTube
22:02 🔗 TFGBD_ I guess it's best to keep that stuff on the DL
22:02 🔗 schbirid i think archive.org will safe-keep the stuff for when the copyright expires
22:03 🔗 TFGBD_ That's good too
22:03 🔗 chronomex yes. ia does not delete things. i would like to close this topic.
22:03 🔗 TFGBD_ lets hope they survive that long
22:03 🔗 chronomex we have an faq, isn't this covered?
22:03 🔗 chronomex http://archiveteam.org/index.php?title=FAQ
22:04 🔗 joepie91 TFGBD_: just upload it, don't go shouting "GET YOUR WAREZ HERE!!1!" off the rooftops, and all should be fine - at worst it'll be made publicly unavailable and a darked copy continues to exist
22:12 🔗 SketchCow DID I MISS ANYTHING
22:15 🔗 antomatic free warez. all very hush-hush.
22:17 🔗 SketchCow Taking Corel Draw 1 from my cold, dead, ink-stained fingers
22:20 🔗 antomatic like zero, zero, zero-zero-zero-day software.
22:20 🔗 antomatic possibly with a 1 on the front.
22:29 🔗 marc lol
22:29 🔗 marc jason check privmsg
22:30 🔗 marc come out 2nite
23:20 🔗 SketchCow http://archive.org/editxml/geoworldmagazine
23:22 🔗 SketchCow http://www.theglobeandmail.com/technology/digital-culture/the-race-to-archive-twitpic-before-800-million-pictures-vanish/article21199755/
23:26 🔗 TFGBD_ Archive.org barely seems to screen the stuff.
23:27 🔗 TFGBD_ Like, noticed a bunch of fake submissions of "Windows Media Player 10" that were likely just virus laden links to some website
23:28 🔗 chronomex IA does not inspect all uploads
23:33 🔗 TFGBD_ they must have tons of bandwidth
23:34 🔗 DFJustin they have several 10gbit links

irclogger-viewer