#archiveteam 2012-02-28,Tue

↑back Search

Time Nickname Message
00:01 🔗 SketchCow Hostgator just suspended me. :)
00:01 🔗 SketchCow They want account verification.
00:01 🔗 SketchCow I assume this is because I bought a 3 year business account.
00:01 🔗 kennethre epic
00:08 🔗 topaz jeez, that was fast.
00:31 🔗 SketchCow I'm refunding hostgator and switching to webfaction.
00:33 🔗 kennethre <3
00:34 🔗 kennethre SketchCow: let me know if you have any questions w/ the webfaction setup
00:34 🔗 kennethre SketchCow: the app/domain mapping is a bit unique to them
00:36 🔗 SketchCow I want to run a mediawiki on a domain.
00:36 🔗 SketchCow It's not that hard
00:36 🔗 SketchCow If it IS hard, something's wrong
00:39 🔗 Zuu- He;;p
00:39 🔗 Zuu- Hello*
00:40 🔗 SketchCow Hi
00:40 🔗 Zuu- Is there ssl here? I cant seem to connect to port 6697
00:40 🔗 Zuu- Oh, it's connecting now, sorry
00:41 🔗 SketchCow Thank you for signing up with us.
00:41 🔗 SketchCow I just bought a 5 year account at webfaction.
00:42 🔗 Zuu- Is that the dealbreaker problem you were talking about on twitter?
00:43 🔗 SketchCow The dealbreaker problem was hostgator suspending my account while I was working to move files to it.
00:43 🔗 SketchCow And then demanding ID, and then taking an hour or two.
00:43 🔗 SketchCow Sorry, so fired
00:44 🔗 SketchCow Either demand my ID as a final step, or don't demand it.
00:44 🔗 arrith SketchCow: i've heard good things about rackspace
00:44 🔗 SketchCow Put it up, make it work, THEN jam it down, ESPECIALLY when I just gave you $350 bucks, sorry, you're shit and I'm done.
00:44 🔗 SketchCow If webfaction shits the bed, rackspace is next.
00:44 🔗 Zuu- have you tried namecheap?
00:45 🔗 SketchCow I use them for domains, not hosting.
00:45 🔗 Zuu- Ah
00:46 🔗 kennethre SketchCow: they actually have a one-click install for mediawiki
00:46 🔗 kennethre v1.8.2
00:47 🔗 SketchCow Good.
00:48 🔗 kennethre SketchCow: https://panel.webfaction.com/app/create?script_url=https%3A%2F%2Fwiki.webfaction.com%2Fwiki%2FMediaWiki-1.8.2Install%3Fformat%3Dtxt
01:18 🔗 SketchCow Wow, that's one fucked up interface.
01:18 🔗 SketchCow I wish you had told me it would be something by geeks to provide to nerds to make them all feel like sexless robots with infinite processing space and time
01:19 🔗 SketchCow I want to hear the AWESOME metric this approach fulfills
01:19 🔗 SketchCow I want it so you go to archiveteam.org and it shows the wiki. Looks like this thing tries to shove you into webapps.
01:19 🔗 kennethre SketchCow: i told you to ask :)
01:20 🔗 kennethre SketchCow: create mediawiki app, create archiveteam website, add architeteam domain to said website
01:20 🔗 kennethre and you'd golden
01:20 🔗 kennethre *you're
01:21 🔗 SketchCow I like that.
01:21 🔗 SketchCow 1. Turn lead to gold
01:21 🔗 kennethre haha
01:21 🔗 SketchCow 2. Forge gold into statue
01:21 🔗 SketchCow 3. Tell golem to emulate gold statue
01:21 🔗 SketchCow 4. Instant sandwich!
01:21 🔗 SketchCow I just want a sandwich
01:21 🔗 kennethre it's setup so you can have, say, wordpress running under a certian path on a domain
01:22 🔗 kennethre like mediawiki for /, wordpress for /blog/
01:22 🔗 SketchCow Yeah, it's on crack.
01:22 🔗 SketchCow Anyway, you're no help
01:22 🔗 SketchCow I obviously have to go through some sort of cult transition to think in this retarded geek environment
01:22 🔗 kennethre it takes 3 seconds to setup
01:22 🔗 SketchCow So dropping spoilers that mediawiki is a sled isn't going to help
01:22 🔗 SketchCow Yes, it takes me 3 seconds to find out the processing of a machine too
01:22 🔗 SketchCow Unless I'm dropped into Unix like a foundling
01:23 🔗 SketchCow Allow me some time to go read this crazy moonbat environment up
01:24 🔗 kennethre SketchCow: http://docs.webfaction.com/user-guide/websites.html
01:24 🔗 SketchCow Too late, I'm working over here
01:28 🔗 topaz gotta be fast to keep up with the cow
01:29 🔗 SketchCow Error: Traceback (most recent call last):
01:29 🔗 SketchCow File "/var/tmp/api_system.28582", line 96, in ?
01:29 🔗 SketchCow func(account, app_name, autostart, extra_info, password, server, session_id, username)
01:29 🔗 SketchCow File "/var/tmp/api_system.28582", line 58, in create
01:29 🔗 SketchCow server.create_db(session_id, db_name, 'mysql', password)
01:29 🔗 SketchCow File "/usr/local/lib/python2.4/xmlrpclib.py", line 1096, in __call__
01:29 🔗 SketchCow return self.__send(self.__name, args)
01:29 🔗 SketchCow File "/usr/local/lib/python2.4/xmlrpclib.py", line 1383, in __request
01:29 🔗 SketchCow verbose=self.__verbose
01:29 🔗 SketchCow File "/usr/local/lib/python2.4/xmlrpclib.py", line 1147, in request
01:30 🔗 SketchCow return self._parse_response(h.getfile(), sock)
01:30 🔗 SketchCow File "/usr/local/lib/python2.4/xmlrpclib.py", line 1286, in _parse_response
01:30 🔗 SketchCow return u.close()
01:30 🔗 SketchCow File "/usr/local/lib/python2.4/xmlrpclib.py", line 744, in close
01:30 🔗 SketchCow raise Fault(**self._stack[0])
01:30 🔗 SketchCow xmlrpclib.Fault: <Fault 1: 'name: The name "archiveteam_mediawiki" is too long. It must be 16 characters or less for MySql databases.'>
01:30 🔗 SketchCow Yeah, there's some aweseom
01:30 🔗 SketchCow Well, thanks for dropping me into a true level of hell
01:30 🔗 SketchCow It's like driving a car through a playground of babies and steering with PERL
01:30 🔗 SketchCow I mean, a serious, level of hell
01:30 🔗 topaz 16 characters or less.
01:30 🔗 SketchCow Let's do surgery with algrebra.
01:30 🔗 topaz welcome to 1983.
01:31 🔗 SketchCow Shut up, once we used 17 characters in a filename and a man DIED.
01:31 🔗 SketchCow He DIED.
01:31 🔗 topaz was it CP/M?
01:31 🔗 topaz cause serves him right if it was.
01:32 🔗 SketchCow I heard he was trying to use sed and awk into his perl compost because he was cooking a muffin
01:32 🔗 SketchCow Luckily his C++ preprocessor was fibbling the foorogotz and we didn't blow up a nearby church
01:32 🔗 SketchCow This is all in the script
01:33 🔗 arrith is the mediawiki in the debian/ubuntu repos not an option for some reason?
01:33 🔗 arrith for all i know it's really old or no one uses it or it isn't there
01:33 🔗 SketchCow Your account will be canceled in the next 24 hours (usually much sooner).
01:33 🔗 arrith but apt installing something is easy
01:33 🔗 SketchCow Goodbye, Webfaction
01:33 🔗 SketchCow Onto rackspace
01:33 🔗 SketchCow Good for me to have $700 out in the cloud
01:34 🔗 topaz you are guaranteed to get $700 back, but it may not all be your $700.
01:34 🔗 SketchCow Well, actually
01:34 🔗 SketchCow You know what?
01:34 🔗 SketchCow Fuck it
01:34 🔗 SketchCow One more fucking day of fucking viruses on the fucking mediawiki.
01:35 🔗 SketchCow Because fuck it, I don't need this shit.
01:35 🔗 SketchCow I can handle the giggling bucktooth nerd feedback that we're "hacked" for 24 hours until I cool down
01:36 🔗 SketchCow Rackspace doesn't look like a good fit.
01:38 🔗 topaz SketchCow: sorry for the hassle. I seriously did not mean to provoke you into trying to fix this right away, I was just talking about it this afternoon because I was stymied that I seemed to be the only person afflicted.
01:39 🔗 dnova what exactly happened
01:40 🔗 dnova I run mediawiki too and I'm scared
01:41 🔗 kennethre do we just want a wiki?
01:41 🔗 kennethre if so, github has a great wiki service
01:42 🔗 kennethre powered by git, so we're not relying on them either
01:51 🔗 SketchCow Yeah, no.
01:51 🔗 SketchCow I'm speaking with the host for textfiles.com. He's a fan, he's good, he helps me.
01:51 🔗 SketchCow We'll straighten this shit right out
01:52 🔗 BlueMax What's up SketchCow
02:08 🔗 SketchCow Three sniper bullets short of a high school shooting
02:08 🔗 hybernaut hang in there, brother
02:08 🔗 hybernaut if there's anything I can help with, let me know
02:09 🔗 SketchCow I'm just going to go with the guy, I'm sure.
02:09 🔗 SketchCow I like The Guy
02:10 🔗 SketchCow Also, I am poor these days.
02:10 🔗 SketchCow $300 for 5 years makes sense.
02:10 🔗 SketchCow $240 a year? Not so much.
02:24 🔗 SketchCow Cheered up.
02:25 🔗 SketchCow Going with the textfiles.com host, he's been there for me over the years.
02:25 🔗 SketchCow Also, the Musee' Mechanique just gave me permission to film in there.
02:25 🔗 SketchCow No lights, no wires on floor, one guy, one camera.
02:25 🔗 closure ah, fun place
02:25 🔗 SketchCow Yes!
02:25 🔗 SketchCow I knew they likely got a lot of shit-ass crews.
02:25 🔗 hybernaut that should be fun
02:25 🔗 closure liked it better when it was on the cliffs
02:25 🔗 dashcloud that should be really awesome
02:26 🔗 dashcloud I went there once and it was an awesome place
02:26 🔗 SketchCow > Please cancel my "archiveteam" account effective immediately. I would like to take advantage of your 60 day money back guarantee and request a full refund.
02:26 🔗 SketchCow I confirm that your account has been closed and a full refund has been sent.
02:26 🔗 SketchCow > Your site is for nerds telling geeks how to act like robotos.
02:26 🔗 SketchCow Can you tell us what exactly did you find hard/difficult in our panel?
02:26 🔗 SketchCow <silence of the grave>
02:26 🔗 SketchCow Did you consult our docs at docs.webfaction.com or try to contact support about it?
02:30 🔗 kennethre Note to self: never recommend anything to SketchCow. Ever.
02:32 🔗 SketchCow many have tried / few have not died
02:32 🔗 SketchCow I have this hilarious exchange
02:33 🔗 SketchCow Where Tim Berners Lee is talking to people about whether he has the old webserver code
02:33 🔗 SketchCow And he says not, and mails us at the archive for a copy
02:35 🔗 hybernaut and the archive says...
02:35 🔗 SketchCow Also, it's my fault, I should have known that the guy who has sex with Heroku servers and made it his full-time job would probably look at a web host that lauds the experience of steering a hay truck with bash scripts and think "oh, he's gotta get in on that delicious buffet"
02:36 🔗 SketchCow Like, if you're not into spicy food and you ask the guy whose cube smells like you got punched in the face by Vindaloo where's a good place to get grub
02:36 🔗 SketchCow The archive is at a loss, but thanks to the links, we'll be in better shape
02:40 🔗 * SketchCow kennethre NO GETTING BITTER
02:41 🔗 kennethre My quest in life is now to start a user-content company that gets massively popular and suddenly kill it without warning.
02:41 🔗 SketchCow No, no, it doesn't taste right if you plan ahead
02:42 🔗 SketchCow It's gotta be a surprise
02:42 🔗 kennethre haha
02:42 🔗 SketchCow You have to think it's all for the good of humanity and speak at SXSW and in interviews
02:42 🔗 SketchCow Then when you sit at your desk, head down, while they haul out the beanbags, you look up with seething eyes
02:42 🔗 kennethre changing the world
02:42 🔗 yipdw Soulant: cloud spirituality
02:42 🔗 kennethre I'm laughing so hard right now
02:43 🔗 SketchCow I will crush your dreams as mine have been crushed
02:43 🔗 yipdw also sex with Heroku servers could be awesome
02:43 🔗 yipdw like
02:43 🔗 yipdw need more?
02:43 🔗 yipdw HEROKU SCALE
02:43 🔗 SketchCow yeah
02:43 🔗 kennethre yipdw: it is indeed
02:43 🔗 SketchCow They surround you with their little mouths
02:43 🔗 SketchCow All want some shared you
02:44 🔗 kennethre It goes to 11.
02:44 🔗 yipdw that said, I deployed a CouchDB-based app to heroku + cloudant today
02:44 🔗 yipdw and after trying to hook up the app to the Cloudant DB I saw the "You're Doing It Wrong" talk that specifically mentions why couchrest_model is bad
02:44 🔗 yipdw guess what I'm using
02:44 🔗 yipdw so uh I don't know what the point of that was, except thanks Heroku, I guess
02:45 🔗 kennethre <3
02:51 🔗 undersco2 <yipdw> need more?
02:51 🔗 undersco2 <yipdw> HEROKU SCALE
02:51 🔗 undersco2 hahaha
02:51 🔗 yipdw if you're a pimp, you can even use heroku ps to keep track of all the bitches you've choked
02:52 🔗 SketchCow Here, cheering up emo kennethre
02:52 🔗 SketchCow http://www.archive.org/~jake/free-couch/
02:52 🔗 SketchCow Free Couches on Craigslist.
02:52 🔗 kennethre haha awesome
02:52 🔗 Zuu- :D
02:52 🔗 topaz wtf?
02:52 🔗 hybernaut yes, but when can you deliver the couch? I live on the 4th floor.
02:53 🔗 kennethre not bad: http://www.archive.org/~jake/free-couch/5Id5L75W43K23F53J4c1ob627dceda7c51c7d.jpg
02:53 🔗 topaz crazybeans.
02:58 🔗 SketchCow http://i.qkme.me/36ckpl.jpg
03:05 🔗 SketchCow OK, I'm going to call it a day
03:05 🔗 SketchCow I got some stuff done, but I'm behind
03:05 🔗 SketchCow Tomorrow, tomorrow things get done.
03:05 🔗 SketchCow I'm moving URLTE.AM to my textfiles.com machine, it's a fucking static site and people have been very patient with me about it.
03:09 🔗 SketchCow OK.
03:09 🔗 SketchCow Over the next day or two, urlte.am should go to textfiles.com. When it's looking right, I'll plunk our replacement back.
03:11 🔗 SketchCow DNS servers:
03:11 🔗 SketchCow ns1.easydns.com
03:11 🔗 SketchCow ns2.easydns.com
03:11 🔗 SketchCow remote1.easydns.com
03:11 🔗 SketchCow remote2.easydns.com
03:11 🔗 SketchCow ns3.easydns.org
03:11 🔗 SketchCow ns6.easydns.net
03:11 🔗 SketchCow Well, that was fast.
03:13 🔗 SketchCow OK, it'll be good shortly.
03:13 🔗 SketchCow Good, something off the fucking plate
03:13 🔗 SketchCow Now, to go see SF
03:13 🔗 SketchCow If kennethre only LIVED here, we'd be having DINNER
03:13 🔗 hybernaut enjoy your evening, brother
03:13 🔗 yipdw on a fucking plate
03:13 🔗 kennethre SketchCow: :(
03:14 🔗 kennethre SketchCow: I'll be in san jose next thursday
03:14 🔗 kennethre er, santa clara
03:19 🔗 dashcloud some of you may have seen this already, but it's a really awesome archive thing, so I'd like to share it again: http://infovore.org/archives/2012/02/26/a-year-of-links/
03:20 🔗 hybernaut that is a good one
04:06 🔗 Coderjoe_ "Do you want to type them in by hand?"
04:07 🔗 Coderjoe_ shit... i wish i still had that "The Internet Yellow Pages" book
04:08 🔗 Coderjoe_ perhaps I still do have it in a box somewhere
04:09 🔗 Coderjoe_ metadata: apparently always my enemy
04:09 🔗 Coderjoe_ even such simple things as "what books are in this box?"
04:11 🔗 topaz oh god, the Internet Yellow Pages
04:11 🔗 topaz Michael Wolff, right?
04:14 🔗 Coderjoe_ i don't remember
04:17 🔗 Coderjoe looks like there were several books by that name
04:18 🔗 Coderjoe iirc, the cover looked like this: http://ecx.images-amazon.com/images/I/414R8NBC1XL._SL500_AA300_.gif
04:24 🔗 swebb2 SketchCow: If you're still accepting images for your next talk, Mine is on the wiki: http://archiveteam.org/index.php?title=User:Scumola
04:27 🔗 swebb2 Also, tweets mentioning 'oscar' during last night's broadcast (I've got to do something with all of that data): https://skitch.com/scumola/8fmex/steves-twitter-trend-grapher
04:28 🔗 Coderjoe "oscar 17 crystal 1"?
04:28 🔗 Coderjoe assuming billy crystal
04:29 🔗 Coderjoe but what is the number?
04:29 🔗 swebb2 That just happens to be the values on the graph where my mouse was. Ignore that. :)
04:31 🔗 swebb2 Also, I've been unwinding all urls from url shorteners that have appeared on the twitter spritzer feed since Oct, 2011. I've got about 23M unwound urls now and counting.
04:33 🔗 chronomex fuck yeah
04:33 🔗 chronomex urlteam++
04:34 🔗 swebb2 Yea, I haven't forgotten about urlteam. :)
04:37 🔗 swebb2 I've got the whole twitter thing down to a science now. I have nagios watching the stream, so I get paged when the stream goes down or I've lost more than a couple minutes of data. I archive (bzip2) the data in minute-buckets and then go through each minute bucket and unwind all of the urls recursively and insert into a DB. I keep one week's worth of the full tweets in the DB also for
04:37 🔗 swebb2 twitter-based searches if I need it.
04:37 🔗 Coderjoe I've got a small sqlite database of goo.gl short urls that showed up in a torrent listing site rss feed
04:38 🔗 Coderjoe how large do the minute buckets generally get?
04:38 🔗 swebb2 Coderjoe: That's cool. I was initially scraping the url-shorteners too, but I figured that I wanted the ones that people were actually using.
04:39 🔗 Coderjoe well, mine is because I wanted to use the feed, but some people submit the torrentinfo page link on another site, while pretty much everything else is a link to download the .torrent file
04:39 🔗 swebb2 each minute is only like 5MB or so uncompressed. Tweets are only about 2k each and the spritzer feed is like 2% of the full firehose.
04:40 🔗 swebb2 but it's enough to get a good sample size and most urls are re-tweeted anyway, so I am probably capturing most of the 'important' urls.
04:40 🔗 swebb2 each hour is about 350MB of data.
04:41 🔗 Coderjoe and some of them are behind goo.gl urls. so I wrote a php script that pulls the feed, unshortens goo.gl urls (and saves the results in a database for later lookup, so I don't hammer google each time), and then convert the torrentinfo links to download links
04:41 🔗 Coderjoe tweets are 2k?
04:41 🔗 swebb2 a day is about 6.9GB uncompressed, 700MB compressed
04:41 🔗 swebb2 Yea, thereabouts. It varies, but 2k is a good ballpark figure.
04:41 🔗 Coderjoe for a 140-char message?
04:42 🔗 Coderjoe that's a lot of overhead
04:42 🔗 swebb2 yea, there's a bunch of metadata in there. Lat/Lon, user profile stuff, ...
04:42 🔗 swebb2 17:54:58 +0000 2011","profile_sidebar_fill_color":"ffffff","id":306331380,"utc_offset":-21600,"favourites_count":7,"url":null},"in_reply_to_screen_name":null,"truncated":false,"id":174353887023730688,"created_at":"Tue Feb 28 04:43:00 +0000 2012"}
04:42 🔗 swebb2 i0.twimg.com\/profile_background_images\/414657846\/tumblr_lyqhl7NzZL1r972d6o1_500.png","profile_background_color":"EBEBEB","protected":false,"id_str":"306331380","profile_background_tile":false,"profile_image_url_https":"https:\/\/si0.twimg.com\/profile_images\/1847372264\/sel_normal.png","name":"AA\u2665","default_profile_image":false,"follow_request_sent":null,"created_at":"Fri May 27
04:42 🔗 swebb2 ription":"","is_translator":false,"show_all_inline_media":true,"following":null,"profile_background_image_url":"http:\/\/a2.twimg.com\/profile_background_images\/414657846\/tumblr_lyqhl7NzZL1r972d6o1_500.png","default_profile":false,"profile_link_color":"b6ffb6","time_zone":"Central Time (US & Canada)","verified":false,"geo_enabled":false,"profile_background_image_url_https":"https:\/\/s
04:42 🔗 swebb2 ser":{"notifications":null,"friends_count":235,"profile_sidebar_border_color":"DFDFDF","screen_name":"andy_arceo","contributors_enabled":false,"lang":"es","statuses_count":28438,"profile_use_background_image":true,"location":"","listed_count":0,"profile_text_color":"000000","followers_count":203,"profile_image_url":"http:\/\/a0.twimg.com\/profile_images\/1847372264\/sel_normal.png","desc
04:42 🔗 swebb2 {"contributors":null,"text":"y pulido me aplico la flechita OTRA VEZ","entities":{"urls":[],"user_mentions":[],"hashtags":[]},"in_reply_to_user_id":null,"place":null,"retweeted":false,"coordinates":null,"retweet_count":0,"source":"web","in_reply_to_status_id_str":null,"geo":null,"in_reply_to_status_id":null,"favorited":false,"in_reply_to_user_id_str":null,"id_str":"174353887023730688","u
04:43 🔗 swebb2 That's one tweet.
04:43 🔗 chronomex yum
04:43 🔗 Coderjoe so the full feed is how much?
04:43 🔗 swebb2 the full feed costs money, but is about 70Mbps uncompressed.
04:43 🔗 Coderjoe whee
04:43 🔗 swebb2 the spritzer feed (the one that I get) is free and it's only 1.5Mbps uncompressed.
04:44 🔗 swebb2 I'd love to get my hands on the other social media feeds like foursquare and others.
04:44 🔗 kennethre i bet we could get twitter to send us dumps
04:45 🔗 kennethre i know some people in ops over there
04:45 🔗 swebb2 they charge money for the data - big-time.
04:45 🔗 swebb2 only the sprtizer feed is free.
04:45 🔗 kennethre yeah but we're archivists, it's different
04:45 🔗 swebb2 plus, the full twitter feed of everything would be huge, I'm guessing.
04:45 🔗 kennethre compressed, it shouldn't be SO bad
04:46 🔗 Coderjoe statuses/firehose
04:46 🔗 Coderjoe Returns all public statuses. The Firehose is not a generally available resource. Few applications require this level of access. Creative use of a combination of other resources and various access levels can satisfy nearly every application use case.
04:46 🔗 kennethre heavily compressed
04:46 🔗 kennethre oh nice
04:47 🔗 Coderjoe Returns a random sample of all public statuses. The default access level, ‘Spritzer’ provides a small proportion of the Firehose, very roughly, 1% of all public statuses. The “Gardenhose” access level provides a proportion more suitable for data mining and research applications that desire a larger proportion to be statistically significant sample. Currently Gardenhose returns, very roughly, 10% of all public st
04:47 🔗 Coderjoe statuses/sample
04:47 🔗 Coderjoe atuses. Note that these proportions are subject to unannounced adjustment as traffic volume varies.
04:48 🔗 swebb2 Just the spritzer feed (2% of the full firehose) is (on my system) ~90GB compressed for everything since Oct 2011.
04:49 🔗 Coderjoe i wonder what level the randomization happens
04:50 🔗 Coderjoe like, would different connections get different results?
04:51 🔗 swebb2 I think that each "hose" is just a modulus of the tweet id. So, the spritzer is $id mod 100 = 1 or something like that.
04:51 🔗 swebb2 so everyone getting the spritzer feed would get the same tweets.
05:19 🔗 yipdw damn, 90GB of tweets
05:20 🔗 yipdw I hope someone has wordclouded that shit
05:20 🔗 Zuu- yipdw, from who?
05:21 🔗 yipdw Zuu-: swebb2's copy of the spritzer feed
05:21 🔗 Zuu- ah
05:46 🔗 Thomas-ED hai, is there a channel for mobileme?
05:46 🔗 DFJustin #memac
07:40 🔗 shaqfu Pardon my ignorance, but is archiving Twitter necessary, given the LoC archive?
07:41 🔗 kennethre swebb2: loc?
07:41 🔗 kennethre oh nvm duh
07:41 🔗 shaqfu Library of Congress
07:41 🔗 kennethre yeah
07:44 🔗 shaqfu Is it to provide a public dump vs. request only?
08:21 🔗 * joepie91 is running a few boxes on the mobileme thing
08:21 🔗 joepie91 er
08:21 🔗 * joepie91 is running the mobileme thing on a few boxes *
08:22 🔗 ersi Jump over to #memac, that's where the mobileme party is going
08:22 🔗 joepie91 o
08:23 🔗 ersi ^_^
08:40 🔗 SketchCow Hi,
08:40 🔗 SketchCow Psst.
08:40 🔗 SketchCow I've talked to people.
08:40 🔗 SketchCow Inside people.
08:41 🔗 SketchCow The LoC tweet archive?
08:41 🔗 SketchCow I wouldn't buy a lot of stock in it.
08:41 🔗 SketchCow Wouldn't bet the farm, you know.
08:43 🔗 SketchCow Wouldn't bet the farm, you know.Juuuust sayin'
08:44 🔗 chronomex figures
08:47 🔗 SketchCow Yeah.
08:47 🔗 SketchCow So do whatever we can
08:47 🔗 SketchCow Oh, that new wikimedia project. Daddy likes.
09:02 🔗 SketchCow FUCKING FINALLY.
09:02 🔗 SketchCow URLTE.AM IS BACK
09:02 🔗 radii SketchCow: thank you!
09:02 🔗 radii oh, not me.
09:02 🔗 SketchCow Now, let's not have THAT happen again.
09:02 🔗 SketchCow What are you up to, radii?
09:03 🔗 radii I've been saying "Free the edit stream!" to every wikipedian I come across for years, so when I saw the WM Commons project I figured I should get online
09:03 🔗 SketchCow Ah, OK.
09:03 🔗 ersi Welcome aboard
09:03 🔗 SketchCow URLTE.AM finally back.
09:04 🔗 radii but alas I have a timezone and dayjob challenge, so I'm just going to idle here for the next long while.
09:04 🔗 ersi We're exactly like Wikipedia is in how we run things
09:04 🔗 chronomex what, no, deletionists are not welcome here.
09:05 🔗 ersi but with less drama, less deletes and a lot more rape
09:05 🔗 chronomex no deletes.
09:05 🔗 ersi no is less!
09:05 🔗 * chronomex shakes head
09:08 🔗 * radii deletes my awakeness
09:08 🔗 * ersi injects a coffee machine into radii
09:08 🔗 radii ersi: in 8 hours I will agree with that.
09:11 🔗 ersi http://web.me.com/stressakustik
09:11 🔗 ersi awesome page
09:20 🔗 SketchCow Tomorrow, we see about the wiki.
09:21 🔗 SketchCow Oh, I will love THAT overlap.
09:38 🔗 dcmorton damn.. while working 12 hour days kicks ass on payday, the rest of the time it sucks
09:48 🔗 kennethre dcmorton: I was just reading a blog post about that: http://thecodist.com/article/why_i_don_39_t_do_unpaid_overtime_and_neither_should_you
10:00 🔗 joepie91 worth a read: http://www.stevepavlina.com/blog/2006/07/10-reasons-you-should-never-get-a-job/
10:08 🔗 kennethre http://qaa.ath.cx/LoseThos.html
10:09 🔗 kennethre ^ i thought the post was jumping the shark, but if you take the time to watch the videos and look at his twitter stream, I think the other's quite correct
10:14 🔗 kennethre joepie91: that post is a bit rediculous in parts
10:14 🔗 joepie91 kennethre: it's hyperbole, obviously :P
10:15 🔗 kennethre joepie91: this guy must have had a really soul-crushing gig :)
10:15 🔗 joepie91 nah
10:15 🔗 joepie91 imo hyperbole is necessary there
10:15 🔗 kennethre for the masses I'd agree
10:15 🔗 joepie91 it throws people out of their normal line of thinking and makes them reconsider :)
10:16 🔗 joepie91 yes, exactly
10:16 🔗 joepie91 those that don't need it will recognize it as being hyperbole, usually :)
10:16 🔗 kennethre I used to do hyperbole posts
10:16 🔗 kennethre I feel like I grew out of it though
10:16 🔗 joepie91 also, lookie at http://www.ramhost.us/?page=status-core
10:16 🔗 joepie91 if you didn't read #memac yet
10:16 🔗 joepie91 :P
10:16 🔗 joepie91 the bottom two graphs
10:16 🔗 kennethre geeze
10:16 🔗 kennethre haha
10:16 🔗 * joepie91 enjoys seeing those graphs pealk
10:16 🔗 joepie91 peak*
10:16 🔗 joepie91 I just briefly hit 156mbit total
10:17 🔗 joepie91 before that I was on 130mbit total for a while
10:17 🔗 joepie91 60mbit down and 70mbit up or so
10:17 🔗 joepie91 I do have the idea I'm going to be running out of bandwidth very soon though.. ._.
10:17 🔗 joepie91 or traffic*, rather
10:17 🔗 kennethre maybe they only limit incoming traffic?
10:18 🔗 joepie91 88.211 GB of 600 GB (14.7%)
10:18 🔗 joepie91 oops
10:18 🔗 joepie91 :x
10:18 🔗 joepie91 that was on 3GB a few hours ago
10:18 🔗 kennethre per box or per account?
10:18 🔗 joepie91 that's that one server
10:19 🔗 joepie91 I have 5 other servers, but I can afford for this one to be cut off due to bandwidth exceeding
10:19 🔗 kennethre kill it and start a new one when you're doing :)
10:19 🔗 kennethre *you're done
10:19 🔗 joepie91 :P
10:19 🔗 joepie91 it's just a storage server
10:19 🔗 kennethre oh man
10:19 🔗 joepie91 doesn't really do much
10:19 🔗 kennethre i think i used to use these guys
10:19 🔗 kennethre I did!
10:19 🔗 joepie91 ramhost?
10:19 🔗 joepie91 they're <3
10:19 🔗 kennethre no they were horrible
10:19 🔗 kennethre worst support ever
10:19 🔗 joepie91 you sure you're talking about ramhost?
10:19 🔗 kennethre pretty sure
10:20 🔗 kennethre i don't know of any others that offer both openvz and kvm
10:20 🔗 joepie91 buyvm does
10:20 🔗 kennethre though it was openvz and xen at the time
10:20 🔗 joepie91 and so does some .nz provider
10:20 🔗 kennethre mabye it was someone else
10:20 🔗 joepie91 nah
10:20 🔗 joepie91 ramhost never did xen
10:20 🔗 kennethre hmm
10:20 🔗 * kennethre searches the logs
10:20 🔗 kennethre ah
10:20 🔗 kennethre vpslink.com
10:20 🔗 kennethre so, so terrible
10:21 🔗 joepie91 ramhost always scores really high on support, the one issue some people have with them is that they're rather straightforward and to the point
10:21 🔗 joepie91 :P
10:21 🔗 joepie91 and some people experience that as 'unfriendly'
10:21 🔗 joepie91 mm
10:21 🔗 joepie91 never heard of them
10:21 🔗 joepie91 OH
10:21 🔗 joepie91 this reminds me
10:21 🔗 joepie91 I have an unmetered santrex box laying around
10:21 🔗 kennethre not as bad as liquidweb though
10:21 🔗 kennethre no one's as bad as them
10:21 🔗 joepie91 might as well make it work on this as well
10:21 🔗 joepie91 theoretically that box was terminated in december
10:21 🔗 joepie91 >.>
10:22 🔗 joepie91 according to their administration, that vps doesn't exist anymore
10:22 🔗 joepie91 according to my ssh client, it does
10:23 🔗 joepie91 and I've heard mixed reviews about liquidweb
10:23 🔗 joepie91 and I'm pretty sure that santrex is worse than liquidweb :P
10:23 🔗 joepie91 far worse
10:23 🔗 kennethre doubt it
10:23 🔗 kennethre haha
10:23 🔗 joepie91 well
10:23 🔗 ersi It's amazing how many interpret straight forwardness as unfriendly
10:23 🔗 joepie91 let me make a list :P
10:23 🔗 ersi Or being a douche
10:23 🔗 joepie91 I ordered a santrex vps
10:23 🔗 ersi I'm usually straight forward
10:23 🔗 joepie91 after the billing failed and had to be done manually
10:23 🔗 joepie91 it assigned the wrong amount of RAM
10:23 🔗 joepie91 to my vps
10:23 🔗 kennethre classy
10:23 🔗 joepie91 24 hours after that it went offline for 2 hours
10:23 🔗 joepie91 after which the RAM suddenly fixed itself
10:24 🔗 joepie91 however, while I picked debian, it had installed centos
10:24 🔗 joepie91 they also did not send me an email where to find my solusvm panel
10:24 🔗 joepie91 so I had to google for the address to figure out how to reinstall my box
10:24 🔗 joepie91 as the santrex panel itself did not allow for doing that and only showed basic stats
10:24 🔗 joepie91 that was the first 24 hours after placing my order.
10:25 🔗 joepie91 I had an uptime monitor checking my site
10:25 🔗 joepie91 not a day has gone by where I did *not* get a downtime alert
10:25 🔗 kennethre hahaha
10:25 🔗 joepie91 for months, every day it would go down multiple times
10:25 🔗 kennethre mine was worse
10:25 🔗 kennethre *not worse
10:25 🔗 joepie91 for minutes to hours
10:25 🔗 joepie91 now comes the best part
10:25 🔗 joepie91 they told me they would 'migrate to new hardware and renumber'
10:25 🔗 joepie91 "somewhere next week"
10:25 🔗 joepie91 no date or time was given
10:25 🔗 joepie91 and in the end, it was suddenly offline for two days somewhere at the end of that week
10:26 🔗 joepie91 without an announcement beforehand with a more specific time
10:26 🔗 joepie91 after two days it was back up with a new IP
10:26 🔗 joepie91 what they did *not* tell me, was that, besides upgrading hardware and renumberin, they also moved my server to an *entirely different datacenter*
10:26 🔗 joepie91 which messed up the redundancy of my storage grid to a degree
10:26 🔗 kennethre ah, hosting
10:26 🔗 joepie91 they moved it from ecatel to leaseweb
10:27 🔗 kennethre ah that is kind of what happened to me
10:27 🔗 joepie91 after that I had constant issues with downtimes
10:27 🔗 kennethre i wanted a refund for various reasons
10:27 🔗 joepie91 a few times it went down for a few days
10:27 🔗 kennethre and they decided to give mea dedicated box instead from some other company
10:27 🔗 joepie91 without any further announcement
10:27 🔗 kennethre that is run by the same people
10:27 🔗 kennethre joepie91: forgot to pay the electric bill
10:27 🔗 joepie91 sending a support ticket would just return "look at our status page " ( which, I might add, had no reference to the outage)
10:27 🔗 kennethre joepie91: two more user payments and we can be back online!
10:27 🔗 joepie91 at one point I suspect the server was seized, when it was mysteriously offline for 5 days
10:27 🔗 joepie91 and lol
10:28 🔗 joepie91 and then I missed an invoice, and my server was 'terminated'
10:28 🔗 joepie91 and till the day of today it's still running
10:28 🔗 ersi taken out back and shot?
10:28 🔗 joepie91 2 of my 3 IPs have been renumbered to another vps
10:28 🔗 joepie91 reassigned*
10:28 🔗 joepie91 the third still works
10:28 🔗 ersi lol
10:28 🔗 joepie91 so yeah
10:28 🔗 joepie91 santrex.
10:28 🔗 joepie91 don't touch it with a ten feet pole.
10:28 🔗 joepie91 I'm pretty sure that was worse than liquidweb :P
10:28 🔗 kennethre i have a friend that worked there
10:28 🔗 kennethre and it's apparently run by like the greediest person in the world
10:29 🔗 joepie91 yes, I know a friend of the owner and he said the exact same thing
10:29 🔗 joepie91 or well, 'know' is a big word
10:29 🔗 joepie91 but I occasionally talk to him
10:29 🔗 kennethre hahaha
10:29 🔗 kennethre what'd they say?
10:30 🔗 joepie91 pretty much that it's someone that doesn't really give a shit about whether servers are running
10:30 🔗 joepie91 as long as the invoices are paid
10:30 🔗 joepie91 (by customers)
10:30 🔗 joepie91 since I don't know the owner personally I obviously don't know how true that is
10:30 🔗 joepie91 :P
10:30 🔗 joepie91 but it would not surprise me at all
10:31 🔗 kennethre pretty much exactly what i heard too
10:31 🔗 joepie91 some other fun ones:
10:31 🔗 joepie91 w2servers and veeble
10:32 🔗 joepie91 w2servers was one of lyron foster's projects
10:32 🔗 joepie91 you might have heard of him
10:32 🔗 joepie91 to get an idea: http://www.google.nl/webhp?sourceid=chrome-instant&ix=seb&ie=UTF-8&ion=1#hl=nl&safe=off&output=search&sclient=psy-ab&q=w2servers&pbx=1&oq=&aq=&aqi=&aql=&gs_sm=&gs_upl=&fp=d60c0cceab75dfa0&ix=seb&ion=1&bav=on.2,or.r_gc.r_pw.r_cp.r_qf.,cf.osb&biw=1366&bih=680
10:32 🔗 kennethre oh fun
10:33 🔗 joepie91 I had one of those $24/yr servers there as a throwaway server
10:34 🔗 kennethre man hosting is shady
10:34 🔗 kennethre i hate how that's always the case too
10:34 🔗 joepie91 and then there's veeble, which is, without malicious intent, just ran by someone that doesn't have a clue
10:35 🔗 joepie91 soooo
10:35 🔗 joepie91 let's get that santrex box archiving
10:36 🔗 kennethre joepie91: ah he worked for hostgator
10:36 🔗 kennethre joepie91: that explains everything
10:36 🔗 joepie91 hahaha
10:36 🔗 joepie91 even hostgator isn't /that/ bad, though :P
10:36 🔗 kennethre shared hosting always sucks
10:37 🔗 joepie91 hmm, I have to disagree
10:37 🔗 kennethre the only place i've found that to not be true is webfaction
10:37 🔗 kennethre i'm sure there are others
10:37 🔗 kennethre but they're very rare
10:37 🔗 joepie91 I have shared hosting for my personal site at a dutch hosting company
10:37 🔗 joepie91 webruimtehosting
10:37 🔗 joepie91 sounds incredibly scammy, but they're very good
10:37 🔗 joepie91 it's like 24 euro a year with free domain included, and their server is *fast*
10:38 🔗 joepie91 and usually 6 out of 7 servers hit 100% uptime each year
10:38 🔗 joepie91 the other is usually 99,99% or so
10:38 🔗 joepie91 I've been rather impressed with them :P
10:38 🔗 kennethre you get what you pay for :)
10:38 🔗 joepie91 heh, never in history has such a widespread saying been disproven so thoroughlu
10:38 🔗 joepie91 thoroughly*
10:40 🔗 * joepie91 wonders how much the santrex box will push in terms of bandwidth
10:41 🔗 joepie91 ooo, cinfu box is now hitting 80mbit :D
10:41 🔗 * joepie91 is enjoying looking at the terminal-based blinkenlights
10:47 🔗 joepie91 ... oops, I just took down the wrong network interface on my santrex box ._.
10:47 🔗 joepie91 that was stupid
10:48 🔗 joepie91 that was stupid.
10:48 🔗 joepie91 um, anyone has any idea how to fix this? :3
10:54 🔗 db48x ouch. when Pandora doesn't like your browser, they really lock you out
10:54 🔗 db48x you can't even find out what Pandora _is_
10:56 🔗 joepie91 db48x: their loss? :P
10:58 🔗 db48x yea, I've got other music I can listen to
10:58 🔗 joepie91 for Free music: http://www.jamendo.com/
10:58 🔗 joepie91 for everything else: http://www.vk.com
10:58 🔗 db48x indeed
10:59 🔗 joepie91 preferably jamendo, ofc :)
10:59 🔗 db48x I'm listening to http://www.youtube.com/watch?v=ME5urFBf0kk&feature=youtu.be&hd=1 at the moment
14:27 🔗 FAMAS|2 sketchcow
14:31 🔗 ersi He's here and reading/responsing occationally
14:31 🔗 ersi what's up?
14:32 🔗 FAMAS|2 i just came
14:32 🔗 FAMAS|2 to thank this ragtag team of yours
14:33 🔗 ersi ah, hehe
14:34 🔗 ersi on behalf of what ever little I've contributed I'll say you're very welcome
14:44 🔗 FAMAS|2 ersi
14:44 🔗 FAMAS|2 how do i send any sites i have done myself?
14:48 🔗 ersi Good question, dunno if there's a good answer - besides possibly making a upload to the Internet Archive or making an archive(zip,rar,tar,etc) available somewhere (http/torrent etc)
14:48 🔗 FAMAS|2 where do the archiveteam keep their dumps?"
14:49 🔗 ersi we're a ragtag team like you said
14:49 🔗 FAMAS|2 so no central storage
14:49 🔗 ersi we share stuff between ourselfs or make them public in other ways
14:49 🔗 FAMAS|2 i guess that works, for now
14:50 🔗 FAMAS|2 ersi, try using osiris SPS
14:50 🔗 FAMAS|2 check it out now
14:50 🔗 ersi in some cases, Internet Archive swallows whatever we produce after it has been made nice
14:51 🔗 joepie91 FAMAS|2: also have a look at http://retroshare.sf.net/
14:52 🔗 ersi Or FreeNet, or tahoe-lafs.. or.. or..
14:52 🔗 FAMAS|2 i got that thing yesterday from anonops
14:52 🔗 joepie91 :P
14:52 🔗 joepie91 actually tahoe-lafs is something entirely different
14:52 🔗 FAMAS|2 ersi, check osiris SPS
14:52 🔗 FAMAS|2 it may blow your head off
14:52 🔗 joepie91 tahoe-lafs is storage, not sharing/collaboration/communication
14:52 🔗 Coderjoe git-annex
14:53 🔗 ersi I don't get turned on by that, so I doubt it FAMAS|2
14:53 🔗 FAMAS|2 joepie91, why not use the existing irc network to share?
14:53 🔗 joepie91 FAMAS|2: what do you mean?
14:53 🔗 ersi "sharing/collaboration/communication"
14:53 🔗 FAMAS|2 you can send and recieve file via irc
14:53 🔗 joepie91 yes...
14:53 🔗 joepie91 which is centralized...
14:53 🔗 FAMAS|2 ersi, let me get a tutorial for osiris
14:53 🔗 ersi whatever we are doing now works pretty well
14:53 🔗 ersi FAMAS|2: stop, I don't care
14:54 🔗 joepie91 the whole point of retroshare, osiris, etc, is that it's decentralized
14:54 🔗 ersi if you want to do something with any data someone here produces, do it
14:54 🔗 ersi if you feel like doing something, do it
14:55 🔗 FAMAS|2 http://www.metacafe.com/watch/yt-0eNP3RT7rmY/osiris_sps_tutorial_create_a_portal/
15:18 🔗 DFJustin FAMAS|2: http://www.archive.org/details/archiveteam
15:21 🔗 Schbirid ugh, that smf forum mirror downloaded 8GB of which only 2.4G remained because of the not-saving the *msg* clutter
15:36 🔗 Schbirid ndcu ROCKS
15:40 🔗 emijrp are there file extension larger than 4 chars?
15:41 🔗 DFJustin .torrent
15:41 🔗 DFJustin .vbproj
15:41 🔗 emijrp lol
15:42 🔗 Schbirid .aria2c
15:42 🔗 nitro2k01 Plenty
15:43 🔗 nitro2k01 But many people are still afraid of long file extensions
15:43 🔗 emijrp and extensions with more than 1 dot?
15:43 🔗 nitro2k01 Technically no, but there's .tar.gz
15:43 🔗 Schbirid my.random.text.files.txt
15:43 🔗 nitro2k01 Which is really a tape archive which is then compressed with gzip
15:44 🔗 nitro2k01 It's modular. You can compress anything with gzip like that
15:44 🔗 nitro2k01 .sql.gz
15:44 🔗 nitro2k01 Whatever
15:44 🔗 nitro2k01 So, it's really two independent file extensions, is what I'm saying
15:45 🔗 emijrp k tahnks
15:46 🔗 nitro2k01 klol np
15:49 🔗 Schbirid wow, smf is retarted
15:49 🔗 emijrp #klol is awaiting for some care
15:49 🔗 Schbirid eg on &topic=6461.0 there are links to the prev and next thread
15:50 🔗 Schbirid their targets are &topic=6461.0;prev_next=prev and &topic=6461.0;prev_next=next
15:50 🔗 Schbirid and yes, you end up at those URLs for topics that actually have proper unique different IDs, eg 6460 and 6462 (examples, not real)
15:50 🔗 Schbirid idiotic
15:50 🔗 Coderjoe what defines ordering? most forums I've used reorder threads based on activity
15:51 🔗 nitro2k01 Schbirid: What if thread 6460 gets deleted between the time the user opens the page and clicks the link...
15:51 🔗 Schbirid Coderjoe: ah, that might be why i have much different IDs as prev and next (eg 2523)
15:52 🔗 Schbirid wow, phpbb2 does it too
15:53 🔗 Schbirid nitro2k01: bad luck? what is the difference?
15:53 🔗 Coderjoe i get the impression that doing SMF with wget --mirror will be a pain. perhaps a special toolset?
15:53 🔗 Schbirid i am writing my experiences
15:54 🔗 Coderjoe it does? I could have sworn that phpbb actually used params like &start=30 for page three, &start=45 for page four, etc (given a posts-per-page of 15, at least)
15:55 🔗 nitro2k01 It may be annoying for you, but it makes sense to have those links
15:55 🔗 Coderjoe (and where I mean phpbb2 when I say phpbb)
15:56 🔗 Schbirid nitro2k01: how?
15:56 🔗 Schbirid example here http://rome.ro/smf/index.php?topic=6204.0
15:57 🔗 Schbirid not pagination of a single topic, i mean the previous/next topic
15:57 🔗 nitro2k01 Oh wow
15:57 🔗 nitro2k01 I thought it would redirect
15:58 🔗 nitro2k01 THIS doesn't make sense
15:58 🔗 Coderjoe oh. I generally never use those
15:58 🔗 Coderjoe when I do, it is accidental and I meant prev/next page of the current thread
15:59 🔗 nitro2k01 "I don't always use prev/next, but when I do, it's accidental"
15:59 🔗 Coderjoe stay frosty, my friends
16:02 🔗 Schbirid step 1: 7zipping all i got before i start decluttering
16:06 🔗 Coderjoe uhoh. I fear the size of this mobileme profile... username is "emusicbox"
16:07 🔗 Schbirid heh
16:07 🔗 Coderjoe feed.xml is taking awhile
16:09 🔗 Schbirid oh great smf/index.php/topic,1998.0/images/smiles/icon_smile.gif.html
16:09 🔗 Coderjoe yay. leaving the default text/html content-type header
16:10 🔗 Coderjoe I wish browsers were a lot more strict on the handling of content-type
16:10 🔗 Coderjoe forcing website code monkeys to return the correct type
16:11 🔗 Coderjoe - Running wget --mirror (at least 18934 files)...
16:15 🔗 ersi Why would you want to go to the next thread, that's tarded.. what is the index/subforum index for
16:15 🔗 Schbirid oh yes, that idiocy was implied :D
16:15 🔗 Schbirid actually it might be useful if that function only shows unread topics to registered users
16:16 🔗 ersi imo it's retarded
16:16 🔗 ersi larger than a small group of people on a forum makes threads "too diverse" to serve everyones interests
16:16 🔗 Schbirid it also serves the forum index for 404s
16:16 🔗 ersi lol
16:16 🔗 ersi what can I say.. forum software..
16:17 🔗 Schbirid to be fair, this is 4 years old
16:18 🔗 Coderjoe and the web is 20 years old. how to handle 404s should be well known by now
16:22 🔗 Schbirid wait, that was incorrect
16:23 🔗 Schbirid it does serve 404 for nonexisting files
16:23 🔗 Schbirid but not for eg "this thread does not exist"
16:42 🔗 Coderjoe i had to write a workaround for 4chan for awhile, as some servers were using versions of the board code that didn't set the 404 status code
17:25 🔗 shaqfu SketchCow: Ah, yikes; I knew LoC was behind the curve, but I figured Twitter was in their ability to save
17:26 🔗 shaqfu (And how often is metadata@textfiles.com checked?)
18:22 🔗 Nemo_bis to be archived? http://www.openimages.eu/media
19:17 🔗 SketchCow I am metadata@textfiles.com.
19:17 🔗 SketchCow I have a tad of a backfill.
19:18 🔗 SketchCow Are youu Alex?
19:19 🔗 SketchCow Also, everyone will be delighted to know the leak of the JSTOR liberator caused JSTOR to shit and get off the pot.
19:19 🔗 SketchCow They're now arranging to send all their stuff to archive.org (the early stuff)
19:20 🔗 yipdw so who leaked it?
19:20 🔗 SketchCow I don't know, honestly.
19:20 🔗 SketchCow But the fact that it went to the circlejerk hostname does not make me think it's anyone new
19:52 🔗 chronomex our logs are public, yes?
19:52 🔗 soultcer y
19:52 🔗 SketchCow CANDY VAN
19:52 🔗 chronomex mmmkandi
19:52 🔗 soultcer http://badcheese.com/~steve/atlogs/?chan=archiveteam
19:53 🔗 Nemo_bis I always forget it
19:56 🔗 chronomex #archiveteam is forever
19:58 🔗 war9512- Where is all the data stored? ;p
20:00 🔗 chronomex iiiinternetttt
20:10 🔗 SketchCow INTERNETNENTTTT
20:10 🔗 SketchCow I just did an interview with NZ tech podcast.
20:10 🔗 SketchCow PROBABLY was a little radical sounding
20:12 🔗 ersi Did you bring it up to 11?
20:14 🔗 SketchCow I said we built the internet like a car with no brakes and now we're retrofitting brakes as it careens down the hill
20:15 🔗 yipdw it's agile development
20:15 🔗 yipdw nobody had a use case for brakes yet.
20:16 🔗 kennethre YOU WOULDN'T DOWNLOAD A CAR
20:16 🔗 dnova I would download so many cars
20:16 🔗 kennethre hehe
20:17 🔗 SketchCow I'M DOWNLOADING A CAR NOW
20:17 🔗 kennethre http://bitcoinmedia.com/uploads/2012/01/You-Wouldnt-Download-A-Car.jpg
20:17 🔗 yipdw when mobile broadband takes off in cars, I can download a car from my car
20:17 🔗 SketchCow I used a variation of that image for a talk from two years ago
20:19 🔗 Coderjoe YOU WOULDN'T STEAL A CAR
20:20 🔗 Coderjoe YOU WOULDN'T SHOOT A COP
20:20 🔗 Coderjoe AND STEAL HIS HELMET
20:20 🔗 * Schbirid poops in it
20:20 🔗 SketchCow TWICE
20:20 🔗 SketchCow YOU ONLY HAVE ONE HEAD AFTERALL
20:20 🔗 SketchCow DON'T OVERSHOOT COPS
20:20 🔗 yipdw otherwise you won't get the pacifist run achievement
20:20 🔗 SketchCow http://blogs.loc.gov/digitalpreservation/2012/02/jason-scott-rogue-archivist/
20:21 🔗 yipdw I thought you described yourself as a free-range archivist
20:21 🔗 Schbirid are there more photos of your LOC talk around?
20:22 🔗 SketchCow I didn't talk at LOC yet
20:22 🔗 SketchCow This is an interview
20:23 🔗 SketchCow I did a PDA talk
20:23 🔗 SketchCow This is just e-mail
20:23 🔗 SketchCow The photo is of me at ROFLsummit
20:23 🔗 Schbirid whoa, totally mixed up your pda talk with being the loc
20:32 🔗 shaqfu SketchCow: Yes, I am, and don't sweat it; you seem insanely busy
20:33 🔗 shaqfu I was just curious if you still checked it at all
20:37 🔗 SketchCow I will be assigning you stuff.
20:37 🔗 SketchCow Shortly
20:37 🔗 shaqfu Thanks
20:37 🔗 SketchCow Today is catchup day
20:37 🔗 shaqfu Like I said, put out your fires first
20:37 🔗 SketchCow A day we call HOW THE FUCK DOES JASON GET SO MUCH DONE
20:38 🔗 Coderjoe before heading in to work today, I began the process of bringing together all my stage6 videos. What should I do about the metadata, which is currently in mysql? just a mysql dump, or should I parse it and output xml or something?
20:39 🔗 Coderjoe and how should I package it for IA?
20:39 🔗 Coderjoe stats (including some about file sizes) at http://wegetsignal.org/stage6.php
20:46 🔗 balrog hmm
20:46 🔗 balrog what was that github grabber?
20:47 🔗 Schbirid balrog: https://github.com/joeyh/github-backup
20:47 🔗 balrog Schbirid: that pulls all branches, right?
20:47 🔗 Schbirid no clue
20:47 🔗 balrog hrm, ok
20:48 🔗 kennethre Schbirid: yes
20:48 🔗 balrog also did he fix that 30-of-each-thing bug?
22:02 🔗 SketchCow I am up for a .txt of the metadata that I can convert
22:28 🔗 SketchCow Who wants an easy scrape project?
22:28 🔗 SketchCow Might be less than 15 minutes of work.
22:28 🔗 SketchCow http://www.1up.com/do/minisite?cId=3148397
22:28 🔗 SketchCow I want a textfile with the date, and then the description, in a textfile.
22:31 🔗 shaqfu One textfile per, or one with all of them?
22:35 🔗 SketchCow Let's go for a textfile with the date in the form of MM-DD-YYYY followed by a one line description ripped from those pages. For the GFW entries only.
22:35 🔗 SketchCow I show it'll be about 130-140 lines.
22:35 🔗 SketchCow I can then add the full podcast to archive.org.
22:35 🔗 shaqfu Gotcha
22:44 🔗 nitro2k01 Why that date format?
22:45 🔗 nitro2k01 Let me see some ISO8601 up in this bitch
22:46 🔗 nitro2k01 MM-DD-YYYY is still ambiguous as far as MM and DD go
22:46 🔗 nitro2k01 Still better than having everything as two digits each, but hey...
22:50 🔗 topaz YYYY-MM-DD > MM-DD-YYYY
22:52 🔗 topaz and having dropped my pearl of wisdom for the day I'm off for home. see y'all later.
22:55 🔗 Coderjoe a .txt of the metadata for each video? alongside the video?
22:55 🔗 SketchCow That date format is ebcause that's what the filenames are, bitch
22:55 🔗 SketchCow It'll go to the other format for the date
22:55 🔗 SketchCow I'll make it all good
22:57 🔗 Coderjoe (my question is about my stage6 data)
22:58 🔗 Coderjoe the metadata I have is uploader, tags, stage6 video ID, original url, title, description, filesize, and possibly date of upload. (not sure if this field was upload date or date I added it to the queue, offhand)
23:00 🔗 Coderjoe I also have more metadata than videos
23:01 🔗 SketchCow That's excellent, actually.
23:01 🔗 SketchCow Yes.
23:02 🔗 SketchCow Want me to give you a place to upload it or do you want to do it.
23:03 🔗 Coderjoe I can do it, I think with some guidance on things like number of videos per item and preferred metadata info format
23:03 🔗 Coderjoe or I can just organize it a bit and upload it for you to ram into IA
23:05 🔗 Coderjoe yeah. looks like the v_added field is the date that stage6 said it was added to the site
23:05 🔗 Coderjoe but I only have that for videos I managed to scrape, I think
23:06 🔗 Coderjoe which appears to be all the videos I indexed (which was sadly not every video on the site)
23:06 🔗 SketchCow What kind of videos are these?
23:07 🔗 Coderjoe technically, AVI files (they're divx's .divx by filename)
23:07 🔗 Coderjoe divx networks ran a user video hosting site for awhile, named stage6
23:07 🔗 SketchCow Yes
23:07 🔗 SketchCow Just tracked.
23:07 🔗 SketchCow How many videos. How big.
23:08 🔗 SketchCow This is 50% curious, 50% useful for advising you.
23:08 🔗 Coderjoe though some of these are probably tv shows
23:08 🔗 Coderjoe easy stats are here: http://wegetsignal.org/stage6.php
23:09 🔗 Coderjoe I have 4989 actual downloaded videos, ranging from 556K to 1.9G
23:09 🔗 SketchCow I see.
23:09 🔗 SketchCow Before it died.
23:09 🔗 SketchCow So 303gb
23:09 🔗 Coderjoe yeah
23:11 🔗 SketchCow Just talked here.
23:11 🔗 SketchCow One item per video
23:11 🔗 shaqfu SketchCow: Done
23:11 🔗 Coderjoe alright. current file names are like 1.divx for video id 1. should I rename them at all?
23:11 🔗 SketchCow Let's do one.
23:11 🔗 SketchCow Choose something fucking hilarious.
23:12 🔗 SketchCow Upload it, plug in the metadata.
23:12 🔗 SketchCow Give me the item, I'll make an archiveteam-stage6 collection with it
23:12 🔗 SketchCow And we'll go from there, OK?
23:13 🔗 SketchCow shaqfu: mail that in. jason@textfiles.com
23:14 🔗 Coderjoe alright
23:16 🔗 shaqfu Sent
23:29 🔗 SketchCow Hmm, not here yet.
23:29 🔗 shaqfu Odd
23:29 🔗 SketchCow So, shaqfu: What interests you: general computer history items, or arcade manuals?
23:29 🔗 shaqfu SketchCow: What's there more a pressing need for?
23:29 🔗 shaqfu And it's probably my old school's email server being awful again
23:30 🔗 SketchCow PERSONALLY, I think the general computer history items are more useful. The arcade ones are more "fun".
23:30 🔗 shaqfu I'll tackle the history items, then
23:30 🔗 nitro2k01 In case it interests anyone: http://www.flickr.com/photos/gameboygenius/sets/72157629470674857/
23:30 🔗 nitro2k01 Will do a writeup on it later
23:30 🔗 shaqfu If push comes to shove, you could probably open the arcade manuals to crowdsourcing or w/e, since it's higher interest
23:30 🔗 kennethre nitro2k01: O_O that is amazing
23:31 🔗 SketchCow I've already opened it generally
23:31 🔗 SketchCow So yeah, bitsavers is the way.
23:31 🔗 shaqfu Any collections you have in mind?
23:32 🔗 SketchCow Oh, let me help.
23:32 🔗 SketchCow The collection I'm putting up:
23:32 🔗 SketchCow http://www.archive.org/details/bitsavers
23:32 🔗 SketchCow As you see, 44 items.
23:32 🔗 shaqfu Gotcha
23:33 🔗 SketchCow http://www.textfiles.com/bitsavers/pdf/
23:33 🔗 SketchCow Now, that's the source area.
23:33 🔗 SketchCow If you choose anything, I prefer you "close out" a directory.
23:33 🔗 SketchCow I want the URL, and the date (as much as you can guess), and a desc.
23:34 🔗 SketchCow Do as many as you can stand.
23:34 🔗 shaqfu Sounds good
23:34 🔗 SketchCow I'll be able to add them.
23:34 🔗 shaqfu Do you want subject keywords also?
23:34 🔗 SketchCow With only 44 of the thousands up, it's lots of space
23:34 🔗 SketchCow I would like that but I don't want you murdered
23:35 🔗 shaqfu SketchCow: I developed a pain tolerance to this sort of thing in grad school
23:35 🔗 shaqfu I'll start tonight
23:35 🔗 SketchCow OK, great.
23:35 🔗 SketchCow I'll ensure the front page has credit.
23:35 🔗 shaqfu One file per dir, or per item?
23:35 🔗 SketchCow One file per item
23:35 🔗 shaqfu Got it
23:35 🔗 SketchCow I have a script that does the uploading, and lets me plug in your textfile.
23:36 🔗 shaqfu Is it picky about formatting date/desc/keywords?
23:36 🔗 SketchCow Not really.
23:36 🔗 shaqfu Awesome
23:36 🔗 SketchCow What you do saves me the hardest part.
23:36 🔗 SketchCow I do the rest.
23:36 🔗 shaqfu got it
23:37 🔗 shaqfu I gotta run now; I'll start on this in a bit
23:37 🔗 shaqfu Thanks for the work
23:38 🔗 SketchCow Great wrk, nitro2k01
23:44 🔗 bsmith093 SketchCow: what i meant about the wiki link for the networking film, was that there are still some perple who have yet to be ID'd, is all.
23:44 🔗 SketchCow Ohhh
23:50 🔗 SketchCow Archive Team wki is only about 600mb.
23:50 🔗 SketchCow I'm getting our new host setup for it.
23:50 🔗 SketchCow Then we'll move there.

irclogger-viewer