[00:17] Only had 1 of mine go error 4 on me; the others just took a long time.
[00:18] Are there any other emergencies right now, or can I put this machine back on memac now?
[00:18] I've got 3 left that are running for quite some time now
[00:23] or perhaps I misheard. the sound wasn't the best in the back corner where I was sitting.
[00:27] my defcon mirroring stopped working
[01:39] I was having error 3's
[01:39] I have only two left running, Pachito and shankargallery.
[01:39] the rest have either died or finished
[01:43] https://twitter.com/thecoderjoe/status/205835085172322304
[02:08] Looks like I've got a pretty big one running.
[03:06] Morning
[03:07] 855 left on memac!
[03:07] hhahaha no way i overtook Sui in GB on tabblo
[03:07] probably barely.
[03:07] he has more items though
[03:08] i figured it wasnt going to happen but i guess i had a few big users in the end
[03:09] :>
[03:16] So far, we've found no incomplete uploads.
[03:16] But that can't be right.
[03:40] SketchCow: I found msx-user magazine
[03:40] is that on the dark-rack cause its not on archive.org
[03:42] not even underground gamer has it
[03:42] :-D
[03:43] http://i.imgur.com/vynW8.png
[03:44] ugh
[03:45] this is the state of the public schools
[03:45] http://imgur.com/gallery/aWnkX
[03:45] that one on the far right reminds me of SketchCow
[03:45] :D
[03:46] Mr.Pibb: it goes down good
[03:51] msx gold: http://www.msxarchive.nl/pub/msx/
[03:53] based on the dates it has not been updated since summer of 2006
[03:53] ok photos was updated in jan 20 2008
[04:13] i also found MSX Computing
[04:45] I just started the ArchiveTeam Warrior myself, to see it in action.
[04:45] No wonder everyone's loving it.
[04:51] http://cdn.shopify.com/s/files/1/0066/5282/files/FREAKER_5817_grande_grande.jpeg?104516
[04:51] Come join Archive Team!
[05:07] SketchCow: yeah, it's pretty fantastic
[05:07] this is the first i am hearing of it :/
[05:07] from the twitters
[05:08] how does it work
[05:15] It was just announced yesterday
[05:15] And it totally exploded through Tabblo
[05:15] A nice debut
[05:15] I'll fire it up in a few mins to have a look
[05:34] are the instructions for how the ArchiveTeam Warrior VM image is created posted somewhere?
[05:34] yeah there are tutorials on oracle's site someplace
[05:35] (the virtualbox documentation)
[05:36] well i mean specifically the AT W one, since i'm curious about what tweaks it has
[05:36] oh. I don't know much about it yet
[05:36] bad news
[05:36] I was banned from fileplanet
[05:36] D:
[05:37] you're a bad seed, underscor
[05:37] I know :(
[05:37] and yet he's a good apple
[05:37] Apparently I downloaded too fast
[05:38] hmm
[05:38] actually...
[05:38] does this 403 for any of you?
[05:38] http://m2x12.fileplanet.com/%5E229657417/ftp1/092011/BF3_99Problems_1080p60_Web_AudioFina_20Mbit.mov
[05:38] 404 in my case
[05:39] 403 for me
[05:40] firefox says 404
[05:40] but it's a weird 404 page
[05:40] wget says 403
[05:40] agreed
[05:40] wget says 403, yes
[05:40] I wonder if it's the %5E
[05:40] I get a soft 403 in opera
[05:40] soft?
[05:40] Referral Denied You don't have permission to access "http://download.direct2drive.com/ftp1/092011/BF3_99Problems_1080p60_Web_AudioFina_20Mbit.mov?" on this server.
[05:40] wget -U "Mozilla/5.0 (Windows NT 6.1; rv:5.0) Gecko/20100101 Firefox/5.0"
[05:40] Reference #24.2c496dcf.1337924471.229cdd2b
[05:40] says 403
[05:40] is a page that is delivered as a downloadable file
[05:41] well, it might be a hard 403, I'm not sure
[05:45] harrumph
[05:45] I wonder if it's Schbirid's scripts then
[05:46] that link may have expired
[05:46] iirc fileplanet is one of those pre-DDL sites that makes people jump through tons of hoops and 'wait in line' for DLs, so links would be user-specific
[05:48] it was
[05:49] oops
[05:49] didn't realize there was a channel
[05:51] hah
[05:51] I didn't know IA archived xvideos
[05:51] http://archive.org/details/xvideos.com-20120511-234516
[05:51] that's neat
[05:51] huh
[05:52] the collection gives a 500 error
[05:52] are you sure you're supposed to post that? ;)
[05:53] it isn't terribly well hidden. you just have to look at the work queue ("where am I in line?") to find them
[05:53] aye
[05:53] (even without any admin powers)
[05:53] I found it while not logged in actually
[05:54] so I guess it's okay
[05:54] lol
[05:54] who made the warrior appliance?
[05:54] alard afaik
[05:54] alard
[05:54] thanks
[05:54] he's one busy grad student. I wonder when he finds time to study
[05:55] http://archive.org/search.php?query=collection%3Axvideos.com&sort=-publicdate
[05:55] Does that return results for you guys?
[05:55] yes. I'm logged in, however
[05:55] i'm not logged in and yes it does
[05:57] okay, good
[05:57] then I'm safe
[05:57] lol
[05:58] Well, I mean, like I said, I found it not logged in
[05:58] But still
[05:58] your IP is being targeted with gifts of secret data
[05:59] I was bored and tried to figure out how much pr0n is on IA the other day, came up with 20.9tb just for the top domain crawls
[06:00] there's also youporn.com, redtube.com, pornhub.com, and some smaller ones in there
[06:02] wow
[06:02] I guess I'll not run out of porn while I'm in office
[06:02] they better do xhamster..
and hm, they *kind of* do, very few results: http://archive.org/search.php?query=collection%3Axhamster.com&sort=-publicdate
[06:02] I dunno if it's actually grabbing the videos or just a shitload of thumbnails, I guess the stuff in there is just automated due to being on alexa or w/e
[06:03] ironically enough, one of my projects is to figure out a way to inject a collection of 1920s through 1940s erotica that was donated
[06:03] that'll be interesting to work on
[06:03] "no, I wasn't watching porn on the job, it was quality control!"
[06:03] yeah they don't really seem geared for photo collections currently
[06:04] we should archive porn websites as an AT project
[06:04] that's a lot of cultural ephemera right there
[06:04] * underscor snickers
[06:04] could do some fancy video deduplication, since a lot are probably duplicates
[06:04] presumably
[06:05] too bad there's no acoustid for video
[06:05] acoustID*
[06:05] there are more people that collect music than videos i guess
[06:05] usenet would probably be the low-hanging fruit there
[06:06] collecting music videos might be similar enough to get a project rolling
[06:06] there are some video fingerprinting things that look promising... but expensive
[06:07] (i've been looking for some way to help me find duplicates that are encoded differently in a collection of fan-made video content)
[06:09] fuzzy matching would be nice too
[06:09] like, detect things like the same video but with a watermark, etc
[06:09] (of course, difficult to do, but cool in theory)
[06:13] can someone move this to the splinder collection? http://archive.org/details/splinder-users
[06:16] something that compares videos 'visually', as in based on images rather than codec-specific stuff, so basically codec-independent stuff
[06:18] yeah
[06:19] so, I was thinking about school this fall
[06:19] I only need to find a thousand people willing to loan me $10
[06:19] and I'll be home free
[06:19] lol
[06:20] kickstarter!
[06:20] I know, I was thinking about that
[06:20] But I don't know how I'd "reward"
[06:20] underscor: there are! they are tax payers, and they fund freddie mac / fannie mae student loans.
[06:20] "Send Alex to college for one year"
[06:20] 10k tax payers -> student loan -> profit
[06:21] yeah, but I'd rather go straight to the tax payers
[06:21] lol
[06:21] especially since I only qualified for $4k of fed money
[06:21] s/profit/crushing student loan debt that haunts you for years/
[06:22] I've got it. Print 10,000 business cards explaining the deal, and pass them out to everyone I meet at defcon. slip them into pockets, scatter them on tables, etc
[06:22] achieve a 10% hit ratio
[06:22] ???
[06:22] profit
[06:22] hahaha
[06:23] exactly. your startup to fund your schooling is moving too fast to find a business model.
[06:23] let the company that acquires you figure that part out
[06:23] lots of users -> ??? -> success
[06:24] I went to sxsw, I know this scam
[06:24] hey, if that crazy video game guy can get over $10k in pledges
[06:24] stayed with some guys who were pulling it
[06:24] >:D
[06:24] well, they were too clueless to pull it
[06:24] I just realized the warrior downloads and uploads at the same time.
[06:24] but they were trying
[06:24] It doesn't yank, then upload, then yank
[06:24] It takes the entire pipe to lunch 24/7
[06:24] excellent to hear, SketchCow
[06:24] ha
[06:25] chronomex:
[06:25] chronomex: but this is for a ~good cause~
[06:25] ;D
[06:25] I need you to take my entire pipe and yank, then upload, then yank
[06:25] i hope it's not limited by upstream, like it downloads as fast as it can, then uploads as fast as it can, and if it can't upload fast enough it writes to disk
[06:25] SketchCow: hi
[06:25] hot
[06:25] * arrith riffs
[06:25] that sounds like a jerkcity quote
[06:26] sounds dirty
[06:26] chronomex: what were they trying to pull?
[06:26] alard: when/if you have a chance, putting the scripts to use to generate the vm image onto github or someplace would be neat. i'm curious if i can get the image smaller
[06:27] make a thing that doesn't fill a need or have a way to collect money -> pimp it out -> get ten thousand users -> ??? -> now you are a successful company
[06:27] ah
[06:28] help me think of things people could get in return for sending me to school
[06:28] >:I
[06:28] they could talk to you on the phone for a half hour about philosophy
[06:28] instagram something something
[06:28] you could smoke a joint with them
[06:28] it really sucks being a poor student, as a poor student i know this
[06:28] ummm i'm trying to think of things that don't cost you money
[06:29] gotta spend money to make money
[06:29] I'm a former poor student
[06:29] hahaha
[06:29] you could smoke a joint with them
[06:29] their joint, of course
[06:29] hahaha
[06:29] you don't have money for drugs
[06:29] wouldn't want to spend your college tuition on drugs
[06:29] of course
[06:29] then you'd never learn whatever it is that the state wants you to know
[06:29] beep beep I am a state learned robot
[06:30] goddamnit stupid dj, release this damn cd already, I want to buy it
[06:30] btw jury nullification is a thing
[06:30] and something like 90% of stuff doesn't go to trial, all goes through plea bargains
[06:30] arrith: yes, but don't say that until they've already moved on from the throwing people off the jury stage
[06:30] chronomex: yeah definitely
[06:31] although, if you want to get kicked off
[06:31] tru
[06:31] either way i'd want to print up a bunch of pamphlets on it
[06:31] poison the jury
[06:31] poison them all ahaha
[06:31] hahaha
[06:31] just put them out in the lobby of the courthouse every day
[06:31] there should be a website to crowdfund education
[06:31] haha
[06:31] that would be interesting
[06:31] don't let anyone in the courthouse without a 20second rant on jury nullification
[06:31] like
larouchites
[06:32] underscor: wikibooks kinda
[06:32] I meant more in the sense of having a thing where like family members or something could drop in some money
[06:33] sort of like a targeted kickstarter or something
[06:33] I dunno
[06:33] Don't mind me, just thinking aloud
[06:33] don't use kickstarter, use indiegogo because you can keep all the money regardless of whether you hit the goal
[06:34] for raising money from a group iirc WePay is supposed to be good, at least compared to paypal
[06:35] http://www.youtube.com/watch?v=oGcApe6_HdI&feature=youtu.be
[06:37] SketchCow: i saw him on this coin operated show on the history channel when i was flipping channels for the first time in like a year or two
[06:51] SketchCow: http://archive.org/details/GBTV_09_12__16_2011
[06:51] first week of gbtv
[06:52] 10 hours of video right there
[06:56] I got first place in my networking project today
[06:56] that was p exciting
[06:57] underscor: what was the project?
[06:57] getting 50 business cards out first
[06:58] FilmCompany recently bought a new building, and we had to do a proposal for end to end network design, installation, maintenance, security, policies, etc
[06:58] it ended up being like 115 letter pages
[06:58] we presented to judges today, from different places
[06:59] high up network engineers from AT&T and Verizon, head prof from the networking program at GMU, high up cisco technical guy
[06:59] and then some more "marketing/businessey" people
[07:00] haha
[07:02] watching some of John Romero's doom development video footage. watching a playtest of E1M2 where the sound effects weren't made yet, so some of the SNES wolf 3d sounds are there
[07:36] I think I just quit being GDC's archivist. I feel terrible.
[07:36] now why would you do a thing like that
[07:36] Because in three months, I have not produced a single .flv for their vault.
[07:36] Oh, make no mistake, I've fucking TRIED
[07:37] Like, 99% of my stress this past three months? Vault.
[07:37] is flv just hard to make?
[07:37] Taking .AVI files and turning them into .flv files? All my stress.
[07:37] I was able to do it for a while!
[07:37] hrm.
[07:37] But now I can't.
[07:37] And I'm sick of it.
[07:37] flv for a vault? ew.
[07:37] ^
[07:37] They're getting fucked, and I'm way too busy.
[07:38] I was doing it primarily to supplement income. And I kind of will really, really be financially strapped if I do this.
[07:38] But I don't think I can take it.
[07:38] I'm tired of missing deadlines and promises with those people.
[07:39] Here's an example.
[07:39] Tomorrow, I am to go to NY to spend time with the lady.
[07:39] Flight on Sunday, to CA, for a week.
[07:39] Am I packing? No.
[07:39] Am I making sure I have stuff? No.
[07:39] Am I ensuring I am answering e-mail?
[07:39] No.
[07:39] No, I just wasted another hour trying to get the dying laptop to render an .flv
[07:39] you should do the opposite of that
[07:40] an .flv that doesn't work.
[07:40] video encoding is such dark magic i don't even know
[07:40] re-encoding is another thing even
[07:40] upload to youtube -> download from youtube. done, converted. ;/
[07:40] arrith: ArchiveTeam Warrior scripts? What scripts? I just made a VM by hand. :) Log in (Alt+F2) and see if you can make it smaller.
[07:41] I just wrote them a letter.
[07:41] It said, basically:
[07:41] flv sounds like a disease :-/
[07:41] - Let me work on this on my spare time, no pay
[07:41] - Or I give up, you get the drives with the stuff and the tapes
[07:41] But they wanted X videos a week
[07:41] And I can't get a single video to render.
[07:42] alard: nnnnooo. i was afraid you did that. you have to do it all fancy where a script fires up virtualbox, attaches a distro iso, then runs a preseed install, or the equiv. hm.
[07:42] Literally, this is all my stress, this side job.
[07:42] arrith: That would be cool, but I don't know how to do that and didn't really want to learn. So go ahead.
:)
[07:42] But I think the worst part is that this was a leg up, a favor from a buddy, who worked there
[07:42] And now he looks like a tard for hiring this flustered excuse machine
[07:43] I feel like one of those people, those addicts you give simple jobs to and they come back with excuses.
[07:43] arrith: um... that sounds incredibly non-portable
[07:43] SketchCow: so the basic issue is that you don't know the recoding process - once that works, everything's smooth?
[07:43] "Couldn't take out the garbage, got hit with a board by a passing truck, went to complain, got arrested for harassment" etc.
[07:43] Yeah, likely.
[07:43] alard: hm i don't exactly have the time to do it justice. but if you have the time in the next few weeks if you could jot down the things you do to the OS once installed, that'd be neat
[07:44] I have raw MPEG
[07:44] I make a AVI that works.
[07:44] Converting the AVI to a .FLV that works, apparently I lost the knowledge. I used to be able to do it.
[07:44] Coderjoe: virtualbox is portable, but yeah i'd be imagining a linux dev env, i mean unless people working on it aren't on linux
[07:44] Oh, and make no mistake.
[07:44] I drop the .flv I make into a VLC, it works.
[07:45] virtualbox is portable, but a script that fires up VB and all of that sounds rather not
[07:46] I think the idea is that you use the script to create an appliance that you then distribute.
[07:46] Coderjoe: well to be clear this is just to automate the creation of the vm, which is then distributed. the only people touching it would want to work on the vm itself
[07:46] Coderjoe: but yeah you'd need OS-specific stuff if people on different OSes wanted to dev on the vm
[07:47] There, I just told another guy, that no, I won't have his thing for him.
[07:47] I have a lot of mail in my inbox right now, mostly me being willing to do things I have no time for.
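The scripted build floated between [07:42] and [07:46] (a script fires up VirtualBox, attaches a distro ISO, runs an unattended install, and the result is distributed as an appliance) might look roughly like the sketch below. It is not the Warrior recipe, which per [07:40] was built by hand. The VM name, ISO path, memory and disk sizes are all made-up placeholders, and the script only prints the commands unless DRY_RUN is set to 0.

```shell
#!/usr/bin/env bash
# Sketch only: every name, size and path here is a hypothetical placeholder.
VM_NAME="${VM_NAME:-example-appliance}"   # hypothetical VM name
ISO="${ISO:-installer.iso}"               # hypothetical installer ISO (e.g. with a preseed file baked in)
DISK="${DISK:-$VM_NAME.vdi}"              # hypothetical disk image path
DRY_RUN="${DRY_RUN:-1}"                   # 1 = print commands instead of running them

# Print the command in dry-run mode, otherwise execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

build_appliance() {
  # Create and register an empty VM, then give it memory and a NAT network card.
  run VBoxManage createvm --name "$VM_NAME" --ostype Ubuntu --register
  run VBoxManage modifyvm "$VM_NAME" --memory 256 --nic1 nat
  # Create a 2 GB disk (newer VirtualBox releases call this subcommand createmedium).
  run VBoxManage createhd --filename "$DISK" --size 2048
  # Add an IDE controller, attach the disk and the installer ISO to it.
  run VBoxManage storagectl "$VM_NAME" --name IDE --add ide
  run VBoxManage storageattach "$VM_NAME" --storagectl IDE --port 0 --device 0 --type hdd --medium "$DISK"
  run VBoxManage storageattach "$VM_NAME" --storagectl IDE --port 1 --device 0 --type dvddrive --medium "$ISO"
  # Boot the installer. A real script would have to wait here until the
  # unattended install finishes and the VM powers off before exporting.
  run VBoxManage startvm "$VM_NAME" --type headless
  run VBoxManage export "$VM_NAME" --output "$VM_NAME.ova"
}

build_appliance
```

Keeping each VBoxManage call on its own commented line is in the spirit of the "heavily commented lists" idea: the script doubles as a record of what was done to the VM.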
[07:47] SketchCow: In all honesty, you have such incredible cachet right now with gamedevs (via PoP) that the last thing you should worry about is their reaction
[07:47] Guy was a buddy, wanted some sort of special preview trailer for my three docs.
[07:47] Guess what. None.
[07:47] I don't have time to cut a "special preview trailer".
[07:47] shaqfu: You do not understand the situation.
[07:48] And that's OK.
[07:48] Honestly, I shouldn't even be putting this in the channel, it's off topic.
[07:48] I'm pissed and I'm tired.
[07:49] These three days in Detroit were a triumph. I blew people's minds both on the main keynote for the conference I was asked to do, as well as an excellent 2 hour interview/podcast/appearance at a local renowned library. I was flown in and given a hotel room. I had excellent chats with people.
[07:49] But all the time I wasn't doing that? In my nice hotel room, trying to render a fucking .FLV
[07:49] i need some advice, our fileplanet downloads include full games from https://en.wikipedia.org/wiki/Direct2Drive . these games are not playable but apparently the exe files have an product activation screen like http://i.imgur.com/Pxtju.png
[07:49] still, at least for the game i just tried (GTA vice city) it installed all the data unencrypted
[07:50] this feels like deep copyright trouble to me
[07:50] I feel slightly douchy for not being more talkative, but I tend to be that way around people I don't know well
[07:50] What, at the dinner thing?
[07:50] yes
[07:50] Oh, well, look, my whole deal is outgoing and social.
[07:51] we might be able to filter out most of those games ( *_dd_setup.zip , *_dd.zip )
[07:58] Hey, I'm looking for a new home for the MobileMe tracker's Redis. My free time on Amazon is running out at the end of the month, so I should probably move it somewhere else.
[07:58] Does anyone have a suggestion? (Or even better -- is running a Redis server with a bit of spare room where the tracker could live out its last month?)
[07:58] I found a few free Redis hosts so far, but they're too small (the tracker data is about 50-100MB, which is more than the 5MB they usually give you).
[08:00] Can FOS do this work?
[08:06] http://helloeveryonethisisarandomsubdomain.archive.org/
[08:08] hmm, lots of the tabblo stuff that got requeued is failing for me
[08:08] Downloading fitziane... ERROR (3).
[08:08] Error downloading 'fitziane'.
[08:08] Getting next username from tracker... done.
[08:08] such as that
[08:18] SketchCow: Yes, there's no reason FOS couldn't run Redis.
[08:18] Aranje: We're getting to the hard cases now -- they've failed before.
[08:19] alard: btw what i meant earlier is i don't have the time to do it justice soon, but i would be curious how small i could get that vm image
[08:20] I'll see if I can write some things down. There's not that much that's different from a normal Ubuntu install, just a few tweaks.
[08:26] dpkg --get-selections will give you the installed packages concisely, btw
[08:51] alard: what's the approximate data transferred per month?
[08:54] GLaDOS: Less than 3GB.
[08:54] But most of that is from the graph data, which can be truncated.
[08:55] SketchCow: can i run a .au mirror for textfiles.com for you? whats the best way to grab all the content? im guessing you have it tar.gz'd or something somewhere?
[08:56] alard: pff, that's nothing!
[08:57] It's little bits of usernames to and from the heroku webapp.
[09:01] GLaDOS: For some people, it's a lot (Not me, just sayin')
[09:01] can one tracker code instance be used to run multiple projects? (either with one redis instance or multiple)
[09:03] Well, at the moment not, though it would be useful to set up a central tracker app somewhere. Then you'd just point-and-click to create a new tracker for the current project.
[09:07] GLaDOS: (Also, the bandwidth isn't really the problem, it's running the instance. On Amazon it would be $60, which probably won't kill you but isn't nothing either. :)
[09:07] 60 dollars?
[09:07] wow.
[09:07] The micro instance is $0.08 per hour, and Google says that will be $58 dollars for a month.
[09:08] Better off just getting a VPS on its own if thats the price
[09:08] Sorry, no, wrong, that's the small instance.
[09:09] A micro instance is $0.02 per hour, so $15 for a month.
[09:10] how could you make wget reject blog.direct2drive.com/2011/05/special-announcement-from-d2d/index.html?replytocom=70002.html ?
[09:10] *replytocom*.html did not work
[09:10] *replytocom* neither
[09:11] --reject-regex='\?replytocom'
[09:11] (on the latest wget, of course)
[09:13] * Schbirid builds from bzr
[09:13] huh, that may be why my archiving server isn't getting any new tabblo users
[09:13] It's currently downloading 80 large users
[09:14] Schbirid: It's on git, nowadays. The latest alpha version contains everything: ftp://alpha.gnu.org/gnu/wget/wget-1.13.4.56-620c.tar.bz2
[09:14] i prefer whatever someone nicely packaged for me (arch AUR)
[09:15] argh
[09:15] stupid s3cmd keeps getting stuck downloading a whole lot of nothing
[09:18] ftp://ftp.ntsomz.ru/ u: electro p: electro
[09:18] that is the electro satellite imagery
[09:18] http://www.theregister.co.uk/2012/05/24/electro_l_121_megapixel_earth_photo/
[09:18] ELECTRO-L went aloft in early 2011 and snaps Earth every 30 minutes or so, and the Agency will happily sell you snaps from the bird. Each image packs a kilometre of Terran surface into each pixel.
[09:35] alard: is the tracker software capable of using passwords on a redis server?
[09:36] Yes.
[09:44] It's amazing how consistently this happens: I get ops, and then between midnight and 1:00 the router craps itself.
[09:45] Heh
[09:46] Even better, the memac script doesn't know anything's wrong until wget returns. So it steams along, with a five-hour gap, then when it finally gets through mirroring what it can, it goes "holy crap I must've blacked out or something!"
[09:47] So ^C'ing dld-client becomes part of the routine.
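On the --reject-regex answer at [09:11]: one way to sanity-check such a pattern before a long crawl is to run it over a few sample URLs with grep -E, as in the sketch below. This assumes the pattern is tested against the whole URL, query string included, and that grep's extended regex syntax is close enough to wget's default for a pattern this simple. The helper name would_reject is made up for the example.

```shell
#!/usr/bin/env bash
# Pattern quoted in the chat. The backslash makes the "?" a literal question mark.
REJECT_RE='\?replytocom'

# Hypothetical helper: succeeds (exit status 0) if the pattern matches anywhere in the URL.
would_reject() {
  printf '%s\n' "$1" | grep -Eq -- "$REJECT_RE"
}

# Try the URL from the chat and the same page without the query string.
for url in \
  'blog.direct2drive.com/2011/05/special-announcement-from-d2d/index.html?replytocom=70002.html' \
  'blog.direct2drive.com/2011/05/special-announcement-from-d2d/index.html'
do
  if would_reject "$url"; then
    echo "reject: $url"
  else
    echo "keep:   $url"
  fi
done
```

If the sample URLs sort the way you expect, the same pattern can be handed to wget as shown in the chat, given a wget build recent enough to have the option.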
[09:48] arrith: https://github.com/downloads/ArchiveTeam/warrior-code/recipe-files.tar.gz
[09:48] weird... I requested a large instance, which is supposed to have 850GB of instance storage. the console says I have a large instance. linux on the instance shows 2 cores (medium has 1, large has 2, xlarge has 4), but the instance storage is only 410GB (which is what medium is supposed to have)
[09:48] arrith: That's more or less what I remember to have done, plus the files that seem to have changed. There may be important bits missing.
[09:52] oh
[09:52] Your m1.large instance comes with 2x420GiB instance storage. The other 420GiB is available but is unformatted and unmounted.
[09:52] sneaky
[09:53] how dare they expect you to do some work to get your brand new server up to spec
[09:55] it isn't documented on the normal ec2 instance types page. that just says "a total of (blah)". you have to dig deeper into the docs to find this
[09:57] and I don't seem to have the /dev/xvdc that I should have for the other part
[09:58] alard: thanks. i suppose one way to do it is to "diff -r" the vm with a fresh install but egh that's involved. if you wind up remaking the vm image from scratch before i poke around through this maybe you could take notes or something, which would be neat
[10:02] arrith: I'll try to remember, but it's not a very structured process. How do these vm-creating scripts work? Is there a standard way of doing it?
[10:04] alard: i'm really not sure. i know you can do basically anything with the VirtualBox cli tools that VirtualBox can do.
the tricky part to me is figuring out the preseed file syntax and going over long lists of what can be tweaked
[10:05] alard: i was just thinking of simple lists of VBoxManage stuff which would have each section heavily commented
[10:06] appears to be a bug with the ubuntu 12.04 AMI
[10:07] you can attach it if you use the command-line tools to launch the instance, but the second isn't attached by default, so the gui fails
[10:17] or not? same results with older version
[10:17] longstanding bug maybe :P
[10:19] arrith: Something like this https://help.ubuntu.com/12.04/serverguide/jeos-and-vmbuilder.html may be useful.
[10:28] 'vagrant' might be useful as well
[14:05] >> DANIELLA
[14:05] Description ends at line 94...
[14:05] Description starts at line 473...
[14:05] Princess looks so ugly when she cries. No mp3s for daniella
[14:05] We make it a pretty princess. daniella
[14:06] SketchCow: see my msg about mirroring textfiles.com in .au ?
[14:07] if i may :)
[14:07] Have at
[14:07] at ?
[14:15] yes, go ahead and rsync it
[14:15] I didn't see the message for what it's worth
[14:16] dont you have it compressed somewhere or something? dont really want to rsync a billion tiny text files... :|
[14:16] dont worry. he'll serve you goatse after the first 500 million
[14:18] http://archive.org/details/textfiles-dot-com-2011
[14:28] :|
[14:28] how can i easily download the whole lot at once?
[14:30] with bash: wget http://ia600608.us.archive.org/4/items/textfiles-dot-com-2011/textfiles.com.7z.{001..111}
[14:33] rsync is pretty efficient even for a lot of tiny files
[14:35] Schbirid: that tries to get them without the 00s so starts at 1 2 etc, rather than 001
[14:36] WORKSFORME
[14:36] what shell are you using?
[14:36] GNU bash, version 4.2.28(2)-release (i686-pc-linux-gnu)
[14:37] /bin/bash
[14:37] bash-3.2-32.el5
[14:37] :\
[14:37] no idea then
[14:37] there would be many ways
[14:38] just make a textfile with all the urls
[14:38] in calc or so ;)
[14:38] or a for loop with seq
[14:40] its ok its working with that wget command on another box with bash-4.1.2-8.el6.x86_64
[14:40] thx for the tip
[14:41] np
[14:44] 4,095 of the Tabblos were 500 Errors
[14:44] Not a bad deal!
[15:09] I'm considering doing an archive team kickstarter to raise some funds for our various activities.
[15:09] Not this week, not next, want to think it over.
[15:10] Hire someone like Rich Stevens to do some branding for these projects, get money to archive.org for all the disk space we're eating, pay for t-shirts or something
[15:10] Just thinking it over.
[16:44] http://www.reddit.com/r/nsfwhot/comments/u2l1p/husband_watching_wife_gangbanged_in_restaurant/
[16:45] someone kickban gyotoit22 and Lolita27
[16:46] botspam incoming?
[16:50] * Aranje laughs
[17:01] http://www.youtube.com/watch?v=YUVf6yvWvQ4&feature=youtu.be
[17:04] funny & impressive
[17:13] SketchCow: do you have ahoy, commodore free, or commodore horizons 100%?
[17:14] i know i have seen compute! and compute!'s Gazette
[17:15] If it's a magazine torrent that says "100%", I have it.
[17:15] thats the name of the magazine
[17:16] http://www.demonoid.ph/files/details/2218860/8010783/
[17:16] commodore free is from like 2007 also
[17:17] magazine website: http://www.commodorefree.com/
[17:19] ok you guys may have this torrent
[17:19] just found ahoy magazine
[17:19] on archive.org
[18:10] quick questions
[18:10] I have archived a wiki that's GFDL, how can I mention this in IA
[18:35] GFDWhat?
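About the {001..111} exchange from [14:30] to [14:40]: the behaviour reported there fits zero-padded brace ranges being a bash 4 feature, which would explain why the bash 3.2 box asked for .1, .2 and so on. A loop with printf avoids depending on the shell version. The sketch below only prints the part URLs. The function name part_urls is invented for the example, and the part count of 111 is taken from the chat.

```shell
#!/usr/bin/env bash
# Base URL of the split archive, as posted in the chat.
BASE='http://ia600608.us.archive.org/4/items/textfiles-dot-com-2011/textfiles.com.7z'

# Print one URL per part, with a three-digit zero-padded suffix (.001 to .111).
part_urls() {
  i=1
  while [ "$i" -le 111 ]; do
    printf '%s.%03d\n' "$BASE" "$i"
    i=$((i + 1))
  done
}

# Show the first and last entries as a quick check.
part_urls | head -n 1
part_urls | tail -n 1
```

To actually download, the list can be piped to wget, for example `part_urls | wget -i -`, which reads the URLs from standard input.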
[18:39] sorry about the ragequit earlier, Aranje has encountered a bug
[18:39] seesaw didn't delete any data and filled up a server, killing an important mysql instance
[18:39] afaik seesaw is supposed to delete the data when done rsyncing, correct?
[18:39] GFDL
[19:05] Debianer, you have to explicitly set it in the licenseurl field
[19:05] by putting https://www.gnu.org/copyleft/fdl.html
[19:05] I'm using the web interface.
[19:06] beware that you won't be able to edit the metadata again after this
[19:06] I'm using the web interface...
[19:06] then I think it's forbidden, or you could try and add another licenseurl field
[19:06] ^
[19:06] if it doesn't work use the general purpose rights field
[19:06] "rights" field
[19:11] I'm using the rights field and filled in a "fake" public domain license...
[19:11] http://i.imgur.com/oyGrj.png nsfw
[19:35] Debianer, fake PD is very bad
[19:35] put CC-BY-SA if you really want to use the licenseurl field
[19:35] in any case put the correct license in the rights field
[19:35] and if you care about metadata consistency please don't put erroneous license info
[19:42] sorry
[19:43] I'm a n00b
[19:44] Okay, there
[19:44] I'm confused about IA and the like, so
[20:24] Debianer, everyone is :D
[20:24] and licenses are confusing for everyone
[20:25] plus, GFDL handling is plainly broken
[20:33] gfdl is weird
[20:48] guys, everyone who has a twitter account, ovh is giving away free servers as beta test of their new US location. http://www.ovh.de/rootserver/bestellung_usa_beta.xml http://www.ovh.com/fr/serveurs_dedies/commande_usa_beta.xml http://www.ovh.co.uk/dedicated_servers/usa_order_beta.xml
[20:49] Just got an archives job with no real supervision; fuckyeah.jpg
[20:49] lucky winners will get a http://www.ovh.co.uk/dedicated_servers/superplan_mini.xml
[20:51] oh, you have to follow them on twitter for participation...
[20:51] should have seen that coming
[20:51] we'll see
[20:51] just keeps giving me errors
[20:52] An error has occurred, please try again
[20:52] thanks for the helpful error message
[20:53] hm, the url should work. maybe try "servers -> new location (shiny US banner on the right)
[20:53] I'm at the form to sign up
[20:53] put in my info, and that error happens
[20:53] maybe your name is wrong
[20:53] it's not
[20:54] well, i get that all todays servers are gone already :]
[20:54] oh, now it's this
[20:54] Sorry, we have reached the limit of available servers for today. Please try again tomorrow!
[20:54] yeah.
[20:54] oh well
[20:54] yuuuup
[20:55] quarity
[20:55] http://forum.ovh.co.uk/showthread.php?t=6138
[20:58] canada != USA
[20:59] guess the url is easier to say usa than na for north america, but the page says north america which is correct :P
[21:01] :)
[21:03] speaking of which I just heard that some famous newspaper (?) in the USA suggests people to study in Canada
[21:03] despite the usual 400 M$ something announced by Obama
[21:03] (why are USA political promises amounts always so ridiculous)
[21:04] 400MM USD is about $1.00 per person in the country
[21:04] what do you mean by ridiculous?
[21:04] high or low?
[21:05] $1.30 ;)
[21:14] just ridiculous, they always look like random numbers
[21:14] not to speak of national debt :p
[21:19] Hm, off/on-topic, is Tesseract/OCRopus ready for prime time?
[21:22] Nemo_bis: How's it compare to commercial products?
[21:22] k
[21:22] like?
[21:22] tesseract is mostly useless without a lot of work
[21:22] a disaster
[21:22] :(
[21:23] I'll note that in acquisitions, then
[21:23] i used gimageReader very successfully on good scans
[21:23] iirc that uses tesseract in the background
[21:24] yeah
[21:25] I wound up writing my own ocr engine a while ago, because it seems that every ocr engine in the world sucks balls when it comes to ocring single-font character-cell text and preserving the spacing
[21:27] i scanned high resolution, high contrast text. worked really well
[21:27] yeah I'm working with xerographic enlargements of computer-output-microfiche, so it's kind of crappy
[21:28] alard: hm that might be useful. though two notes, jeOS is now just part of the normal ubuntu server iso, and kvm isn't as portable as virtualbox in terms of choosing dependencies. though depending on how vmbuilder makes the jeos iso, it might be something like jigdo
[22:01] chronomex: oh cool, do you have the code for said ocr engine available?
[22:01] indeed I do
[22:01] https://github.com/chronomex/ess-ocr
[22:02] it's super shitty, be warned, but it gets good results
[22:02] you have to change constants in the source to tell it the sizes of your page, and how many characters are on each page, etc
[22:02] and you need to pre-crop the pages
[22:02] "part of this complete breakfast"
[22:21] Sorry, the currently available servers were already forgiven all. Try it again tomorrow!
[22:21] awww
[22:21] :(
[22:23] damn, I wonder what time they roll over
[22:23] GMT?
[22:23] well, it's an eu company.
[22:23] yeah
[22:25] I suspect that when the figure on the top line of this page is nonzero, there are slots: http://www.ovh.co.uk/dedicated_servers/dedicated_list.xml
[22:26] * underscor writes a script to check it
[22:28] http://www.ovh.co.uk/dedicated_servers/hg_2011_xxxl.xml
[22:28] mmmm
[22:31] 36 x 3tb
[22:32] 10Gbit, max 40T/mo hmmm
[22:35] can anyone point me to a wget win32 binary new enough to have proper warc support compiled in?
[22:35] * seller selling: FRESH -> smtp ip / rdp server / rdp server whit ams / mail sender / 1 m (leads emails = usa australia europe asia arab mixed ) / mistery shopper / roots linux dedicate server / fresh fullz vbv rezult / dumps shopping -> add me: jeff_fullz@yahoo.com
[22:35] teh googls just keep turning up ass old copies from 2005-2008
[22:36] underscor: op me
[22:37] thank you
[22:37] while true; do if [ `curl http://www.ovh.co.uk/dedicated_servers/dedicated_list.xml|grep "44.99"|grep -oP ".*?"|head -n1|grep -o "[0-9]*"` -eq 0 ];then echo "No dice."; else echo "Dice";fi;sleep 1;done
[22:38] sleep 1?
[22:38] ha
[22:38] I guess it could wait longer
[22:38] I'd expect it to fire again on the hour
[22:39] yeah
[22:39] although they are burning in a new facility, so it may not be connected to that page anyway
[22:40] :(
[22:40] Nemo_bis: you can edit the metadata afterward on items you uploaded, but not certain fields. I'm not sure what all is locked out after upload, other than item id and collection
[22:42] gfdl breaks it
[22:45] argh
[22:45] damn you s3cmd, stop hanging
[22:46] restarting is an expensive (timewise) operation
[22:58] hello life savers
[22:58] web savors
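The polling one-liner at [22:37] can be unrolled into something a bit more forgiving, sketched below. Two things here are assumptions rather than anything from the chat. First, the extraction step is a stand-in, since the pattern in the logged command appears not to have survived intact: this version simply takes the first run of digits on the line that mentions the price. Second, the 300 second interval is an arbitrary choice, picked only because the chat agrees that 1 second is too eager. The function names are made up.

```shell
#!/usr/bin/env bash
URL='http://www.ovh.co.uk/dedicated_servers/dedicated_list.xml'  # page named in the chat
PRICE='44.99'     # row marker used in the chat one-liner
INTERVAL=300      # seconds between checks (arbitrary assumption)

# Stand-in extractor: reads HTML on stdin and prints the first run of digits
# found on a line containing the price. The real page layout is not known here.
extract_count() {
  grep -F -- "$PRICE" | grep -o '[0-9][0-9]*' | head -n 1
}

# Turn the extracted value into a verdict. Empty or non-numeric input
# (failed fetch, changed page) is reported instead of breaking the test.
report() {
  case "$1" in
    ''|*[!0-9]*) echo "Unknown" ;;
    *[1-9]*)     echo "Dice" ;;
    *)           echo "No dice." ;;
  esac
}

# Poll forever. Not called automatically in this sketch.
poll() {
  while true; do
    report "$(curl -s "$URL" | extract_count)"
    sleep "$INTERVAL"
  done
}

# Offline demonstration with a made-up table row.
report "$(printf '<td>3</td><td>44.99</td>\n' | extract_count)"
```

Handling the empty case separately matters because the original test compares an unquoted command substitution with -eq, which fails with a shell error whenever the fetch or the parse returns nothing.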