[02:00] I...uh...what [02:00] * BlueSleep stabs underscor [03:11] BlueMax: SketchCow wouldn't look sexy in a tiger costume? [03:11] rarrrrrr [03:12] * BlueMax stabs undersco2. [03:12] I <3 you too [03:13] Leaving for governor's school tomorrow!! [03:13] * undersco2 is so excited [03:14] Governor's school? [03:15] it's basically college-for-a-month [03:16] My program KS on logic, debate, and problem solving [03:16] is* [03:16] Lucky bastard [03:16] :-D [03:17] 100 MB it internet too! [03:17] internet* [03:19] FFFFFFFUUUUUUUUUUUU [03:20] Where are you again, bluemax? [03:20] Australia [03:20] oh, your net sucks [03:22] * atg misses his 100mbit iternet [03:22] *internet [03:22] Fun project for the fall semester will be borging 5 100mbit lines and seeing if we can get .5gbit usenet downloads going [03:22] For uh... watever legal content may be on usenet [03:24] hahaha [03:24] suuure [03:24] Solaris supports etherchannel out of the box [03:24] It's also solaris... [03:25] I'll end up using PFSense and doing some crazy forwarding magic to make it all work [03:25] As my school's network totally isn't designed for bonding links, and I don't think asking IT to create an aggragrate link would go over too well [03:26] hahaha [03:26] uhh, yeah, we want to download some stuff from alt.binaries, will you bond these switch ports for us? [03:32] Getting a public IP was work enough... I was told a) it's not possible b) fill out a form that is only applicable to professors/staff running offical servers and finally the correct answer c) of "register here" [03:32] But a network client needs to be running full time to have internet access [03:32] They claim they can't disable it... and it's web/javascript/java based [03:32] (yes, all three) [03:32] So you need a headed system to run it. [03:32] :| [03:34] That's disgusting [03:37] Yes. Yes it is. [03:37] UVA has their internet stuff done /awesomely/ [03:37] My school? Not so much. [04:07] atg: Where do you go? [04:07] GMU [04:08] Oh, that's cool [04:08] Right near me then [04:10] It's pretty decent other than that [04:11] That's enough of a reason to get them removed from my list [04:12] I hate shit like that >:I [04:44] IT... yeah [04:44] I've written some scripts to work around it [04:44] Fun learning experience, etc [04:44] They don't care how much BW I pull though, which is awsome [04:44] My roommate pulled something like 400GB in ~16 hours [04:46] atg: WHAT [04:47] Yes [04:47] We pulled many TBs [08:16] In for amoment. [08:17] * SketchCow gets his tiger outfit out for the evening [08:18] * BlueMax pulls out a tranquilizer gun [08:18] SketchCow, step away from the tiger outfit. [08:19] heh [09:02] I get the feeling SketchCow has me on ignore [10:48] any hint how i can insert random text with all kinds of funny characters (robots.txt files) into a sqlite3 database using Bash? [10:48] i tried [10:48] sqlite3 -echo ~/ramdisk/robots.db "INSERT INTO files (time, domain, content) VALUES ('${today}', '${domain}', '${filecontent}');" [10:48] but if i have single quotes in the file content, then it obviously explodes around my ears [12:38] alright, got it [12:38] i think i will store them straight into sqlite then [12:38] probably saves a lot of space due to tiny files otherwise [12:39] this might grabbing 100k per day possible for me [13:51] aaaargh fuck, why did aria2c save gzipped stuff [13:54] for example "aria2c http://sohu.com/robots.txt" [13:54] if anyone knows how to make it stop that, please tell me [14:16] apart from those and some files that are too huge to be sent through Bash it seems to work well [15:00] maybe i could simply grep downloaded files for ^Disallow OR ^Allow and remove non-matches [15:00] i even got some PNG and GIF as responses... [16:59] nice, 500mb turned into a 200mb database [17:03] ah, --http-accept-gzip=false [17:04] :(( [17:04] no, Default: false [17:06] oh nice http://aria2.sourceforge.net/aria2c.1.html#aria2_optref_on_download_complete [17:07] maybe i can call a script that checks the file and deletes it if it is garbage [17:14] http://pastebin.com/raw.php?i=naCaSyuZ [17:19] ^data$ [17:19] i would delete the following: [17:19] "HTML document" [17:19] "application" [17:19] "gzip compressed data" [17:19] ^empty$ [17:19] "PE32 executable" [17:20] "image data" [17:20] any thoughts? [17:44] dammit, it does not work [17:45] stupid me [18:03] done [18:03] calling a script with this inside: [18:03] if test "$(file $3 | grep -E '(.*data.*|.*empty.*|.*HTML document.*|.*gzip compressed data.*|.*application.*|^PE32 executable.*|.*image data.*)' )"; then rm $3; fi [18:03] yes, it is as ugly as my backside [18:59] wicked, http://imgur.com/robots.txt [18:59] project starts to take shape though, i am downloading, discarding bad files and in the end putting all of today's files in a sqlite3 db [19:32] ~30 minutes to download 10k, 5 minutes to insert them into db. nice [19:33] 60 megabytes as files, 8 as database :)