#archiveteam 2011-07-02,Sat

↑back Search

Time Nickname Message
02:00 🔗 BlueSleep I...uh...what
02:00 🔗 * BlueSleep stabs underscor
03:11 🔗 undersco2 BlueMax: SketchCow wouldn't look sexy in a tiger costume?
03:11 🔗 undersco2 rarrrrrr
03:12 🔗 * BlueMax stabs undersco2.
03:12 🔗 undersco2 I <3 you too
03:13 🔗 undersco2 Leaving for governor's school tomorrow!!
03:13 🔗 * undersco2 is so excited
03:14 🔗 BlueMax Governor's school?
03:15 🔗 undersco2 it's basically college-for-a-month
03:16 🔗 undersco2 My program KS on logic, debate, and problem solving
03:16 🔗 undersco2 is*
03:16 🔗 BlueMax Lucky bastard
03:16 🔗 undersco2 :-D
03:17 🔗 undersco2 100 MB it internet too!
03:17 🔗 undersco2 internet*
03:19 🔗 BlueMax FFFFFFFUUUUUUUUUUUU
03:20 🔗 undersco2 Where are you again, bluemax?
03:20 🔗 BlueMax Australia
03:20 🔗 undersco2 oh, your net sucks
03:22 🔗 * atg misses his 100mbit iternet
03:22 🔗 atg *internet
03:22 🔗 atg Fun project for the fall semester will be borging 5 100mbit lines and seeing if we can get .5gbit usenet downloads going
03:22 🔗 atg For uh... watever legal content may be on usenet
03:24 🔗 undersco2 hahaha
03:24 🔗 undersco2 suuure
03:24 🔗 undersco2 Solaris supports etherchannel out of the box
03:24 🔗 atg It's also solaris...
03:25 🔗 atg I'll end up using PFSense and doing some crazy forwarding magic to make it all work
03:25 🔗 atg As my school's network totally isn't designed for bonding links, and I don't think asking IT to create an aggragrate link would go over too well
03:26 🔗 undersco2 hahaha
03:26 🔗 undersco2 uhh, yeah, we want to download some stuff from alt.binaries, will you bond these switch ports for us?
03:32 🔗 atg Getting a public IP was work enough... I was told a) it's not possible b) fill out a form that is only applicable to professors/staff running offical servers and finally the correct answer c) of "register here"
03:32 🔗 atg But a network client needs to be running full time to have internet access
03:32 🔗 atg They claim they can't disable it... and it's web/javascript/java based
03:32 🔗 atg (yes, all three)
03:32 🔗 atg So you need a headed system to run it.
03:32 🔗 atg :|
03:34 🔗 underscor That's disgusting
03:37 🔗 atg Yes. Yes it is.
03:37 🔗 atg UVA has their internet stuff done /awesomely/
03:37 🔗 atg My school? Not so much.
04:07 🔗 undersco2 atg: Where do you go?
04:07 🔗 atg GMU
04:08 🔗 underscor Oh, that's cool
04:08 🔗 underscor Right near me then
04:10 🔗 atg It's pretty decent other than that
04:11 🔗 underscor That's enough of a reason to get them removed from my list
04:12 🔗 underscor I hate shit like that >:I
04:44 🔗 atg IT... yeah
04:44 🔗 atg I've written some scripts to work around it
04:44 🔗 atg Fun learning experience, etc
04:44 🔗 atg They don't care how much BW I pull though, which is awsome
04:44 🔗 atg My roommate pulled something like 400GB in ~16 hours
04:46 🔗 BlueMax atg: WHAT
04:47 🔗 atg Yes
04:47 🔗 atg We pulled many TBs
08:16 🔗 SketchCow In for amoment.
08:17 🔗 * SketchCow gets his tiger outfit out for the evening
08:18 🔗 * BlueMax pulls out a tranquilizer gun
08:18 🔗 BlueMax SketchCow, step away from the tiger outfit.
08:19 🔗 db48x heh
09:02 🔗 BlueMax I get the feeling SketchCow has me on ignore
10:48 🔗 Spirit_ any hint how i can insert random text with all kinds of funny characters (robots.txt files) into a sqlite3 database using Bash?
10:48 🔗 Spirit_ i tried
10:48 🔗 Spirit_ sqlite3 -echo ~/ramdisk/robots.db "INSERT INTO files (time, domain, content) VALUES ('${today}', '${domain}', '${filecontent}');"
10:48 🔗 Spirit_ but if i have single quotes in the file content, then it obviously explodes around my ears
12:38 🔗 Spirit_ alright, got it
12:38 🔗 Spirit_ i think i will store them straight into sqlite then
12:38 🔗 Spirit_ probably saves a lot of space due to tiny files otherwise
12:39 🔗 Spirit_ this might grabbing 100k per day possible for me
13:51 🔗 Spirit_ aaaargh fuck, why did aria2c save gzipped stuff
13:54 🔗 Spirit_ for example "aria2c http://sohu.com/robots.txt"
13:54 🔗 Spirit_ if anyone knows how to make it stop that, please tell me
14:16 🔗 Spirit_ apart from those and some files that are too huge to be sent through Bash it seems to work well
15:00 🔗 Spirit_ maybe i could simply grep downloaded files for ^Disallow OR ^Allow and remove non-matches
15:00 🔗 Spirit_ i even got some PNG and GIF as responses...
16:59 🔗 Spirit_ nice, 500mb turned into a 200mb database
17:03 🔗 Spirit_ ah, --http-accept-gzip=false
17:04 🔗 Spirit_ :((
17:04 🔗 Spirit_ no, Default: false
17:06 🔗 Spirit_ oh nice http://aria2.sourceforge.net/aria2c.1.html#aria2_optref_on_download_complete
17:07 🔗 Spirit_ maybe i can call a script that checks the file and deletes it if it is garbage
17:14 🔗 Spirit_ http://pastebin.com/raw.php?i=naCaSyuZ
17:19 🔗 Spirit_ ^data$
17:19 🔗 Spirit_ i would delete the following:
17:19 🔗 Spirit_ "HTML document"
17:19 🔗 Spirit_ "application"
17:19 🔗 Spirit_ "gzip compressed data"
17:19 🔗 Spirit_ ^empty$
17:19 🔗 Spirit_ "PE32 executable"
17:20 🔗 Spirit_ "image data"
17:20 🔗 Spirit_ any thoughts?
17:44 🔗 Spirit_ dammit, it does not work
17:45 🔗 Spirit_ stupid me
18:03 🔗 Spirit_ done
18:03 🔗 Spirit_ calling a script with this inside:
18:03 🔗 Spirit_ if test "$(file $3 | grep -E '(.*data.*|.*empty.*|.*HTML document.*|.*gzip compressed data.*|.*application.*|^PE32 executable.*|.*image data.*)' )"; then rm $3; fi
18:03 🔗 Spirit_ yes, it is as ugly as my backside
18:59 🔗 Spirit_ wicked, http://imgur.com/robots.txt
18:59 🔗 Spirit_ project starts to take shape though, i am downloading, discarding bad files and in the end putting all of today's files in a sqlite3 db
19:32 🔗 Spirit_ ~30 minutes to download 10k, 5 minutes to insert them into db. nice
19:33 🔗 Spirit_ 60 megabytes as files, 8 as database :)

irclogger-viewer