[00:06] uploaded: https://archive.org/details/catholiceducation.org-20130712
[00:17] wtf the Digital Ocean site is under maintenance
[00:23] wat.
[00:24] still.
[00:24] Just recovered 50gb more of data from corrupt shit. Today is a great day -> http://www.zeldauniverse.net/gallery/images/6838/large/1_RVL_ZeldaSS_08ss11_E3.png
[00:39] i'm starting to think arcadeathome.com has a 1000 file limit
[00:39] when using wget
[03:05] GLaDOS: poke, the pastebin is down
[03:22] uploaded: https://archive.org/details/g4tv.com-video-thumbnail-images-20130706
[04:07] Today I walked 10 miles.
[04:07] Yesterday, 15
[04:07] So there you go
[04:09] Hot.
[04:17] Did you wear a funny hat?
[04:50] argh
[04:50] http://makezine.com/forums/?DiscussionID=1565
[04:50] old forums content? apparently gone
[04:51] (aside from a couple caches)
[04:52] god damnit
[04:52] or at least not accessible on their site (right now)
[04:58] Are you going to email the site admins?
[06:11] winr4r: back up
[08:38] time to give to joeyh: https://campaign.joeyh.name/
[08:41] dang, the android port does not run on my 2.3 device :(
[08:55] Yeah I got an old android phone too
[08:58] uploaded: https://archive.org/details/cdrom-linuxformatmagazine-156
[08:59] I cannot even run firefox
[09:00] me either
[09:00] g4tv.com-video8196: MSN Living on the net pt. 1: https://archive.org/details/g4tv.com-video8196
[09:06] g4tv.com-video7629: Gary Coleman: https://archive.org/details/g4tv.com-video7629
[09:07] I have had my phone for two whole years, unheard of I know. ;)
[09:09] I've had mine for about the same time I think, maybe bought it in 2010 or 2011
[09:09] g4tv.com-video8389: See the full, uncut interview with DJ Danger Mouse: https://archive.org/details/g4tv.com-video8389
[09:11] g4tv.com-video5980: Confessions of a Software Pirate: https://archive.org/details/g4tv.com-video5980
[09:15] There are too many applications that need android 4, this might be the first time I upgrade a phone for features instead of it just dying
[09:15] I had a g1 before this
[09:16] i bought me a http://www.geekbuying.com/item/Cube-U30GT-10-1-inch-Quad-Core-RK3188-Android-4-1-Tablet-PC-Retina-Screen-1920-1200-2GB-RAM-16GB-BT-Gyroscope-314425.html and it is nice
[09:17] I've had mine for about 2 years but it's starting to break in the weirdest of ways
[09:17] So I'm considering upgrading
[09:17] Schbirid, can you install other linux versions on there?
[09:17] omf_: the video above is the one you want to see
[09:17] omf_: haven't tried yet
[09:18] i wish android would not have moved away from hardware buttons and especially a cursor key
[09:18] text editing is so much harder on this compared to my tiny htc desire with cursor
[09:19] it's the video interview of chris tresco from 'Drink Or Die'
[09:19] I am downloading it now godane
[09:37] I am watching it and the questions are so generic
[09:39] 1.5tb of software at his peak
[09:41] ugh the drink or die guy said it was stealing, did he have to say that because of court? :P
[09:41] I also got that dangermouse video
[09:53] hmmm
[12:24] GLaDOS: so the pastebin appears to be back up, but all the pastes seem to have disappeared
[12:29] ..what
[12:30] winr4r: retry
[12:32] winr4r: this is what I get for not defining config
[12:32] GLaDOS: works, ta :)
[14:09] uploaded: https://archive.org/details/cdrom-linuxformatmagazine-167
[14:45] g4tv.com-video8750: See how to create a digital camera for about $11: https://archive.org/details/g4tv.com-video8750
[14:50] g4tv.com-video8736: Buy PC parts: https://archive.org/details/g4tv.com-video8736
[14:51] the one above has Wil Wheaton
[14:52] and it was the 'broken' one in that it didn't stop with buy pc parts segment
[16:13] hmmm. i think a web site i frequent is about to "relaunch" as a database-driven cms thing instead of static text. does anyone have an easy way i can pull the site and make a warc or whatever?
[16:14] in case some material on it doesn't make the transition
[16:15] pft: https://www.refheap.com/af43be71eaeed8d42f9777af6
[16:15] i'm assuming i need a wget with warc
[16:15] run quick-warc www.domain.com
[16:15] ok
[16:16] heh
[16:16] ivan`, I'd like to confess my newfound love for you for that snippet
[16:16] :)
[16:17] is wget with --warc-file in the archiveteam github?
[16:17] dunno, but it's in the wiki somewhere
[16:17] ok
[16:18] someone really ought to make a wget-lua script that hits a Python server for every URL, asking if it should skip
[16:18] then I could edit the Python script during the scraping adding more things to skip
[16:18] here is my bash function.
function warc() { wget -m -a "$1_$(date +%Y%m%d).log" -e robots=off -nv --adjust-extension --convert-links --page-requisites -nH --directory-prefix="$1_$(date +%Y%m%d)" --warc-file="$1_$(date +%Y%m%d)" --warc-cdx "http://$1/" ;}
[16:19] ah, --directory-prefix, good idea
[16:19] pft: if you use an up-to-date wget, it has all the warc stuff by default
[16:19] I probably misunderstood the question
[16:19] it's in wget 1.14 and above
[16:19] yeah, i'm on debian wheezy and it doesn't seem to have that but i can go grab source and compile
[16:40] http://archiveteam.org/index.php?title=Wget#Creating_WARC_with_wget
[16:40] There's a wget-lua (that's based on wget-1.14) on http://launchpad.net/~archiveteam/ by the way
[16:40] Should be installable on Debian Wheezy as well, I guess
[16:41] Maybe worth putting in that Wiki article, Smiley :)
[16:50] ersi: indeed
[16:50] I was just about to tell ivan` if he wanted to do it himself
[16:50] :-)
[17:28] :O IA has a credit union?
[17:28] http://www.wired.com/wiredenterprise/2013/07/iafcu/
[17:28] o_O
[17:28] :O
[17:30] weird, but ok.
[17:30] * Smiley wishes for IA multi-continental redundancy
[17:30] you can start by getting some space over here
[17:30] :D
[17:30] I wonder if iafcu is in the big credit union network in the us
[17:30] cause if so, I could get an account with them and free service at any of the 50k atms and such
[17:31] the co-op network, that's the name
[17:53] http://help.blekko.com/index.php/does-blekko-have-an-api/
[17:53] HUZZAH I SHALL WRITE TO THEM FOR API KEY
[17:54] (this is for finding sites on dying hosts, it is for a good cause)
[17:57] sweet
[18:22] Aranje: Yup, we do
[18:22] Not part of the big network yet, though
[18:22] We are now bitfloor's primary bank, too
[18:22] brought in something crazy like $15mil usd of assets
[18:25] woot :D
[19:39] uploaded: https://archive.org/details/cdrom-linuxformatmagazine-168
[20:14] uploaded: https://archive.org/details/supertorrents-predb-sql-201307
[20:59] so i got my new maximum pc cds
[21:02] The most accurate link bait title. "Apple Sued For Porn Addiction" http://www.ibtimes.com/apple-sued-porn-addiction-man-says-macbook-cost-his-marriage-kids-1345831
[21:03] * joepie91 sees ibtimes, instantly lowers credibility rating by 50%
[21:04] It is mind blowing the lengths people go to, to blame someone else for lack of either self awareness or self control.
[21:04] joepie91, I found the court docs - http://www.scribd.com/doc/153168246/Chris-Sevier-Apple-Complaint
[21:04] well the filing
[21:04] ...
[21:05] I don't...
[21:05] I just...
[21:05] * joepie91 shoots self
[21:05] from point 13.
[21:05] Apple is well aware that the internet is fire with pornography.
[21:06] ... can cause unwarranted arousal addition better than the purchasers.
[21:06] addiction
[21:06] fucking shit site that does not allow copy and paste
[21:06] the complaint is also fire with typios
[21:06] ;)
[21:06] typos*
[21:08] I need to login to download something you don't own... really. o( ><)o
[21:09] he blames late night sex enhancement drugs too
[21:10] (◎_◎;)
[21:13] fuck yeah! America? ヘ(´o`)ヘ
[21:36] http://arstechnica.com/information-technology/2013/07/what-to-do-with-a-popular-project-that-you-no-longer-want-to-maintain/
[21:36] http://arstechnica.com/gadgets/2013/07/capptivate-a-site-capturing-apps-before-they-disappear-forever/
[21:37] actually
[21:37] #archiveteam material
[21:57] so i have a problem with gbtv/theblaze xml data
[21:57] it looks like there's some dos like code in it that breaks the lines sometimes
[22:00] looks like the dos error is in warc.gz
[22:00] but not in the files themselves
[22:07] https://github.com/mame/quine-relay
[22:17] this might be of interest to the server owners in the room: big issues in IPMI and BMC- https://community.rapid7.com/docs/DOC-2344 and what you might try to do to work around these issues: http://fish2.com/ipmi/bp.pdf
[22:55] so my new gbtv/theblaze dump is working good
[23:07] also i got a pc accelerator cd in my maximum pc cds shipped today
[23:28] so that site refresh that i was talking about happened, what do i do with the warc i generated this morning?
[23:28] is there somewhere i upload it so that the wayback machine can parse it? or will it just be uploaded into a collection like other archiveteam stuff
[23:38] upload it as an item to the internet archive, then one of the admins here can move it to the archiveteam collection
[23:38] at that point the wayback machine will integrate it on their next update pass
[23:46] awesome
[23:49] so just the warc.gz or is this .cdx file useful?
[23:49] sorry about all the newbie questions
[23:57] pft: upload the cdx as well
[23:57] ok
[23:58] once i get home i shall upload
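
Editor's sketch, not part of the log: two points from the discussion above lend themselves to a small shell helper. WARC support is said to be in wget 1.14 and above (16:19), and both the .warc.gz and the .cdx should be uploaded (23:57). The function names version_ge, warc_basename and warc_outputs are hypothetical and do not appear in the log. The <host>_<YYYYMMDD> naming follows the warc() function quoted at 16:18, and the version comparison assumes GNU sort with -V is available.

```shell
#!/usr/bin/env bash
# Hypothetical helpers; names are illustrative and not taken from the log.

# Succeed if version $1 is greater than or equal to version $2.
# Assumes GNU sort, which provides -V (version sort).
version_ge() {
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# Print the base name used by the log's warc() function: <host>_<YYYYMMDD>.
# The optional second argument overrides today's date.
warc_basename() {
    printf '%s_%s\n' "$1" "${2:-$(date +%Y%m%d)}"
}

# Print the two files that wget writes with --warc-file and --warc-cdx,
# which are the ones the log suggests uploading.
warc_outputs() {
    local base
    base="$(warc_basename "$@")"
    printf '%s.warc.gz\n%s.cdx\n' "$base" "$base"
}
```

One possible use: compare the version reported by `wget --version` against 1.14 with version_ge before crawling, then pass the lines printed by warc_outputs to whatever upload tool is preferred.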