[00:42] this might be of interest to some folks here: http://sphirewall.net/ they're building an opensource linux firewall/router that provides advanced user management and bandwidth/analytics (not based on iptables) [02:25] looks like archive.org is hating me again [02:26] also 50 items are paused right now [02:28] its giving me the slowdown bs error again [02:29] http://www.dannychoo.com/en/post/26517/Japanese+Retro+Games.html [03:27] what the hell, i slept normally [03:28] wp494: you've been busy on the wiki today, thanks for fixing the "hostings" thing that was bothering me too [03:38] wp494: also, i generally don't use {{unknown}} for project source/tracker in the project infobox, because that puts them in the "Unknown status" category, but if you think that's the right thing to do then it's okay [04:49] i didn't win the 13 boot magazine cds [05:40] did that massive miror backup of Apple II FTPs get backed up? [05:40] https://archive.org/details/asimov.apple.archive.emulators.2013.03 [05:42] or wait maybe it's https://archive.org/details/apple_collection_v1.02 [05:42] godane: aww :< [05:52] ah good to see [05:54] so who wants to help me think of stuff here [05:57] for what? [05:57] BlueMax: using bing to find pages on webtv.net [05:58] oh...you're on your own with that one [05:58] give me some interesting things to search for, like the highest-value stuff on homepages [05:58] i've already done stuff like geneaology or however you spell that, family history, family tree, memorial, in memoriam blah blah [06:01] schools, school clubs, school reunions, colleges [06:01] S[h]O[r]T: good thinking! [06:02] thanks, you just gave me like 20 new ideas [06:02] i clicked some snapjoy link earlier that was pictures of an after school club :p [06:03] S[h]O[r]T: your suggestions just added like 500 new pages to my list and counting [06:11] winr4r: fanfiction [06:13] DFJustin: ran that one already :) [06:13] fan page! [06:13] good idea [06:13] * winr4r runs that [06:14] S[h]O[r]T: make that 1k with your ideas, btw :) [06:18] winr4r, web ring, journal, diary, poetry, essay, pets, baby pics, zines, gallery, star trek. are a few ideas I just had for searches [06:19] history [06:19] y2k [06:19] omf_: super! [06:19] yes, thank you [06:20] ivan`, was working on a markov chain search tool using keywords from dmoz on the google reader feeds, it might be overkill for this project but I think its worth looking into [06:21] yes, it is [06:26] haha, glad i ran a search for y2k because you get alllll sorts of cray http://community-2.webtv.net/tlandrews/FEARFULSIGNSOFEND/index.html [06:27] might as well do "end of the world" too [06:27] or end of days [06:27] yes, i just ran end times :) [06:27] i honestly have to commend M$ for the fact that webtv has even lived this long [06:27] S[h]O[r]T: yeah, it's quite remarkable [06:28] and even here, they're offering a bunch of migration options for everything but the pages (but that's quite a loss in itself) [06:28] MS didn't have another in to that market, they do now with the xbox [06:28] the whole smart TV, use your tv for shit market that has never seemed to pan out [06:29] look at how Apple and Google are still trying [06:29] yes, true [06:29] I think cell phones and tablets are turning out to be a better fit that smart tvs [06:29] shit I am looking into getting a project so I can get rid of my 40" LCD [06:30] smaller, easier to secure, can take it to parties [06:30] a few friends of mine have made the switch and they love the freedom [06:30] * winr4r nods [06:31] (you've added about 1.5k with your suggestions by the way, so thanks) [06:31] Maybe we should keep this search list on the wiki for other projects that need old web page digging [06:32] not a bad idea that [06:32] i'll sed it out from my bash history when i'm done [06:32] cool [07:00] only 19 more before we hit 10k unique pages! [07:02] sweet [07:19] 10,447 :) [07:20] and 1697 of my 5000 bing API requests left [07:27] I found this to be a spot on take of the bullshit that is now the hollywood tent pole movie - http://www.vulture.com/2013/07/lone-ranger-is-everything-wrong-with-hollywood.html?test=true [07:27] I started noticing this in force when Terminator 4 came out [07:28] here is a choice quote - demonstrates the industry’s franchise obsession, origin-story laziness, over-reliance on bloodless violence, and inability to prevent running-time bloat. [07:32] i'm not sure this is something new, or at all recent [07:33] I agree, I think it started more in the 90s [07:33] but every fucking loser studio is doing it now [07:36] Fast and Furious 7 anyone? [07:37] Some movies series make sense like Lord of The Rings or most comic book movies [07:38] haha, f&f 7 [07:38] although [07:38] I also understand that by using existing characters that it lowers the barrier for the audience to understand the back story but lets be fucking real here people, Hangover 3, Ring 2, star wars 37 [07:38] i kind of want to see it just because they have a mark 1 ford escort in it and those are cooool [07:39] I liked fast 5 more than the other movies in the series but what is that really saying? The back story of the whole thing is good guy cop and angry sometimes bad guy [07:40] fast 5 became no longer about the cars [07:41] true, I hadn't thought about it that way [07:41] i mean I enjoy those films, few of that I do enjoy anymore [07:41] but the lack of cool cars made it just another action movie :/ [07:41] take out vin diesel and put in daniel craig [07:41] oh look it's a bond movie [07:42] tank and all. [07:42] obviously change other chara's too [07:45] (●°﹏°●) [07:47] my point is the "bad guy" wasn't .... interesting [07:47] interesting bad guys are few and far between [07:47] I didn't like Skyfall but I did like the villan [07:48] hmmmmm yah [07:50] SmileyG: also hi [07:50] mistym: and also hi! [07:51] woohoo, apparently just searching for "john" and "david" added like 400 new pages to my webtv.net list [07:51] hey winr4r [07:52] * winr4r thinks of more common names [07:52] winr4r: english names? [07:52] SmileyG: yup [07:52] coke (as in coke cola) just did 2 lists of most popular names (As they printed them on their bottles) [07:52] want? [07:52] James, Patrick, Matthew, Jason, Robert [07:53] http://cokestudio.coca-cola.com/tccc-sacn-webapp/findyourname?lang=en [07:53] 200 names, enjoy :) [07:53] Oh, 250 now :D [07:56] aw, 646 api requests left [07:56] I have been giving more thought to the whole blink/webkit thing. I understand why the split was made [07:56] Google is doing a lot of things to the browser that apple just does not give a shit about [07:57] Like being a developer, Apple would rather you buy some shit from them then give it to you for free in the browser. I mean they are killing shadow DOM in their repo [07:58] yup [07:58] Plus let us all not forget how shitty Apple has been about giving code back upstream since the beginning [07:58] :) [07:58] i'm rather looking forward to cheap firefox OS devices [07:58] me too [07:58] Is there good documented proof of that omf_ ? As I argue with apple lovers and they just go "but apple does open source"..... [07:59] then we can forget all about google and apple forever [07:59] hopefully the firefox ones push android too :) [07:59] 3 man race is far more interesting. [07:59] SmileyG, yes when you look back on the KHTML mailing lists about how apple just shit out code at them and the KHTML developers had to spend significant time getting it to fit in [08:00] If you break down what is running on a MacOS X system the bulk is open source [08:00] open source with a BSD license so they do not have to give anything back [08:01] The biggest push FreeBSD has had is Apple [08:01] and they still hardly give back shit [08:01] omf_: hmmmm [08:01] why run the older versions of bash? Because they are not gpl [08:01] no point me pointing people to that, they'll just go "derrp?" [08:02] Bottom line is Apple makes money off of hardware and locked in media [08:02] yes, yes they do [08:02] everything else is secondary to that [08:02] oh and don't forget all these discussions were in a -chat channel of a open source distro too [08:02] hasn't bash always been GPL? [08:02] correct [08:02] it changed after 2.05 [08:03] bash is now gpl 3 partly because people like Apple used it and never gave back [08:04] I am running Bash 4.2.24 and yet the newest OS X is still 2.05 [08:05] wait, didn't 2.05 ship in like 2003? [08:05] i recall using it on freebsd 5.1 anyway [08:05] it is old [08:09] HOORAY used up all 5000 API requests [08:09] Seems very likely it's because of the license. [08:10] If you look at older version of OS X there are more gpl applications than there are now [08:10] it is all part of the closed ecosystem approach [08:10] Huh, pretty funny - I started scraping/discovering LiveJournal usernames (for the Google Reader-grab project) and started one scraper with two seed usernames. I know have around 700k total names from those two - where about 80k are "communities" and the rest are users. [08:11] now in most cases I choose and promote gnu licenses to stop this but BSD does have its place and to me that is in web applications where shit gets crazy real quick using multiple libraries. The other reason BSD works on the web is nothing is compiled and you always get the source code on demand for the client side [08:11] omf_: Well, I guess it worked pretty good for bootstrapping. A lot of things they didn't need to make, at that time. [08:12] GLaDOS: are you around? [08:12] Cool, doubleop <3 [08:12] GLaDOS: trying to put my list of URLs into the hastebin, but it exceeds the maximum length [08:13] (it's about 800k) [08:13] winr4r: try p.defau.lt [08:13] what about making it a pad instead? [08:14] or that [08:14] underscor: just tried that, pasting into there crashes my browser, lol ;\ [08:14] fuck it, dropboxing it for now [08:20] 800k of what? [08:20] SmileyG: list of URLs found on webTV.net [08:20] 12627 of them! [08:20] https://dl.dropboxusercontent.com/u/57276499/at/wtv-final-sorted.txt [08:20] Ah [08:21] needs archiving? [08:21] SmileyG: yup, MSN TV is ending, lots of homepages are going away [08:21] Ah crap, I didn't realise there was homepages etc on there [08:22] yes [08:22] Right, to #archiveteam to discuss [08:24] ersi, I agree that bootstrapping with open source is a great time and money saver. The question I have does moving away from that help Apple more than leveraging the shared knowledge of the existing communities [08:24] it stinks of NIH [08:27] unless theres already a channel for msntv? [08:28] not yet according to the wiki [08:28] lets brainstorm a name [08:29] deadtv [08:29] statictv [08:30] tv2.oooh [08:30] crashNburn (Reference to Hackers, where they "take over" a TV station) [08:30] lol [08:30] ZeroCool checking in [08:30] ;D [08:31] "Mess with the best, die like the rest" [08:31] I love that line [08:31] It doesn't even rhyme though, which makes me a little disappointed [08:31] I always imagine how good that film could have been with the same actors with a great script [08:32] Yeah [08:33] Pool on the roof must have a leak [08:33] yes [08:33] I think we should go with ersi's deadtv. [08:34] winr4r, what do you think? [08:34] or cobbletv [08:34] cobbled.. web, etc [08:34] crippletv [08:34] the google logo is currently an alien in a flying saucer [08:36] not sure! [08:37] deathbyMS maybe [08:37] we have like almost two months, though, so we're not in a rush [08:37] and what's on webtv.net is, i think, only what's left of their paying subscribers [08:37] so you know, not all that huge [08:49] omf_: I like that one [08:49] #deathByMS [08:50] winr4r: yeah but better to get started soon :) [09:10] I am looking for new movies to watch after work. What having people been watching this year? [09:10] I saw '2012' yesterday [09:10] which version? [09:10] Latest [09:11] Lots of CGI and shizzle [09:11] John Cusack ersi ? [09:11] It was watchable, not awesome - but fairly watchable. Some story gaps and maybe they use the "OMG NOW ITS HAPPENING RUUUN; *runs*; *escaped just in time*"-effects [09:12] omf_: Yes, with John Cusack [09:12] 5.8 IMdb score. Not.. great :D [09:13] I think Roland Emmerich earlier works are better than his current offerings [09:14] ersi: as a geolgist my head exploded when i watched that movie [09:14] I mean two id4 sequels [09:14] Like I said, watchable, not great [09:14] Or well, watchable, as long as you're not an geologist I guess :D [09:14] hehe that's true [09:15] then again, I'm stupid enough to watch movies which got IT in 'em in some way [09:15] Jurassic Park lol, "I know this interface" [09:15] I don't get as frustrated these days, it's pretty interesting to see how others see a field [09:15] "gui in visual basic" [09:15] ewww CSI NY [09:16] fucking worst line ever [09:16] I know this, it's a UNIX system [09:16] omf_: jurassic park which I LOVE, got some bad paleontology and IT [09:16] I liked the book a lot better, different characters die [09:17] I cringe at hearing about Jurassic Park 4 [09:17] yes [09:17] but I went and saw Jurassic Park 3d and will see Jurassic park 4 as well if it comes out [09:18] I'll see it [09:19] I wouldn't mind a new back to the future that continued where the trilogy left off if the story was good and they got the actors back [09:20] yes would watch that! [09:21] A new Blade would be nice as well but Disney is retarded when it comes to making R films [09:21] As a movie buff there are very few films in my opinion that should have sequels [09:22] Secret of Nimh is always on my list [09:22] I think a live action Aladdin could be cool [09:23] The Phantom with an older Billy Zane would be cool as well [09:25] instead we'll see Saw 12 in 3D [09:26] :P [09:31] I will just say it: Movies are trying to become TV series [09:32] You get the audience on the tit and take their money time and time again [09:32] Indeed [09:32] Which is why I watch fewer and fewer mass produced shits from Hollywood each year [09:33] Imagine the boner hollywood would have if they had Game of Thrones as movies instead of HBO [09:33] Mmmhm. [09:33] There is an interesting interview with Martin about how there have been offers for years and he said no because it would be shit. Just like whats his name did for Watchmen [09:34] ersi, I am in your boat. I see less films in the theater because it is just shlock [09:37] here is a 2013 movie release list, it makes me cry http://www.imdb.com/search/title?year=2013,2013&title_type=feature&sort=moviemeter,asc [09:39] I swear Ashton Kutcher has to fuck everyone in hollywood to keep getting good roles that he proceeds to fuck up [09:39] I can not think of anything other than The 70'ies Show when I see his damn face in movies [09:39] He should've stayed there [09:41] The Hunger Games is just a shitty watered down version of Ray Liotta in "No Escape" [09:41] and yet boners all around for that soon to be series of films [09:43] I mean fuck people Hunger Games has another film this year. 2 films in 2 years [09:43] it is Twilight all over again [09:44] * omf_ gets out soapbox [09:45] I remember when films were designed to get money from people who had tons to spare aka the rich. Now it is get money from the retard masses who cannot even read a book cover to cover [09:46] and this isn't nostalgia either, there were plenty of crap movies before [09:47] maybe it is just the ever increasing of marketing bs [09:47] * omf_ puts soapbox away [11:00] i found all the original hd episodes of diggnation [11:00] for 2007 [11:03] and its starting to look like wayback machine has all torrents of digganaion past episode 55 [11:07] i take that back [11:07] we have more then that [11:29] also looks like m4v and mov are the same file [11:29] at least for episode 16 of diggnation [11:29] md5sum is even the same [11:49] e [12:00] g4tv.com-video61600-flvhd: First 15: Marvel Pinball - Civil War: https://archive.org/details/g4tv.com-video61600-flvhd [12:01] yes we have game footage of Marvel Pinball - Civil War [13:05] winr4r: keep splitting the list in half until it pastes! [13:10] Yeah, maxlen was 400k [13:10] try again [13:11] It's now 400k with a few added zeros. [13:48] GLaDOS: haha thanks [13:49] chromium gets sad just trying to save and load it [13:49] but it's done! [13:49] \o/ [13:49] thanks for fixing that for me [13:49] :) [13:49] And the server hasn't died. [13:49] If it did, I wouldn't be talking to you! \o/ [13:51] haha [13:52] diggnation original 2005 episodes are uploaded [13:52] godane: *salutes* [13:53] http://paste.archivingyoursh.it/ficequtape.avrasm [13:53] i figured its best ot have a full collection of diggnation with meta data [13:53] Anyone get the Jakob Neilson reports that are pay? There is a report I think I would buy but I would like a second opinion [13:53] since there was transcodes of the older ones to hd for some reason [13:54] godane: ..upscaling them? [13:54] only episodes number 113+ was ever in hd [13:56] it was like upscaling the phone video format to hd [13:57] back in 2005/2006 [13:57] ah gotcha [14:03] also i'm about to hit 7.2tb of upload [14:05] cool godane how are you measuring it? Keeping track before you upload? [14:05] here: http://www.us.archive.org/metamgr.php?&w_uploader=slaxemulator@gmail.com&mode=more [14:05] i have bookmarked [14:05] it [14:12] I just checked myself, I am at 1,431.34 gigabytes [14:12] not including running the warrior. Your count is higher than that godane since you run the warrior on projects too [14:13] crikey [14:16] damn how did you manage that, I'm at 848 GB [14:16] godane I can understand because literally all he does is upload things [14:16] how do you tell? [14:16] scroll up [14:17] where: &w_uploader=djsmiley2k@Gmail.com | imagecount: 70,469,034| size: 1,452,586,132 KB [14:18] 1385.29 gb SmileyG [14:18] wow :) [14:18] Of course I'm kind of cheating ;) [14:20] Xanga from anarchive is going up under my username ;) [14:23] here is something for Jason: http://www.ebay.com/itm/Official-Xbox-Magazine-1-103-All-Excellent-Shape-All-Demo-Discs-For-Collector-/111114506664?pt=Video_Games_Games&hash=item19def0d1a8 [14:23] shi [14:23] t [14:23] I've just had a thought [14:23] 1 thur 103 offical xbox magazine and 103 demo discs [14:24] * SmileyG ponders [14:24] I know a few guys at imagine publishing in the UK. [14:24] * SmileyG goes to ask questions of them [14:24] doubt they can offically help me, but who knows. [14:24] issue 55 and 92 are missing in that set [14:26] pc gamer lot of 79 disks: http://www.ebay.com/itm/PC-Gamer-Lot-of-Demo-CDs-from-2001-to-2008-79-discs-in-total-/130943569930?pt=Video_Games_Games&hash=item1e7cd8500a [14:28] are we missing any of those pc gamers at this point [14:32] i think anything 2007+ [14:35] just fixed a typo [14:35] one of the dates of the pc gamer sayed it was 205-05 [14:36] when it should be 2005-05 [14:36] this one is now fixed: https://archive.org/details/PC_Gamer_DVD_Issue_148_Side_B_May_2005_PCG148DB0505 [14:39] pc gamer magazine demo discs 2009-2011: http://www.ebay.com/itm/PC-GAMER-MAGAZINE-Demo-Discs-2009-2011-Lot-of-22-/251301223957?pt=Video_Games_Games&hash=item3a82b85e15 [14:40] 2006-2008 demo discs: http://www.ebay.com/itm/PC-GAMER-MAGAZINE-Demo-Discs-2006-2008-Lot-of-19-/251301222903?pt=Video_Games_Games&hash=item3a82b859f7 [14:42] Cannot write to 'pouet.net/prodlist.php?year=2003&reverse=1&order=name&page=1432' (Success). [14:42] FINISHED --2013-07-06 12:07:11-- [14:42] Total wall clock time: 60d 12h 57m 49s [14:42] Downloaded: 3918573 files, 250G in 3d 0h 20m 50s (1006 KB/s) [14:43] 11 pc gamer cds from 1997-1998: http://www.ebay.com/itm/Lot-of-11-PC-GAMER-CD-demo-discs-1997-1998-includes-3-15-FINAL-FANTASY-VII-June-/181153420783?pt=Video_Games_Games&hash=item2a2d95d5ef [14:56] i just pasted 7.2tb worth of uploads [15:05] so, guys, your thoughts would be valued here [15:06] i've written a tool to scrape a list of mediawiki wikis to get lists of pages/sites hosted on a given domain, for exploring websites that are about to go down [15:07] it occurred to me that rather than (or in addition to) using a built-in list, the list could be maintained on the archive team wiki [15:08] (i'm cleaning up the code to publish it over the next few days) [15:08] is there any reason that this is a bad idea? [15:08] so the tool will grab a page from the AT wiki, parse out the list, use that list as the list of wikis to scrape [15:36] anyone here got a good automatic pdf renamer? something that can read and parse the title of a pdf and rename it? [15:45] Tephra: want some bash-fu? [15:51] winr4r: if it works ;) [15:54] mkdir output; for i in *.pdf; do newfn=`pdfinfo "$i" | grep '^Title:' | sed 's/^Title:[\t ]*//'`; (test -n "$newfn" && cp -i "$i" "output/$newfn.pdf"); done [15:54] pdfinfo is from xpdf i believe [15:55] that'll only copy/rename files that have a title defined [15:55] yes, that's my problem [15:55] you want it to just copy it if it doesn't? [15:56] no, want to parse out the title from the text [15:56] so I have a lot of scientific papers with names like 123.312.234.pdf [15:56] 'cause weily and jstor are stupid [15:57] and they don't have title defined or have wierd title names [15:57] oh, crap [15:58] like Title="glacier.tex" [15:58] yeah, that's more heuristics than bash one-liners care to deal with :) [15:58] yes [15:58] :P [15:58] time to do some Python! [16:00] good luck with that [16:01] winr4r: 7 [16:01] oops [16:01] nico_32: my thoughts exactly! [16:01] 7 all day every day [16:21] hey, uh, what exactly is the connection, if any, between iafcu.org (Internet Credit Union) and the Internet Archive? [16:22] because iafcu has a sizable IA logo at the bottom [16:22] but a few things about the site make me iffy [16:24] https://iafcu.org/management-team/ https://iafcu.org/special-posts/where-did-we-come-from/ [16:25] https://twitter.com/textfiles/status/350352186569531393 [16:26] Internet Credit Union was started by Brewster Kahle [16:26] He functions on the board [16:26] There's one main employee right now, Jordan Modell. [16:26] What makes you feel iffy? [16:28] various things, primarily that there's an IA logo at the bottom that links to the IA main page (and the same logo is used in the credit union logo) [16:28] yet there is no obvious place where the relationship is explained [16:29] the NCUA link is also dead [16:29] and the ATM finder link doesn't work for me either [16:30] the information is apparently there, but it's not obvious to a casual visitor [16:31] a simple way to improve it, for example, would be to make the IA logo at the bottom link to a post explaining what IA is, and briefly mentioning the relationship between the two (perhaps linking to other relevant pages) [16:31] basically, right now, it gives the "as seen in the media" appearance that for example internet marketing sites have, where they casually throw a bunch of logos of well-known media outlets on their page as "endorsement", yet what exactly was broadcast isn't explained and the logos just link to the frontpages for those outlets [16:36] I'll bring it up with him. [16:41] thanks :) [16:42] also, elaboration: "ATM finder link doesn't work for me" as in I get Chrome telling me that "an empty response was received" [16:42] "doesn't work" is typically not a terribly useful snippet of information for debugging, heh [16:43] NCUA link is broken, will be fixed, Jordan says. [16:43] On my chrome, it works just fine. [16:43] http://www.cu24.com/ATMLocator/ is where it's trying to go, can you get that? [16:45] http://owely.com/9Cqg9p [16:45] same happens on my desktop [16:45] it also happens for cu24.com without ATMLocator [18:31] SketchCow: i noticed that you put up 34 issues of sega saturn magazine from the tosec [18:32] it was on twitter that i noticed it [18:32] anyways you should tell them that all 37 issues are here too: http://archive.org/details/sega-saturn-magazine [18:35] looks like only issues 32, 33, 35 are missing from collection [18:39] Yes, we have multiple sets [18:39] I'll find them allll [18:43] also there is better scans by the outofprintarchive.com too [18:44] there are 3 versions of each magazine [18:44] one of ipod & iphone [18:44] and one of tablet [18:44] then a max-rez version [18:44] http://www.outofprintarchive.com/catalogue/officialsegasaturnmagazine/OSSM12.html [21:20] http://web.archive.org/web/20130309104447/http://developers.posterous.com/amazon-ip-ranges-blocked [21:27] lol [21:31] note that's from 2012! [21:33] hm, so it is [22:34] Anyone using virtualbox with kernel 3.9 successfully? the DKMS module for virtualbox won't compile for me on 3.9 [22:35] which version of virtualbox you got dashcloud ? [22:36] 4.1.18 [22:37] 4.2.10 has the Linux 3.9 build fix [22:37] and the current version is 4.2.16 which is stable [22:37] https://www.virtualbox.org/wiki/Changelog [22:51] okay [22:51] thanks! [22:56] glad I could help [23:43] Anyone ever read the book "How People Read on the Web"? [23:48] or "Unix and Linux System Administration Handbook"? [23:50] pretty sure I've read the latter