[00:47] I haven't gotten to it.
[00:47] Been busy
[00:49] evening gents
[00:49] Evening.
[00:50] morning
[00:50] ahoy
[00:50] 464479 github repos for those playing the home game
[00:52] just hit the 5TB mark
[00:53] SketchCow, I have been building up a spam list to help the admins save time - http://archiveteam.org/index.php?title=Category:Deleteme
[00:53] Excellent. Thank you.
[00:53] github repos?
[00:59] namespace: yes, im downloading them all
[01:01] SketchCow, Do you like the idea of creating a category for historical pages so we can lock them from editing so they do not get defaced?
[01:02] Yes, if it's done orderly.
[01:21] We currently have 32 categories - http://www.archiveteam.org/index.php?title=Special:Categories
[01:37] I found 3 pages that are all the same and only contain 1 line of text. Should I just redirect 2 of them to the third?
[01:37] http://archiveteam.org/index.php?title=MediaWiki_Guidance_for_unexperienced_users
[01:39] omf_: sure, sounds good; what are these pages?
[01:40] http://archiveteam.org/index.php?title=MediaWiki_formatting_explained
[01:40] and http://archiveteam.org/index.php?title=Media_Wiki_syntax_explained
[01:41] I was thinking more of a 'How to use our Wiki' page
[01:41] oh, heh, totally go for it
[01:41] those are the same page
[01:41] not according to the history; all 3 were created by different users
[01:41] http://en.wikipedia.org/wiki/Wikipedia:Be_bold this rule applies to archiveteam wiki too
[01:41] same page as in they ought to be
[01:41] got you
[01:41] yea
[01:43] I want to put a few lines down about using templates to set up project pages since I didn't do that for a few pages I made, which I am working on fixing now
[01:45] Can we make this page http://www.archiveteam.org/index.php?title=Current_Projects embedded on the front page so the currently active projects list stays synced up?
[01:45] that's a great idea
[01:45] you mean like a transclude?
[01:46] http://www.mediawiki.org/wiki/Transclusion
[01:46] yes
[01:46] Didn't know the term
[01:46] does that help?
[01:47] I cannot edit the front page
[01:47] ok
[01:48] neither can I, apparently
[01:48] only admins as far as I know
[01:48] rite
[01:51] SketchCow: why does the cert on https://www.archiveteam.org/ belong to 'smoketest.in' ?
[01:57] Okay I started the new page and I am putting in the redirects http://www.archiveteam.org/index.php?title=How_to_use_our_wiki
[02:06] chronomex: I do not know
[02:06] chronomex: because they share the same IP, and nobody's set up SNI
[02:06] figures
[02:06] ok
[02:06] (or so I assume)
[02:08] right
[02:15] http://i.imgur.com/Clg4x.png
[02:15] Usenet has consolidated to almost nothing
[02:16] that's really sad
[02:18] Fuck the New York AG who started that limiting usenet crap
[02:19] We protecting the children
[02:21] You would think they'd have much more important shit to worry about
[02:23] Anything to make your name more popular
[02:24] those are servers with full feeds only? or are there no universities with servers these days?
[02:25] looks like full feed only
[02:29] Many universities in the US started cutting back after all the major ISPs cut their feeds
[02:29] less peer points
[02:30] sure, I know mine got out around 2008
[02:30] 2008 was when the law stepped in
[02:31] huh
[02:44] anyone grab share.opml.org before it died? http://scripting.com/stories/2008/01/23/shareopmlorgRetired.html
[03:26] alard, or GLaDOS around?
[03:45] Usenet is dead.
[03:45] Then again, it could use a modernization.
[03:45] Not NNTP.
[03:46] NNTP is just Usenet over IP as far as I know.
[03:49] The idea is good, it just needs a decent reboot.
[03:57] COMING THIS SUMMER, THE CLASSIC INFORMATION DISTRIBUTION SERVICE USENET RECEIVES AN U-U-U-U-UPGRADE!
[03:59] maybe with some ads too
[04:03] we have seen it. Things ranging from forum software, to stackoverflow to 4chan
[04:03] forums focus on smaller niches like an individual newsgroup did
[04:03] stackoverflow and the like has moderation
[04:03] and 4chan provides anonymous everything
[04:04] usenet was federated and none of my examples are. That is the major difference I see
[05:30] huh. Somehow my warrior is suddenly processing 18 items at once.
[05:37] refresh the page, that happens if the connection to the warrior breaks
[06:04] hi!!
[07:40] 23:49 < namespace> NNTP is just Usenet over IP as far as I know.
[07:40] Wrong
[07:53] lol
[09:13] Does anyone have experience using duplicate image finding software? I was thinking it might be useful for us to have a tool like that on hand
[10:59] SketchCow: where's that Usenet diagram from? (it doesn't include individual.net or aioe.org or eternal-september.org -- is it only servers with binary groups or something?)
[11:03] Does anybody know if there's plans to grab the Linux Game Tome? Its shutdown was announced a couple of days ago, but I haven't seen anything on the AT site.
[11:04] The site is timing out for me at the moment, but the shutdown message on the Google cache says we've got until 13th April: https://webcache.googleusercontent.com/search?q=cache:www.happypenguin.org%2Fnewsitem%3Fid%3D11236
[11:04] They've posted a dump on the site, I believe.
[11:05] So we're going to grab it, yeah.
[11:05] The message says they plan to post it, but there's no date. I'm not aware of any dumps available yet, but I can't reach the site proper right now.
[11:05] It'd be really nice if they do provide a dump. Makes things easy.
[11:08] I am already grabbing happypenguin
[11:09] So the dump is definitely online? Awesome.
[11:09] nope
[11:09] I got a cron job that will start a grab as soon as the site comes back up
[11:09] Nice job. Don't wait for them. :)
[11:09] I never wait for any dump
[11:10] unless the dump is released with a closing statement I assume it will not show up
[11:11] What wget options are you going to use to grab it?
[11:26] whatever works
[11:28] I looked at copies of the site in the Wayback Machine. No crazy ass cross domain anything
[11:28] All assets and pages come from one server
[11:29] Too easy. Cross domain resources was going to be my next question.
[11:30] this design is from the 1990s
[11:30] There was no fucking around like there is now.
[11:30] you don't remember server-side imagemaps?
[11:31] I remember when CDNs were kept hidden from the user to provide a more seamless experience. Now fucking every site exposes every cdn it uses
[11:31] wget can still grab those
[11:31] yeah with 100,000 requests
[11:31] one for every pixel
[11:32] I would take that over the fact that most of the ispygame sites cannot be gotten with wget
[11:32] I have pages of wgets I tried
[11:32] ok
[11:32] goodnight btw
[11:32] later
[11:37] GLaDOS, you around?
[11:37] I am
[11:38] What do you think of taking the current projects part on the main wiki page and just making it a transclusion to the current projects page. That way when the current projects page gets updated it would update just that section on the main page as well
[11:39] You have been doing those updates recently
[11:39] That would require the Current Projects page to be only editable by trusted people.
[11:39] So
[11:40] I suppose it's possible.
[11:40] It is just a suggestion to remove having the same data in two places that can get out of sync
[11:40] I'll look into it.
[11:41] brb
[12:26] SketchCow: It's not enough at any rate.
[12:41] Hmmm, shrug if Feedburner goes down. A lot of links to a lot of posts will be crap
[15:26] What's up.
[15:27] yahoo messages still too slow for my liking
[15:28] I think alard split the big jobs though so all that's left is redundant?
[15:29] maybe
[15:37] Is THAT what I saw happen?
[15:40] ok re-reading scrollback there may be some large single threads left still
[15:42] yeah those are the ones I'm worried about
[15:47] Please, could someone WARC-WGET http://www.naoldcatholic.com ? Multiple leaders just resigned, including their main clergy, and financial problems are coming to light
[15:49] I got a grab going now
[15:50] that site is really slow though
[15:50] slow for me too
[15:51] I am sure it is.
[15:51] It's a national church that just imploded!
[15:57] ah..bad luck
[16:00] geez splitters much https://en.wikipedia.org/wiki/List_of_Old_Catholic_Churches#Other_churches_with_Old_Catholic_origins
[16:26] Standard Website Hosting - archiveteam.org (04/10/2013 - 04/09/2014) $239.40 USD
[17:01] SketchCow, the grab for that catholic site just finished. 261 pages
[17:01] Thanks.
[17:02] it took 45 minutes, their servers are so slow
[20:00] Hi! So a much beloved homepage www.jhaible.de (idiosyncratic synth maker) is now down on account of the guy dying, with partial backups at http://web.archive.org/web/*/http://www.jhaible.de/ ... I believe I have a full backup, alas, not in warc, just individual files. Is it of use to anyone?
[20:03] sure, make a .zip and upload to an archive.org item
[20:04] Thanks
[20:04] Is .tar OK or is .zip really preferred?
[20:04] either is ok, .tar.gz etc are not supported by the archive browser on the site currently
[20:05] Thanks!
[20:05] put in the community texts collection for now and an admin can move it to archive team's area later
[20:10] Thanks!
[20:11] Uploading...
[20:35] 1/3 of the way there... it's a slow connection for uploads...
[21:16] Here you go: https://archive.org/details/Www.jhaible.de Thanks!
[21:23] no, thank you
[21:24] DFJustin: Not sure if you want to thank him or if you are declining his offer
[21:25] the former
[21:25] English is tricky
[21:26] that can be correct
[22:01] him?
[22:01] Yeah, English isn't the most logical language, I'm afraid.
[22:13] Right, I'm off. Tally ho!
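
Note on the "WARC-WGET" grabs requested in this log: nobody in the channel says which wget options were actually used (the only answer given is "whatever works"), so the following is just a minimal sketch of one plausible invocation, written as a Python helper that builds the argument list. The function names `build_wget_warc_command` and `run_grab`, the one-second wait, and the choice of the hostname as the WARC file name are all assumptions for illustration. The wget flags themselves (`--mirror`, `--page-requisites`, `--no-parent`, `--wait`, `--warc-file`, `--warc-cdx`) are standard options in wget 1.14 and later.

```python
import subprocess
from urllib.parse import urlparse


def build_wget_warc_command(url, wait_seconds=1):
    """Return an argument list for a recursive wget grab that also writes a WARC.

    Hypothetical helper: the flag selection is an assumption, not the
    options anyone in the log reported using.
    """
    host = urlparse(url).hostname
    if not host:
        raise ValueError(f"cannot find a hostname in {url!r}")
    return [
        "wget",
        "--mirror",                # recursive download with timestamping
        "--page-requisites",       # also fetch images, CSS and scripts
        "--no-parent",             # never climb above the starting directory
        f"--wait={wait_seconds}",  # pause between requests on slow servers
        f"--warc-file={host}",     # wget appends .warc.gz to this name
        "--warc-cdx",              # write a CDX index next to the WARC
        url,
    ]


def run_grab(url):
    """Run the grab and return wget's exit code (needs wget 1.14+ on PATH)."""
    return subprocess.run(build_wget_warc_command(url), check=False).returncode
```

Calling `run_grab("http://www.naoldcatholic.com")` would start the grab. Keeping the command builder separate from the runner makes it easy to print and review the exact arguments before pointing them at a struggling server.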