[10:23] Nemo_bis: got some updated dumps for wiki.bukkit.org due to http://forums.bukkit.org/threads/bukkit-its-time-to-say.305106/
[10:24] https://dl.dropboxusercontent.com/u/50690736/wikiteam/wikibukkitorg-20140821-history.xml.7z -- https://dl.dropboxusercontent.com/u/50690736/wikiteam/wikibukkitorg-20140821-wikidump.7z
[10:25] k
[10:26] also got a big update for the teso.gamepaedia.com dump, but that's still uploading to my dropbox
[10:27] done
[10:27] ok
[10:27] big as in more than one gigabyte bigger :p
[10:27] bigger than the previous dump*
[10:28] lol i dumped it already in ia
[10:28] https://archive.org/details/wiki.bukkit.org
[10:28] just needs a move to the proper collection
[10:28] lol
[10:29] WARC and XML complement each other well, it's good to have both
[10:30] we both did xml :p
[10:30] warc is still in queue
[10:33] Ah, I was tricked by the identifier
[10:40] I also found a broken wikiteam item on IA at https://archive.org/details/wiki-plelderscrollswikiacom
[10:40] I'm also uploading an updated dump of that one to my dropbox
[10:40] :p
[10:41] Can't update it though, until it's in wikiteam collection
[10:41] ah
[10:42] is kissistvan@kibk.extra.hu someone in here?
[10:52] nobody I know
[11:05] it ain't me
[12:39] Nemo_bis: here are the links to those dumps I mentioned earlier http://pastebin.com/raw.php?i=fggweyVq
[12:43] Muad-Dib: what did they do on gamepedia, ten times as big in just 6 months :o
[12:46] I *think* they added a lot of imagery
[12:47] :p
[12:48] it seems they at least added a lot of icons/buttons from the Elder Scrolls Online in .png format
[12:49] but the history also increased tenfold
[12:49] and a whole load of screenshots :D
[12:49] automated edits, maybe?
[12:49] reading values from the game client or something like they did on UESP?
[12:50] SketchCow: can you please move https://archive.org/search.php?query=subject%3A%22wikiteam%22%20collection%3Aopensource into wikiteam collection? I checked the titles and they all belong there
[12:51] Muad-Dib: ok :)
[14:08] Nemo_bis: Here's what I've done - I've shoved a bunch of stuff into Wikiteam. At some point, give me items that should be removed. In a mail.
[14:09] I actually can't say "that search.php result... do "this" with it." I need item names or a way to search among specific lines. Annoying, I know.
[14:24] ia search subject:"wikiteam" collection:opensource should help
[14:25] http://pastebin.com/dFmZDGRv
[14:25] hm we need more results
[14:29] SketchCow: oh sorry, I thought you could reuse the search query in your tool
[14:31] And sure, I'll email any IDs which should be removed
[14:33] I'm sorting godane stuff right now.
[14:33] You can't imagine. YOU CAN'T IMAGINE.
[14:33] Yesterday it was 27,000 items in the "inbox"
[14:33] I've got it down to 6,000
[14:34] But now we're getting into the ones with 'only' 200-300 items in each grouping.
[14:34] holy shit
[14:35] 27,000
[14:35] i did my 1,000th item yesterday
[14:36] I can imagine a bit because I looked at the collection a few days ago :p and also at his items in deriving queue
[14:40] ah that's probably what locked up my derives :p
[14:42] The chances are much, much more likely that it was Jake.
[14:42] Jake's our internal guy who works with institutions, who will do 500,000-item work like it's nothing.
[14:42] (It won't be under his name, it's in partnering)
[14:44] lol
[14:44] is that one of the bookguys? because there was a MASSIVE queue yesterday
[14:46] He's one of MY guys
[14:46] He's in my department, "collections"
[14:46] nice
[14:46] ah my history reappeared today
[14:46] Although I guarantee that most people, when they think of my relationship to the archive, don't think of me as being in a department or having someone I report to.
[14:46] But I do.
[14:47] I usually assume you are a department
[14:48] I am.
[14:49] yay, my assumption was right for once!
[14:51] In recent years, I've described my situation as being like Noam Chomsky to MIT.
[14:56] the mona lisa to davinci
[14:56] the face of archive.org ;)
[14:57] A face
[15:45] SketchCow: You're probably one of only a few people that gets to say that you work in a "collections department" and people don't hate you because of it :)
[15:50] probably helps that SketchCow's collections department doesn't do deaccessioning