[00:02] *** www2 has quit IRC (Read error: Operation timed out) [00:03] *** GLaDOS has quit IRC (Ping timeout: 272 seconds) [00:04] *** GLaDOS has joined #archiveteam [00:04] *** swebb sets mode: +o GLaDOS [00:05] *** thechip has quit IRC (Quit: Leaving...) [00:15] *** cbb has joined #archiveteam [00:17] *** signius has quit IRC (Read error: Operation timed out) [00:17] *** www2 has joined #archiveteam [00:18] *** X41 has joined #archiveteam [00:18] hi guys [00:18] quick question: you don't happen to have a up to date backup of TPB, do yoU? [00:20] i think someone said the latest is from 2013 [00:21] dang i already have that one, but tons of new torrents exist :/ [00:21] i think i'm gonna make a new backup as soon as TPB is back up and try keeping it up to date [00:21] though i'm pretty sure someone scraped it already. it's not a big script, and the file amount is minimal. it should surface any time. [00:22] *** Ymgve has quit IRC () [00:22] i thought that's how those "proxys" work [00:22] turns out most of them really just are proxys [00:22] but there are one or two pages that are actual mirrors [00:23] some of them still work, if you don't use the search function [00:25] *** yipdw changes topic to: Archive Team: We're not archive.org | http://archiveteam.org/ | lengthy/off-topic discussion: #archiveteam-bs | Shipwrecked pirates: 2013 is latest backup [00:26] *** cbb2 has joined #archiveteam [00:27] wtf, archiveteam.org is blacklisted by ur1.ca [00:27] *** yipdw changes topic to: Archive Team: We're not archive.org | http://archiveteam.org/ | lengthy/off-topic discussion: #archiveteam-bs | Shipwrecked pirates: http://is.gd/3mNFUy [00:27] *** cbb has quit IRC (Ping timeout: 265 seconds) [00:27] *** cbb2 is now known as cbb [00:28] seems like the back ups http://archiveteam.org/index.php?title=Pirate_bay#Backups are out of date. here's the next torrents from id 8m on: http://webcache.googleusercontent.com/search?q=cache:u2of9QsRuF8J:thepiratebay.se/torrent/9267841/_Backup_of_The_Pirate_Bay_%28IDs__8000000_-_8999999%29_&strip=1 [00:29] "Seeders: 0 Leechers: 1" though :/ [00:29] i wouldn't bet on it [00:29] *** www2 has quit IRC (Read error: Operation timed out) [00:29] that's the google cache [00:29] i'm gonna fire the torrent up to see if there are any seeders [00:30] the cache is from a few days ago. maybe someone continued seeding now, but i wouldn't bet on it [00:31] nope, torrent is kill :I [00:31] *** signius has joined #archiveteam [00:32] thepiratebay.ee seems to be working just fine [00:33] never fucking mind that [00:33] why? [00:33] you have to register and pay a 4€ fee per year [00:33] i mean it's not like that's much, but eh... [00:35] theres a whole series of those torrents [00:35] http://webcache.googleusercontent.com/search?q=cache:-dZzWibkRM8J:thepiratebay.se/torrent/8044295+&cd=1&hl=en&ct=clnk&gl=us&client=firefox-a is the 770000-79999999 ne i think [00:36] nevermind what i've said, you can simply guess passwords for tpb.ee [00:37] when did https://github.com/tpb-archive get taken down btw? [00:37] *** tittys has joined #archiveteam [00:38] guess? and what is it? [00:39] huh [00:39] https://github.com/tpb-archive/8xxxxxx [00:39] weird [00:40] 1xxxxxx, 2xxxxxx and 4xxxxxx are gone/never existed [00:40] everything up to 8xxxxxx seens to be working just fine though [00:40] very odd, but the rootpage shows a 404 [00:41] maybe github never fully pulled it down [00:41] whatever the case i'll see if i can clone it... [00:41] probably just banned the public profile and 2-3 repos or sth like that [00:41] same here, already at it [00:43] 4xxxxxx is 'disabled by github staff' [00:43] https://github.com/tpb-archive/4xxxxxx/commits/master [00:43] yeah that one probably got banned while 1 and 2 never existed [00:43] 2xxxxxx is down/gone [00:43] neevr existed [00:45] that shits 2 years old though anyway [00:45] better than nothing :/ [00:46] *** www2 has joined #archiveteam [00:48] http://webcache.googleusercontent.com/search?q=cache:-dZzWibkRM8J:thepiratebay.se/torrent/8044295+&cd=1&hl=en&ct=clnk&gl=us&client=firefox-a is in cache, should abot deal with that one too? [00:48] in the interest of organization please take TPB recovery discussion to a different channel - I suggest #yarharfiddlededee [00:48] Lord_Nigh: nah, it's already in the git [00:50] http://webcache.googleusercontent.com/search?q=cache:3BT1SmNGWwgJ:thepiratebay.isohunt.to/torrent/7736542/Backup_of_The_Pirate_Bay_%28IDs__3200000_-_7700000%29_LISTING+&cd=3&hl=en&ct=clnk&gl=us&client=firefox-a contains the 4xxxxxx range which is blocked on the git [00:51] that's just the listing [00:51] git only goes up to 85xxxxxx so the torrent up to 89xxxxx... [00:51] oh, damn. [00:52] the 4xxxxx range is probably well and truly lost then [00:52] http://thepiratebay.ee/user/andronikov/ [00:54] use 696969 as pass [00:54] or 123000 [00:54] saved page [00:54] or any other number combination so many work lol [00:54] is the magnet link on the wiki? [00:55] i think that's the one, yeah :/ [00:55] hmm the magnet for 80xxxxxxx-89xxxxxx is not, but this oen is [00:55] where is the 80xxxxxx - 89xxxxxxx? [00:55] i think i've missed that one [00:56] well, http://thepiratebay.cr seems to work from time to time [00:56] X41: http://webcache.googleusercontent.com/search?q=cache:-E7dqIP0dDUJ:thepiratebay.isohunt.to/torrent/9267841/_Backup_of_The_Pirate_Bay_%28IDs__8000000_-_8999999%29_+&cd=2&hl=en&ct=clnk&gl=us&client=firefox-a [00:57] its an isohunt mirror of tpb but the magnet should be good [00:57] oh right, the 0 seeds torrent [00:57] eh, maybe someone will wake up a seed now that tpb is down? [00:57] i'll try adding trackers to the list, might help [00:59] *** cadbury_ has joined #archiveteam [01:07] *** X41 has quit IRC (Read error: Connection reset by peer) [01:12] *** signius has quit IRC (Read error: Operation timed out) [01:13] http://www.pixlbit.com/blog/8839/allgamecom_shutting_down [01:16] allgame is shutting down december 12 [01:22] *** brains has joined #archiveteam [01:26] *** signius has joined #archiveteam [01:26] *** www2 has quit IRC (Read error: Operation timed out) [01:27] *** cf has joined #archiveteam [01:29] *** cf_ has joined #archiveteam [01:30] *** primus104 has quit IRC (Leaving.) [01:31] *** mistym has quit IRC (Leaving...) [01:33] *** cf has quit IRC (Read error: Operation timed out) [01:33] *** cf_ is now known as cf [01:33] *** ZorbaBeta has joined #archiveteam [01:34] *** GLaDOS has quit IRC (Ping timeout: 272 seconds) [01:34] *** ersi has quit IRC (Read error: Operation timed out) [01:34] *** ZorbaTHut has quit IRC (Read error: Connection reset by peer) [01:34] *** GLaDOS has joined #archiveteam [01:34] *** swebb sets mode: +o GLaDOS [01:35] *** ersi has joined #archiveteam [01:35] *** swebb sets mode: +o ersi [01:39] Start: its been grabbed by the bot already, not sure how much it would have changed in the past three weeks or so. [01:41] damn wheres x41 [01:49] *** mistym has joined #archiveteam [01:54] *** brains has quit IRC (Ping timeout: 240 seconds) [01:54] *** Lord_Nigh has quit IRC (Read error: Operation timed out) [01:56] *** cbb has quit IRC (Quit: cbb) [01:57] *** Lord_Nigh has joined #archiveteam [01:57] *** cf_ has joined #archiveteam [01:57] *** balrog sets mode: +o Lord_Nigh [01:58] *** cf has quit IRC (Ping timeout: 265 seconds) [01:58] *** cf_ is now known as cf [02:04] *** cf_ has joined #archiveteam [02:05] *** cf has quit IRC (Ping timeout: 258 seconds) [02:05] *** cf_ is now known as cf [02:09] *** LordNigh2 has joined #archiveteam [02:11] *** Lord_Nigh has quit IRC (Ping timeout: 272 seconds) [02:11] *** LordNigh2 is now known as Lord_Nigh [02:11] tell x41 my interent is not strong enough to clone those git repos. he's on his own. [02:22] *** cadbury_ has quit IRC (Read error: Operation timed out) [02:24] *** LuckyMonk has joined #archiveteam [02:24] How updated is the piratebay backups, where can the latest backup be found? [02:24] I'm looking for something post 2014 but I cannot even find that. [02:25] It seems like the site was archived once for proof of concept but never again. [02:25] *** arbin has quit IRC (Read error: Connection reset by peer) [02:26] *** cadbury_ has joined #archiveteam [02:27] *** Nertsy has joined #archiveteam [02:28] I really hope this wont get me banned but, WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD [02:29] Nertsy: the secret word is: yahoosucks [02:29] as long as you don't spam you should be fine [02:30] That made me laugh. And sweet. [02:31] Thanks btw [02:37] *** chfoo has quit IRC (Remote host closed the connection) [02:41] *** commentat has quit IRC (Ping timeout: 240 seconds) [02:44] LuckyMonk: theres a backup up to april 2014 that's floating around [02:44] up to id 999xxxxxx [02:44] Hmm that would be extremely handy [02:45] other than that i don't believe so. i don't think we have access to the backend stuff which ran the site either [02:45] just served pages [02:45] I have the XML file open in sublime text CTRL+F is fine for me [02:46] i'm curious if we should start archiving demonoid, because if tpb stays down, demonoid is gonna be the next target [02:47] isn't demonoid down? [02:47] Oh shi, yeah that is up [02:48] i should httrack the torrent files on there [02:48] yes. and if the MPAA has its way, it won't be for long. [02:49] well this was hardly a MPAA/US thing [02:50] mostly the swedish police; of course the MPAA and RIAA are the catalyst but I'm not here to talk about pirate politics, I just want my free shit. [02:51] there's lots of interesting stuff that isn't specifically pirated content on sites like demonoid and tpb [02:51] stuff that often you can find nowhere else [02:52] may have been pushed by sony to try to take down copies of the stuff from the sony pictures hack [02:52] LuckyMonk: Sweden does what Big Brother tells 'er [02:53] I'm stunned I can't find anything other thank bilek's orginal 76mb archive [02:55] well there's this (really old) one https://archive.org/details/TPB_index_20090815 [02:55] *** Nertsy has quit IRC (Quit: Leaving) [02:56] *** chfoo has joined #archiveteam [02:58] *** defcon_ has joined #archiveteam [02:59] we got a recent archive of tpb? [02:59] ... [03:01] magnet:?xt=urn:btih:E4B6F847647211B930219492ECF1A9C7BC696D29 [03:01] which archive is that? [03:01] *** Nertsy has joined #archiveteam [03:02] that is a 76 MB archived file from Bilek Febuary 2013 [03:02] ya thats ages ago [03:02] That is the latest I've been able to find [03:02] i was looking for something at least in 2014 [03:02] I've been on the channel discussing this a bit for a few minutes.. [03:02] LOL [03:02] http://puu.sh/dopas/6aafa55597.png [03:03] I said the exact same thing [03:03] you'd think that there would be a continuous update of the archive [03:03] agh, a hash that hashes to hashes of hashes of hashes of all the things that can be pirated [03:03] actually just a hash that downloads an xml file of hashes [03:03] its not as recursive as it sounds unfortunately [03:04] id fucking pastebin it if that were possible [03:04] if the whole tpb can be archived into 90mb or less, then we can get up a new tpb in no time [03:05] not quite, there are alot of issues with that, the magnet links need to be up to date, some type of security would need to be setup to prevent almost immediate closure by the web host [03:05] and then there is the whole, develop the website part [03:05] should be simple with PHP to parse the XML file and output shit, but lets just wait to see how perminant this shutdown is [03:06] imo we should be httracking google's cached version [03:06] if you can spider those links somehow [03:06] probably need to add a google cache prefix, but I think there is a (maybe?)random seed generated in the URL you need to put in to output a page [03:07] *** yipdw changes topic to: Archive Team: We're not archive.org | http://archiveteam.org/ | lengthy/off-topic discussion: #archiveteam-bs | Shipwrecked pirates: #yarharfiddlededee [03:07] Ty [03:09] Can we get an OP in that channel [03:09] also are there other channels with this discussion topic? they should be added to the page that was the shipwrecked pirates page [03:09] i think if we all quit, when someone joins it again they get ops [03:09] if i understand how it works [03:10] Shipwrecked pirates: http://is.gd/3mNFUy [03:10] Urgh, URL Shorteners. [03:10] * ersi shakes fist [03:10] rejoin now [03:12] *** defcon_ has quit IRC (Quit: Page closed) [03:12] LuckyMonk: rejoin? [03:12] *** _defcon has joined #archiveteam [03:25] *** dx has quit IRC (Read error: Operation timed out) [03:26] *** ruukasu has quit IRC (Quit: WeeChat 1.0.1) [03:27] Sketch [03:28] SketchCow: i hope you save the Telethon livestream that your doing thursday [03:31] *** ruukasu has joined #archiveteam [03:36] anyways i'm starting to uploaded G4 Icons seaason 1 [03:38] Of course I'll save it [03:39] you will be getting ralph baer episode of icon this week i hope [03:50] *** xk_id has quit IRC (Read error: Operation timed out) [03:54] *** LuckyMonk has quit IRC (Ping timeout: 240 seconds) [03:56] *** cf_ has joined #archiveteam [03:57] has anyone saved the data from the stats.minecraft.net torrents? [03:57] the torrents appear to be dead, as does stats.minecraft.net [03:59] typical mojang/minecraft - it'll probably be back shortly [03:59] *** cf has quit IRC (Ping timeout: 265 seconds) [03:59] *** cf_ is now known as cf [04:11] *** _defcon has quit IRC (Leaving) [04:11] *** RainbowCo has joined #archiveteam [04:38] *** Lord_Nigh has quit IRC (Read error: Connection reset by peer) [04:39] *** Lord_Nigh has joined #archiveteam [04:47] *** dx has joined #archiveteam [04:58] *** aaaaaaaaa has quit IRC (Leaving) [05:14] *** rejon has joined #archiveteam [05:39] *** Muad-Dib has joined #archiveteam [05:42] *** RynO has quit IRC (Quit: Page closed) [05:58] *** Start is now known as StartAway [06:05] *** dx has quit IRC (Ping timeout: 369 seconds) [06:18] *** dx has joined #archiveteam [06:35] *** thefox has joined #archiveteam [06:37] *** the_fox has quit IRC (Read error: Connection reset by peer) [06:50] *** cadbury__ has joined #archiveteam [06:51] *** cadbury_ has quit IRC (Read error: Operation timed out) [06:51] *** ats has quit IRC (Read error: Operation timed out) [06:56] *** ats has joined #archiveteam [06:59] https://github.com/tpb-archive is a 404 [06:59] Should I remove it from the wiki page http://archiveteam.org/index.php?title=The_Pirate_Bay [07:00] http://archiveteam.org/index.php?title=Main_Page [07:00] http://archiveteam.org/index.php?title=Talk:The_Pirate_Bay&action=edit&redlink=1 [07:00] Ctrl-S: the page is blocked as a 404 BUT the repos still exist [07:00] sorry, addon screwed up copy/paste [07:00] https://github.com/tpb-archive/5xxxxxx [07:00] https://github.com/tpb-archive/6xxxxxx [07:00] https://github.com/tpb-archive/7xxxxxx [07:00] https://github.com/tpb-archive/8xxxxxx [07:00] https://github.com/tpb-archive/3xxxxxx [07:01] https://github.com/tpb-archive/4xxxxxx however is blocked by github admins [07:01] and i don't know if there were any other repos in there [07:01] i.e. backend code etc [07:01] or web asp/php code [07:01] we should probably try to host it ourselves [07:01] that data is only up to march or may 2013 [07:02] we have the data up to 9999999 (april 2014) from another source [07:02] please join #yarharfiddlededee [07:02] *** commentat has joined #archiveteam [07:30] *** MMovie has quit IRC (Read error: Operation timed out) [07:34] *** MMovie has joined #archiveteam [08:06] https://mega.co.nz/#F!QJYASZaJ!S_Cmki11WnO7GPhZxeGhSg <- from reddit, those are supposedly some of the lewin lectures that mit took down [08:06] see http://np.reddit.com/r/Physics/comments/2ope15/mit_indefinitely_removes_online_physics_lectures/cmpazdb?context=3 [08:07] I'm saving them from archive.org atm [08:13] from? or for? [08:15] in case they get asked to take it down by mit and comply with said hypothetical request [08:15] links to pages http://np.reddit.com/r/Physics/comments/2ope15/mit_indefinitely_removes_online_physics_lectures/cmpazdb [08:17] *** mistym has quit IRC (Remote host closed the connection) [08:19] *** brayden has quit IRC (Read error: Operation timed out) [08:19] if they get asked to take it down it will get blacked out. afaik NOTHING gets deteled on archive.org (except corrupt files/etc) [08:20] illegal in hosting juristiction mateials? [08:21] you have to ask someone with more experience or knowledge than i have to answer that one [08:44] Lord_Nigh: Ctrl-S: what's the question? [08:45] ctrl-s wanted to know if archiveteam or IA (which are not the same thing, i know) will delete stuff that is illegal to host in whatever jurisdiction the servers are in. I answered that I don't know. [08:45] 99% of things is illegal to *publish*, not illegal to *possess* [08:45] thus, darking [08:45] *** primus104 has joined #archiveteam [08:46] making it unavailable publicly but keeping a copy in the archives [08:46] (library status makes that a lot easier, too) [08:46] that's what i thought [08:49] What a fantastic conversation not to have. [08:49] SketchCow: as you can see, we are capable of learning [08:53] (or did you mean 'this shouldn't have been spoken of'... you're kind of hard to read) [09:04] I don't mind discussions of wanting to save material. I don't like seeing speculation on what the edge-legal-case might be for illicit materials [09:05] I don't control anything, but it's a flag to me when someone comes in, glistening and new like a freshly baked cupcake, going AND SO WHAT DOES IT TAKE TO GET SHUT DOWN [09:06] I feel like there's so much work to do, so much to grab [09:06] I'm now cleaning up FOS intensely, getting projects into the archive, etc. [09:06] I just set up a mirror of Stone Oakvalley, which is a pretty critical component of C64/Amiga history. Archive.org is now a mirror for it. [09:07] So, if I'm allowed to summarise. [09:07] Ctrl-S: don't ask what guarantees IA provides; making further copies is good, yes. [09:07] Assume everything in the world disappears and be delighted when it doesn't. [09:08] Go to town but you might as well use the ia/internetarchive python client. [09:08] I might try the client when i am not so sleep deprived [09:09] also since there are people listening here, could someone who knows about the warrior stuff take a look at this? http://pastebin.com/GjbVGenZ [09:10] nvm, they're helping on #warrior [09:14] Good news, I have an auto-updated for archivebot, so that stuff will go in much quicker. [09:14] neat [09:28] https://thepiratebay.cr/ seems up [09:29] how do i help rip it? [09:30] wait search is broken [09:30] Here's a shocker for you. [09:30] I wouldn't be ripping any "piratebays" for the next week [09:31] honeypots ahoy? [09:31] ha ha no, Jason Bourne [09:31] Just a bunch of craptastic mirrors [09:35] * joepie91 sighs [09:35] Ctrl-S: if it's up, it's not TPB [10:10] *** brayden has joined #archiveteam [10:20] *** xk_id has joined #archiveteam [10:21] i think i need one of these: http://www.guru3d.com/news-story/seagate-launches-8tb-archive-hdd-at-250-euro.html [10:42] me too [10:45] i dont trust seagate anymore =/ [10:45] Not available to mortals at the moment anyway. But WD 6TB drives are easily available. [10:47] I want some of those for backups [10:47] 3TB drives are still the cheapest in terms of terabytes per euro/dollar, as long as you don't mind having more physical drives [10:47] I have scripts that have their own 4TB drives [10:47] nice :) [10:48] too lazy to impliment some sort of multi-disk support [10:49] i've had a few seagate drives and they've been a bit naf [10:50] *** cadbury__ is now known as cadbury_ [10:50] more specifically, they've died [10:50] I've got an RMA in for a WD drive at the moment, thing just failed out of the air [10:51] that is about the same price bet tb as a nas hdd [10:51] [although was acting craply beforehand, which is usually a good warning] [10:51] at least seagate are giving it a 3y warranty [10:51] any SMART warnings? [10:51] or clicks? [10:51] Plenty of clicks but SMART said "just fine", as it does.. [10:51] was it an always on drive? reasonably stable thermal? [10:51] does anyone want to give me some 3tb hard drives for christmas? [10:52] I have some spare 1 and 2 tb drives [10:52] ok [10:52] It was an always-on NAS, just a target for backups of other machines and data across the network. So no real data loss, just a nuisance. Did get a little bit hot over summer, whcih is when I first started mistrusting it. [10:52] nothing kills drives like heat. [10:53] what's your definition of hot? [10:53] room temp over 25c. [10:53] *** indigo_ has quit IRC (Read error: Connection reset by peer) [10:53] disk temp? [10:53] [prescription medicine adopts the same measure] [10:53] i noticed that the IA drives i have get very hot [10:54] Not sure, the front end always said "OK!" [10:54] did it not give you an actual number? [10:54] (Nasty WD MyCloud embedded thing.) [10:54] oh [10:54] that's a bit poor [10:54] only bought it because it was cheap. [10:54] i assume it was passively cooled? [10:54] man, it was slow, though. [10:54] I think it had a tiny fan inside the enclosure [10:54] no significant airflow [10:54] my 2tb usb drives never get hot [10:54] it'll get rather toasty without any airflow, especially when active [10:55] *** schbirid has joined #archiveteam [10:55] The NAS firmware won't have helped as it loved to keep indexing hundreds of thousands of files, even though I asked it not to [10:55] i refuse to run my storage as JBOD [10:55] "I'll just index all these GIFs so you can play them on your XBox with DLNA" - er, no, stop it. [10:56] it's being helpful! [10:56] heh. ;) [10:56] at least it was still under warranty [10:56] Yup - and it was only a backup target, so no critical data there. I've got other bigger NAS enclosures for things I don't want to lose. [10:57] so more of a "well that's inconvenient" [10:57] * antomatic nods [10:57] i nearly broke m 8TB ZFS array a few months ago [10:58] that was an interesting learning experience [10:58] *** indigo_ has joined #archiveteam [10:58] Do it, grab allgame [10:58] that's what worries me more than losing any drive - actually losing an array [10:59] as long as you use a suitable level of RAID and a mix of disks from different batches, it's very unlikely [11:03] i am running a normal wget -> warc on allgame, just in case, no time for something better atm [11:15] *** www2 has joined #archiveteam [11:36] *** ruukasu has quit IRC (Ping timeout: 265 seconds) [11:42] antomatic: Qnap nas? [11:45] *** BlueMaxim has quit IRC (Quit: Leaving) [11:50] *** www2 has quit IRC (Read error: Operation timed out) [11:55] *** dx has quit IRC (Ping timeout: 265 seconds) [11:58] *** dx has joined #archiveteam [12:17] *** MMovie1 has joined #archiveteam [12:18] *** Ymgve has joined #archiveteam [12:20] *** MMovie has quit IRC (Ping timeout: 335 seconds) [12:23] cadbury_: idk about that, I've heard a number of different instances of hosting companies losing entire arrays [12:23] due to RAID card failure [12:27] i've had that happen to me with a HP SAN [12:27] updating firmware, let me just remove all your raidsets for you [12:27] not that important, we had backups [12:27] lol [12:29] *** www2 has joined #archiveteam [12:32] well yea, there's always the possibility of the card failing [12:32] but i don't use a card, software for me [12:39] midas: I use Drobos, which I know some people aren't too positive about, but I'm just too dazzled by the ability to mix and match completely different sized drives. [12:40] (and bang in new larger drives to extend the array on-the-fly, rather than having to rebuild, etc) [12:41] bit of a dubious site, but somebody should probably go through the links here and archive them (the ones not on PDFy, anyway): http://www.rhizzone.net/forum/topic/12402/ [12:41] (PDFy ones should already have been mirrored) [12:43] *** cf has quit IRC (cf) [12:56] *** Meeh has quit IRC (Quit: No Ping reply in 180 seconds.) [12:57] *** Meeh has joined #archiveteam [13:19] *** ruukasu has joined #archiveteam [13:33] SketchCow: I accidently put movies as itemtype for the newspapers. can you please change the mediatype to texts for these newspapers for me? https://archive.org/search.php?query=uploader%3A%22Arkiver%40hotmail.com%22%20metro%20brasilia%20newspaper%20mediatype%3A%22movies%22 [13:38] *** nertzy has joined #archiveteam [13:39] *** MMovie has joined #archiveteam [13:41] *** MMovie1 has quit IRC (Read error: Operation timed out) [13:48] *** sankin has joined #archiveteam [13:53] *** sankin has quit IRC (Client Quit) [13:56] *** nertzy has quit IRC (Quit: This computer has gone to sleep) [13:58] *** Froggypwn has quit IRC (Read error: Connection reset by peer) [13:59] *** Froggypwn has joined #archiveteam [14:04] *** sankin has joined #archiveteam [14:07] and also rederive them ^? [14:13] *** Morbus has quit IRC (Quit: http://www.disobey.com/) [14:24] *** lrkj has quit IRC (Ping timeout: 612 seconds) [14:26] *** Morbus has joined #archiveteam [14:36] Hey. Just out of curiosity, anyone have an archive.today clone? I'm interested in running my own. [14:37] Or know about one? [15:02] *** arbin has joined #archiveteam [15:16] *** ruukasuu has joined #archiveteam [15:18] *** ruukasu has quit IRC (Ping timeout: 265 seconds) [15:22] *** ruukasuu has quit IRC (Quit: WeeChat 1.0.1) [15:22] *** ruukasu has joined #archiveteam [15:25] is ent some mails to allgame/allmedia asking if they would release the content under a free license for future generations to enjoy [15:38] *** StartAway has quit IRC (Read error: Operation timed out) [15:50] *** mistym has joined #archiveteam [15:51] *** mistym has quit IRC (Remote host closed the connection) [15:57] *** BiggieJo1 has quit IRC (Read error: Connection reset by peer) [15:57] *** Nertsy has quit IRC (Quit: Nertsy) [15:59] answer is Apologies, but we cannot. [15:59] due to licensing [16:07] *** Nertsy has joined #archiveteam [16:13] *** Start has joined #archiveteam [16:18] *** aaaaaaaaa has joined #archiveteam [16:37] re allgame [16:37] http://archive.fart.website/archivebot/viewer/job/aeefs [16:48] *** lhobas has quit IRC (Ping timeout: 272 seconds) [16:53] Nertsy: try https://github.com/ikreymer/pywb-webrecorder [16:55] Does it render DOM elements like how Archive.today does? (if not, how would I go about doing that? PhantomJS?) [16:56] it uses your browser as a recorder, so it does whatever your browser will do [16:57] also see webrecorder.io for a more elaborate deployment of that code [16:57] there's also #archivebot [16:57] which can hook into PhantomJS, though PhantomJS does seem to have some leak issues [16:58] *** Start has quit IRC (Read error: No route to host) [16:58] *** dx has quit IRC (Ping timeout: 246 seconds) [16:58] Hmm. Fascinating. Another interesting way to archive stuff. [16:58] the main benefits of #archivebot are immediate IA upload, integration into IA's Wayback Machine, and my occasional frustrated tirades at people leaving hundred-gigabyte jobs unattended [16:59] you don't always know when a job is that big :/ [16:59] How do I go about using #archivebot? [17:00] /join #archivebot [17:00] i want to point out that the default archivebot mode is not PhantomJS [17:00] Wget right? [17:00] so if you're doing stuff that requires replay on that level you will probably be happier with webrecorder [17:00] wpull [17:00] Ah [17:02] *** Start has joined #archiveteam [17:02] *** dx has joined #archiveteam [17:03] Just a question, if I were to independently archive a website, what format is the most accepted? Raw html files (one for each page)? [17:08] depends what your goal is and who you want to accept it [17:15] *** mistym has joined #archiveteam [17:16] warc! [17:17] http://archiveteam.org/index.php?title=The_WARC_Ecosystem [17:23] http://archiveteam.org/index.php?title=The_WARC_Ecosystem [17:40] *** VonCloud_ has joined #archiveteam [17:40] *** Start has quit IRC (Read error: Connection reset by peer) [17:43] *** parsons has joined #archiveteam [17:45] *** ruukasu has quit IRC (Quit: WeeChat 1.0.1) [17:49] *** GLaDOS has quit IRC (Ping timeout: 272 seconds) [17:50] *** GLaDOS has joined #archiveteam [17:50] *** swebb sets mode: +o GLaDOS [17:53] if we get trouble with allgame, i have a nice contact there now. [18:03] *** mistym has quit IRC (Remote host closed the connection) [18:04] *** ruukasu has joined #archiveteam [18:04] *** mistym has joined #archiveteam [18:12] *** lhobas has joined #archiveteam [18:29] *** mistym has quit IRC (Remote host closed the connection) [18:30] *** mistym has joined #archiveteam [18:31] *** brayden has quit IRC (Ping timeout: 607 seconds) [18:35] anybody get a copy of https://github.com/ReCaptchaReverser/InsideReCaptcha ? [18:37] https://github.com/toogle/InsideReCaptcha.git [18:52] *** Start has joined #archiveteam [18:58] *** rejon has quit IRC (Ping timeout: 480 seconds) [19:00] *** www2 has quit IRC (Ping timeout: 335 seconds) [19:02] *** signius has quit IRC (Read error: Operation timed out) [19:17] *** signius has joined #archiveteam [19:17] *** ete_ has joined #archiveteam [19:41] *** aaaaaaaa_ has joined #archiveteam [19:45] *** phuzion has quit IRC (Read error: Operation timed out) [19:47] *** xtr-201 has quit IRC (Read error: Operation timed out) [19:47] *** aaaaaaaaa has quit IRC (Read error: Operation timed out) [19:47] *** aaaaaaaa_ has quit IRC (Client Quit) [19:47] *** Start has quit IRC (Read error: Operation timed out) [19:47] *** aaaaaaaa_ has joined #archiveteam [19:47] *** phuzion has joined #archiveteam [19:48] *** Froggypwn has quit IRC (Read error: Operation timed out) [19:48] *** xtr-201 has joined #archiveteam [19:49] *** Froggypwn has joined #archiveteam [19:57] *** aaaaaaaa_ has quit IRC (Ping timeout: 480 seconds) [20:02] *** BlueMaxim has joined #archiveteam [20:05] *** mistym_ has joined #archiveteam [20:28] *** logchfoo starts logging #archiveteam at Wed Dec 10 20:28:35 2014 [20:28] *** logchfoo has joined #archiveteam [20:42] *** Arkiver2 is now known as arkiver [20:43] *** thechip has joined #archiveteam [20:45] *** NovaKing has joined #archiveteam [20:48] *** brayden has joined #archiveteam [20:51] *** Elegance has quit IRC (Ping timeout: 265 seconds) [20:58] *** kyan has quit IRC (Read error: Connection reset by peer) [21:14] *** Elegance has joined #archiveteam [21:30] *** kyan_ has joined #archiveteam [21:33] *** www2 has joined #archiveteam [21:36] *** APerti has joined #archiveteam [21:36] *** Ymgve has quit IRC () [21:39] *** APerti_ has quit IRC (Ping timeout: 370 seconds) [21:41] *** T31m_ has quit IRC (Quit: Leaving) [21:47] *** xk_id has quit IRC (Read error: Operation timed out) [21:49] *** Start has joined #archiveteam [21:58] *** schbirid has quit IRC (Leaving) [21:59] *** ruukasu has joined #archiveteam [22:03] *** ruukasu has quit IRC (Client Quit) [22:04] *** ruukasu has joined #archiveteam [22:08] *** mistym_ has quit IRC (Quit: Leaving...) [22:24] Back from visiting Facebook HQ [22:24] Everything is fine, we can shut down, Facebook will have it all under control [22:24] * SketchCow burps the gallons of free food he got [22:25] now scuse me... nap [22:25] SketchCow: yeaaaaaaaa right [22:25] :P [22:26] *** Start has quit IRC (Read error: Operation timed out) [22:33] * thefox eats the gallons of food after it is burped out. No sense wasting food. [22:34] *** SN4T14_ has joined #archiveteam [22:39] thefox: burping is not vomiting [22:39] *** SN4T14 has quit IRC (Ping timeout: 369 seconds) [23:19] *** mistym has joined #archiveteam [23:22] *** warthurto has quit IRC (Ping timeout: 265 seconds) [23:23] *** okeuday has quit IRC (Ping timeout: 265 seconds) [23:23] *** Fusl has quit IRC (Ping timeout: 265 seconds) [23:23] *** pfallenop has quit IRC (Ping timeout: 265 seconds) [23:23] *** dashcloud has quit IRC (Ping timeout: 265 seconds) [23:23] *** nico has quit IRC (Ping timeout: 265 seconds) [23:24] *** Insomnia1 has quit IRC (Ping timeout: 265 seconds) [23:24] *** Insomnia_ has joined #archiveteam [23:24] *** warthurto has joined #archiveteam [23:24] *** Fusl has joined #archiveteam [23:24] *** pfallenop has joined #archiveteam [23:25] *** okeuday has joined #archiveteam [23:25] *** tev|stdby has quit IRC (Ping timeout: 265 seconds) [23:27] *** tev|stdby has joined #archiveteam [23:27] *** nertzy has joined #archiveteam [23:28] *** dashcloud has joined #archiveteam [23:29] *** cadbury_ has quit IRC (Read error: Operation timed out) [23:33] *** cadbury_ has joined #archiveteam [23:39] *** nico has joined #archiveteam [23:46] *** Start has joined #archiveteam