[00:00] *** db48x has joined #archiveteam [00:01] *** db48x has quit IRC (Read error: Operation timed out) [00:17] *** Nertsy` has joined #archiveteam [00:18] *** SN4T14 has joined #archiveteam [00:19] *** Emcy_ has quit IRC (ircd.choopa.net irc.eversible.com) [00:19] *** thechip has quit IRC (ircd.choopa.net irc.eversible.com) [00:19] *** signius has quit IRC (ircd.choopa.net irc.eversible.com) [00:19] *** Meeh has quit IRC (ircd.choopa.net irc.eversible.com) [00:19] *** SN4T14_ has quit IRC (ircd.choopa.net irc.eversible.com) [00:19] *** Nertsy has quit IRC (ircd.choopa.net irc.eversible.com) [00:19] *** espes__ has quit IRC (ircd.choopa.net irc.eversible.com) [00:19] *** Baljem has quit IRC (ircd.choopa.net irc.eversible.com) [00:20] *** Emcy has joined #archiveteam [00:27] *** espes___ has joined #archiveteam [00:34] *** thechip has joined #archiveteam [00:43] *** Meeh has joined #archiveteam [00:43] *** signius has joined #archiveteam [00:43] *** Baljem has joined #archiveteam [00:46] done uploading the vine accounts [00:46] https://archive.org/details/51VineAccountsDecember52014 [01:05] *** Wyatt8760 has joined #archiveteam [01:06] *** achip has joined #archiveteam [01:18] *** Lord_Nigh has quit IRC (Read error: Operation timed out) [01:20] *** Lord_Nigh has joined #archiveteam [01:20] *** balrog sets mode: +o Lord_Nigh [01:35] *** mistym has quit IRC (Remote host closed the connection) [01:41] *** Wyatt8760 has quit IRC (Read error: Connection reset by peer) [01:50] *** philpem has quit IRC (Ping timeout: 272 seconds) [01:52] *** mistym has joined #archiveteam [01:54] *** achip has quit IRC (Remote host closed the connection) [01:54] *** xk_id has quit IRC (Remote host closed the connection) [02:30] *** dashcloud has quit IRC (Read error: Operation timed out) [02:32] *** dashcloud has joined #archiveteam [02:43] *** schbirid has quit IRC (Ping timeout: 258 seconds) [02:57] *** schbirid has joined #archiveteam [02:59] *** db48x has joined #archiveteam [03:05] *** db48x has quit IRC (Ping timeout: 258 seconds) [03:11] *** kyan has quit IRC (Quit: Leaving) [03:25] *** primus104 has quit IRC (Leaving.) [03:26] *** xk_id has joined #archiveteam [03:36] *** xk_id has quit IRC (Ping timeout: 600 seconds) [03:38] *** schbirid has quit IRC (Read error: Operation timed out) [03:39] *** deathy has quit IRC (Ping timeout: 272 seconds) [03:52] *** schbirid has joined #archiveteam [04:19] *** mistym has quit IRC (Remote host closed the connection) [04:26] *** xk_id has joined #archiveteam [04:27] *** K4k has joined #archiveteam [04:31] *** K4k has quit IRC (Ping timeout: 258 seconds) [04:37] *** xk_id has quit IRC (Ping timeout: 600 seconds) [04:48] *** deathy has joined #archiveteam [04:48] *** kyan has joined #archiveteam [04:51] *** mistym has joined #archiveteam [04:58] *** deathy has quit IRC (Connection closed) [04:59] *** deathy has joined #archiveteam [05:02] *** aaaaaaaaa has quit IRC (Leaving) [05:06] *** mistym has quit IRC (Remote host closed the connection) [05:07] *** mistym has joined #archiveteam [05:09] *** db48x has joined #archiveteam [05:12] arkiver: should we do a warrior/manual project for club nintendo? [05:12] here's an example item: https://club.nintendo.com/rewards-details/a/45108.do [05:12] items are a maximum of 5 characters long [05:28] *** xk_id has joined #archiveteam [05:34] *** Ymgve has quit IRC () [05:36] *** dserodio has quit IRC (Read error: Operation timed out) [05:38] *** xk_id has quit IRC (Ping timeout: 600 seconds) [05:46] *** dserodio has joined #archiveteam [05:46] there's also the japanese and european club nintendoes, which are completely different [05:51] *** Start is now known as StartAway [06:13] *** rejon has joined #archiveteam [06:26] *** ohhdemgir has quit IRC (Quit: Leaving) [06:28] *** xk_id has joined #archiveteam [06:39] *** xk_id has quit IRC (Ping timeout: 600 seconds) [06:45] *** mistym has quit IRC (Remote host closed the connection) [06:45] *** mistym has joined #archiveteam [07:07] *** rejon has quit IRC (Ping timeout: 512 seconds) [07:29] *** xk_id has joined #archiveteam [07:31] *** winr4r has joined #archiveteam [07:36] *** Nertsy` has quit IRC (Ping timeout: 335 seconds) [07:37] *** xk_id has quit IRC (Read error: Operation timed out) [07:37] *** Nertsy has joined #archiveteam [07:37] *** mistym has quit IRC (Remote host closed the connection) [07:52] *** underscor has quit IRC (Ping timeout: 370 seconds) [08:04] *** jmathai has joined #archiveteam [08:06] Hi folks. Is anyone around to chat about how you can help us archive public pages for Trovebox before we shut down? [08:15] arkiver: ^ [08:19] *** underscor has joined #archiveteam [08:23] Heading to bed. But if someone has any sort of guidelines on what you need to archive pages (a list of URLs, etc) that would be helpful. You can ping me via email (jaisen@trovebox.com) or Twitter (@jmathai). [08:23] Would also be helpful to know if you need a complete list or if your software handles crawling links. [08:23] Thanks :) [08:30] *** xk_id has joined #archiveteam [08:31] I'll be swapping clients again soon - machine is almost ready [08:36] *** xk_id has quit IRC (Read error: Operation timed out) [08:50] looks like i did a backup of tri town times pdfs just in time [08:50] there are no pdfs on the site now [08:55] i figured out there problem [08:55] looks like there using bad urls for tri-town times [09:02] *** jmathai has quit IRC (jmathai) [09:02] *** dashcloud has quit IRC (No Ping reply in 180 seconds.) [09:03] *** primus104 has joined #archiveteam [09:06] *** dashcloud has joined #archiveteam [09:30] *** primus104 has quit IRC (Leaving.) [09:31] *** xk_id has joined #archiveteam [09:41] *** xk_id has quit IRC (Ping timeout: 600 seconds) [10:05] I had the textfiles.com machine going, now it stopped dead again. [10:11] :( [10:32] *** xk_id has joined #archiveteam [10:42] *** xk_id has quit IRC (Ping timeout: 600 seconds) [10:54] *** primus104 has joined #archiveteam [10:56] OK, so. [10:56] I've been informed by someone that http://www.abkingdom.com/ is shutting down March 1st. [10:57] His letter is basically "yes, we are a bunch of freaks but we don't deserve to lose 15 years of material" [10:57] I set an archivebot on it, but that may or may not be enough. [10:58] why is it going away? [10:59] Not clear [10:59] k [10:59] * xmc zzz [11:11] *** signius has quit IRC (Ping timeout: 512 seconds) [11:13] *** primus104 has quit IRC (Read error: Connection reset by peer) [11:17] *** primus104 has joined #archiveteam [11:20] *** signius has joined #archiveteam [11:20] *** BlueMaxim has quit IRC (Ping timeout: 335 seconds) [11:33] *** xk_id has joined #archiveteam [11:34] *** ruukasu has quit IRC (Ping timeout: 265 seconds) [11:39] *** xk_id has quit IRC (Read error: Operation timed out) [11:46] *** MMovie1 has joined #archiveteam [11:49] *** MMovie has quit IRC (Ping timeout: 335 seconds) [12:19] looks like some amount of Abkingdom requires registration/login [12:33] *** xk_id has joined #archiveteam [12:43] *** xk_id has quit IRC (Ping timeout: 600 seconds) [12:52] *** Lord_Nigh has quit IRC (Ping timeout: 272 seconds) [12:52] *** Lord_Nigh has joined #archiveteam [13:23] *** ruukasu has joined #archiveteam [13:29] *** ruukasu has quit IRC (Quit: WeeChat 1.1) [13:29] *** Ymgve has joined #archiveteam [13:33] *** ruukasu has joined #archiveteam [13:33] *** ruukasu has quit IRC (Client Quit) [13:34] *** xk_id has joined #archiveteam [13:39] *** primus104 has quit IRC (Leaving.) [13:44] *** xk_id has quit IRC (Ping timeout: 600 seconds) [13:45] *** ruukasu has joined #archiveteam [13:50] *** ethical_a has joined #archiveteam [13:50] *** Anarhist has quit IRC (Read error: Connection reset by peer) [14:03] jmathai is already offline unfortunately [14:03] I'll email jmathai later today [14:03] Start: I think ArchiveBot should be able to do the whole site [14:09] *** ruukasu has quit IRC (Read error: Connection reset by peer) [14:11] *** K4k has joined #archiveteam [14:11] *** sankin has joined #archiveteam [14:14] *** ruukasu has joined #archiveteam [14:35] *** xk_id has joined #archiveteam [14:39] *** brayden has joined #archiveteam [14:45] *** xk_id has quit IRC (Read error: Operation timed out) [14:49] *** ohhdemgir has joined #archiveteam [14:55] *** brayden has quit IRC (Read error: Connection reset by peer) [14:56] *** brayden has joined #archiveteam [15:02] *** ruukasuu has joined #archiveteam [15:02] *** ruukasu has quit IRC (Ping timeout: 265 seconds) [15:03] *** ruukasuu has quit IRC (Client Quit) [15:04] *** ruukasu has joined #archiveteam [15:05] *** StartAway has quit IRC (Quit: Disconnected.) [15:13] *** rejon has joined #archiveteam [15:36] *** xk_id has joined #archiveteam [15:36] *** sankin has quit IRC (Leaving.) [15:38] *** mistym has joined #archiveteam [15:39] *** mistym has quit IRC (Remote host closed the connection) [15:44] *** xk_id has quit IRC (Read error: Operation timed out) [15:48] *** sankin has joined #archiveteam [15:55] *** mistym has joined #archiveteam [16:02] *** toad2 has joined #archiveteam [16:03] *** toad1 has quit IRC (Read error: Operation timed out) [16:17] *** Start has joined #archiveteam [16:17] *** Start has quit IRC (Client Quit) [16:28] *** Start has joined #archiveteam [16:31] *** primus104 has joined #archiveteam [16:36] *** xk_id has joined #archiveteam [16:38] arkiver: some pages for discontinued rewards on club nintendo aren't referenced anywhere on the main site and thus not saved by archivebot (example: https://club.nintendo.com/rewards-details/a/15513.do) [16:44] *** xk_id has quit IRC (Read error: Operation timed out) [16:47] looks like north american and japanese club nintendo can be scraped sequentially [16:47] i'll do further research on european club nintendo later [16:49] *** rejon has quit IRC (Remote host closed the connection) [16:52] *** Start has quit IRC (Quit: Disconnected.) [17:02] *** mistym has quit IRC (Remote host closed the connection) [17:10] *** aaaaaaaaa has joined #archiveteam [17:12] *** aaaaaaaaa has quit IRC (Client Quit) [17:12] *** aaaaaaaaa has joined #archiveteam [17:31] *** dashcloud has quit IRC (Quit: No Ping reply in 180 seconds.) [17:31] *** ruukasu has quit IRC (Ping timeout: 265 seconds) [17:32] *** dashcloud has joined #archiveteam [17:37] *** xk_id has joined #archiveteam [17:42] *** xk_id_ has joined #archiveteam [17:42] *** xk_id has quit IRC (Read error: Connection reset by peer) [18:05] *** jmathai has joined #archiveteam [18:09] *** Emcy has quit IRC (Ping timeout: 365 seconds) [18:10] *** Emcy has joined #archiveteam [18:17] *** ruukasu has joined #archiveteam [18:19] * jmathai reposts from last night [18:19] Hi folks. Is anyone around to chat about how you can help us archive public pages for Trovebox before we shut down? [18:21] arkiver, SketchCow: ^ [18:21] hi jmathai, i think arkiver might be your best bet to help you, he can build a pipeline so we can grab it [18:22] midas1 arkiver sounds good. what’s the best way to get in touch with them? [18:22] just stick around [18:23] ^ that [18:24] jmathai: arkiver said at 14:03 UTC that he would e-mail you later today. Not sure if he's done so already :) [18:25] Awesome. I missed that message. Thanks. [18:27] No problem :) [19:23] *** philpem has joined #archiveteam [19:30] *** db48x has quit IRC (Read error: Operation timed out) [20:00] so i got the live state of the union coverage that aired on theblaze [20:00] its like Mystery theater 3000 [20:02] *** ruukasu has quit IRC (Quit: WeeChat 1.1) [20:07] ha [20:23] *** Emcy_ has joined #archiveteam [20:26] *** mistym has joined #archiveteam [20:26] *** Emcy has quit IRC (Ping timeout: 512 seconds) [20:44] *** Emcy has joined #archiveteam [20:49] *** Emcy_ has quit IRC (Ping timeout: 512 seconds) [20:58] *** sankin has quit IRC (Leaving.) [21:00] *** Emcy has quit IRC (Ping timeout: 265 seconds) [21:10] jmathai: Thank you for coming here and helping us! [21:10] We don't get that a lot with websites owners, so great you're here! [21:11] Would you be able to be online in the weekend to go over the process of saving your website? [21:18] *** mistym has quit IRC (Remote host closed the connection) [21:18] *** jmathai has quit IRC (Ping timeout: 272 seconds) [21:19] *** schbirid has quit IRC (Quit: Leaving) [21:28] *** BlueMaxim has joined #archiveteam [21:32] everyone: join #inkerasers [21:33] starting very soon with a new batch of comics [21:37] *** nertzy has joined #archiveteam [21:38] Start: I think, if the list of urls not linked to from main pages of the website is not too big, archivebot can handle it with !a < list [21:43] *** Emcy has joined #archiveteam [21:47] *** ruukasu has joined #archiveteam [21:49] *** Ravenloft has joined #archiveteam [21:49] *** Start has joined #archiveteam [21:54] Start: I think, if the list of urls not linked to from main pages of the website is not too big, archivebot can handle it with !a < list [21:54] ok [21:55] i'll prepare a list for each region [21:55] ok [21:55] how many urls do you think this'll be? [21:57] 400,000 for north america [21:58] i'm still building lists for japan and europe [21:58] that should be fine [21:58] we still have a few months for the site [21:59] *** K4k has quit IRC (Ping timeout: 240 seconds) [22:00] *** mistym has joined #archiveteam [22:05] arkiver: i think i've ran into a problem for club nintendo japan [22:05] some items have mini-sites like: http://club.nintendo.jp/present/P125/ [22:06] and these have sub-pages like http://club.nintendo.jp/present/P125/machine.html that an !ao < won't get [22:14] *** jmathai has joined #archiveteam [22:14] Another question. We don’t have any plans on deleting content from Github but some have voiced concerns. I know Github has a robots.txt that asks not to crawl but is there any way our Github issues page can be archived? [22:16] #archivebot doesn't give a shit about robots, iirc. Have a poke around there, jmathai [22:17] with the api yeah [22:17] https://gist.github.com/rodw/3073987 that script does it all [22:17] (apparently) [22:19] *** Start has quit IRC (Quit: Disconnected.) [22:19] Awesome. Who could we hand off a backup generated by that script? The reason I’m asking is that it’s important that the data be backed up by someone other than us :) [22:19] Kazzy: will do [22:19] *** wyatt8760 has joined #archiveteam [22:23] jmathai: upload to archive.org, an admin there (a couple of whom live in here) can put it in the archive team collection [22:23] winr4r: thanks [22:24] :) [22:25] jmathai: not sure if you saw my replies [22:25] jmathai: Thank you for coming here and helping us! [22:25] We don't get that a lot with websites owners, so great you're here! [22:25] Would you be able to be online in the weekend to go over the process of saving your website? [22:25] Kazzy: it’s been queued by archivebot [22:26] arkiver: i did but then got disconnected. weekends are difficult for me. what’s another way to go about it? [22:26] would you have some tomorrow? what timezone are you in? [22:28] Pacific Time [22:28] arkiver: I have time on Friday [22:30] yeah friday is fine [22:31] I'll be online all day, please leave me a message if you have the time on friday [22:31] then we can start saving Trovebox. [22:32] And of course you can always leave me a PM. [22:32] Have a good day/night all! [22:32] you too [22:32] * arkiver is afk now [22:36] textfiles.com machine is back [22:36] Now synchronizing things, and I'll switch my IRC client back soon [22:36] And also set up some automatic backups for various bits [22:37] \o/ [22:46] great news SketchCow :) [22:59] *** SN4T14 has quit IRC (Ping timeout: 335 seconds) [23:00] SketchCow: how bad did it end up? [23:01] A couple sites have issues [23:01] I can track them down. [23:01] imho RAID 1 with monitoring is useful *in addition to* backup [23:01] People have copies. [23:01] saved my ass several times [23:01] Not doing raid [23:01] noooo [23:01] But that's me [23:01] I mean, I definitely want "RAID" in terms of "have a second copy down the way" [23:02] That's happening. [23:02] Already was happening, just happening more formally [23:02] isn't that what RAID 1 technically is? or do you mean a backup rather? [23:04] Well, the problem with "RAID" is that as a concept, it's a very broad concept [23:04] And depending on what people are about, they can claim anything is a raid [23:04] I'm just saying, I will have a more direct backup of my materials going forward. [23:04] RAID1 is nice if you have some data, because you store it on two hard drives, so if one fails, you still have the other. [23:05] RAID1 is no good, for example, [23:05] if your machine corrupts the data itself [23:05] = both copies lost [23:05] yeah, so it's needed in addition to backups. [23:05] Or if a large elephant destroys your datacenter [23:05] = total loss, also [23:05] So it's Redundancy, not Backup [23:05] or if both drives fail at the same time. [23:05] still good [23:05] but not everything you need [23:06] * antomatic nods [23:06] I generally use mdraid set up to RAID1, though I'd use RAID5 or 6 if I had more drives. RAID0 is only useful if you need maximum performance (doing video editing for example) [23:06] together with active monitoring. [23:06] Which is surprisingly likely given that RAID arrays tend to get filled with almost-the-exact-same-dfives-from-the-exact-same-batch [23:07] with the exact-same-manufacturing-faults-and-lifespans [23:07] yep [23:10] *** SN4T14 has joined #archiveteam [23:12] *** mistym has quit IRC (Remote host closed the connection) [23:13] *** Start has joined #archiveteam [23:13] *** mistym has joined #archiveteam [23:14] chfoo: is archive.fart.website down? [23:15] looking fine for me [23:15] works here [23:18] not working for me [23:18] weird [23:18] oh hey it works now [23:22] *** Ymgve has quit IRC () [23:24] i don't see any signs of problems with it [23:32] *** xk_id has joined #archiveteam [23:32] *** xk_id_ has quit IRC (Read error: Connection reset by peer) [23:51] *** SN4T14 has quit IRC (Ping timeout: 369 seconds) [23:53] while we all seem to be sort of here, with the press event today, I think now's a good time more than ever to start looking in to saving the win10 feedback forum [23:54] (and by extension the insider hub) [23:54] http://archiveteam.org/index.php?title=Windows_Technical_Preview [23:54] *** xk_id has quit IRC (Remote host closed the connection)