[00:04] *** kris33 has joined #archiveteam [00:21] *** kris33 has quit IRC (My MacBook has gone to sleep. ZZZzzz…) [00:51] *** clkw has joined #archiveteam [01:10] *** Somebody has joined #archiveteam [01:12] *** Stiletto has joined #archiveteam [01:47] *** Robbie has joined #archiveteam [01:49] *** clkw has quit IRC (Remote host closed the connection) [01:53] *** Robbie has quit IRC (Ping timeout: 268 seconds) [01:59] *** Stiletto has quit IRC (Ping timeout: 246 seconds) [02:27] *** Stiletto has joined #archiveteam [03:09] *** aschmitz has quit IRC (Quit: Leaving) [03:21] *** rrika has joined #archiveteam [03:39] *** notafed has quit IRC (Read error: Operation timed out) [03:55] *** db48x has quit IRC (Quit: new ssd) [04:19] *** wakest has joined #archiveteam [04:20] just heard about what.cd getting shut down. anyone know if it was spidered by anyone before hand? there was so so so much awesome metadata there. [04:20] I think not because it was a sudden shutdown [04:20] yeah, they got raided by french feds [04:22] I feel like there must be a copy somewhere though, like some startup or data scientist had to have seen the value of a dump at somepoint [04:22] even like someone working on spotify or genius [04:26] *** link343 has joined #archiveteam [04:26] WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD [04:26] link343: yahoosucks [04:27] thank you [04:27] *** link343 has quit IRC (Client Quit) [04:27] Heh, was going to ask what they wanted to edit, but I suppose the bot will tell us. [04:31] *** Asparagir has joined #archiveteam [04:37] *** nicolas17 has quit IRC (Quit: zzz) [04:41] *** Asparagir has quit IRC (Asparagir) [04:50] *** BlueMaxim has joined #archiveteam [04:52] *** Somebody has quit IRC (Ping timeout: 370 seconds) [04:53] *** wakest has quit IRC (Ping timeout: 268 seconds) [05:11] *** Famicoman has quit IRC (Ping timeout: 260 seconds) [05:11] *** Famicoma2 has joined #archiveteam [05:13] *** Aranje has quit IRC (Quit: Three sheets to the wind) [05:31] kremlin: they said the data was all destroyed though, which makes me wonder how they're sure of that [05:32] perhaps the servers needed to check into a master key storage server 'somewhere else' which they nuked [05:32] which would have been smart [05:32] else they're just blocks of encrypted data with a tiny bootloader [05:38] Lord_Nigh: what [05:39] on what.cd it specifically says 'all data was destroyed' [05:39] so i'm curious, if the servers were siezed, how they know that for certain [05:39] *** aschmitz has joined #archiveteam [05:40] oh, right [05:40] yeah the servers were seized [05:40] all at once [05:41] doesn't mean the data is destroyed. unless the site is lying, they must have had some system in place where the servers either self-destruct, or require an off-site key which can be destroyed itself [05:42] though evading that is as simple as keeping a machine powered during siezure, which is not so simple [06:00] *** Sk1d has quit IRC (Ping timeout: 250 seconds) [06:03] *** eprillios has joined #archiveteam [06:07] *** Sk1d has joined #archiveteam [06:07] *** kcaj has quit IRC (Ping timeout: 260 seconds) [06:09] *** kcaj has joined #archiveteam [06:17] according to an update on ars technica whoever runs the what.cd twitter is pretty sure it wasnt seized and they wiped it on their own [06:18] *** BlueMaxim has quit IRC (Quit: Leaving) [06:18] *** BlueMaxim has joined #archiveteam [06:19] *** bwn has quit IRC (Ping timeout: 244 seconds) [06:50] *** Start has joined #archiveteam [07:28] *** vitzli has joined #archiveteam [07:30] *** michaubia has joined #archiveteam [07:36] *** michaubia has quit IRC (Quit: Page closed) [07:39] *** WinterFox has joined #archiveteam [07:44] https://status.feral.io can someone archivebot this please, also FAO Feral Hosting customers [07:46] HCross2: Done [07:57] *** brayden__ has joined #archiveteam [07:57] *** swebb sets mode: +o brayden__ [08:00] *** bwn has joined #archiveteam [08:01] *** brayden_ has quit IRC (Read error: Operation timed out) [08:03] *** yipdw has quit IRC (Read error: Operation timed out) [08:04] *** yipdw has joined #archiveteam [08:29] *** hictooth has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.) [08:31] *** atomotic has joined #archiveteam [08:44] *** britbacku has joined #archiveteam [08:44] *** Froggypwn has quit IRC (Read error: Operation timed out) [08:44] *** Froggypwn has joined #archiveteam [08:55] *** zhongfu has quit IRC (Ping timeout: 260 seconds) [08:59] *** britbacku has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.) [09:01] *** zhongfu has joined #archiveteam [09:09] *** dashcloud has quit IRC (Read error: Connection reset by peer) [09:11] *** dashcloud has joined #archiveteam [09:17] *** C4K3 has joined #archiveteam [09:42] *** ravetcofx has quit IRC (Read error: Operation timed out) [10:13] *** fie has joined #archiveteam [10:22] *** Nycatelos has quit IRC (Quit: Party like it's December 31st, 1983) [10:45] *** bwn has quit IRC (Ping timeout: 244 seconds) [10:45] *** Nycatelos has joined #archiveteam [10:55] *** bwn has joined #archiveteam [11:03] *** tapedrive has joined #archiveteam [11:04] *** atomotic has quit IRC (Remote host closed the connection) [11:04] *** kristian_ has joined #archiveteam [11:08] *** atomotic has joined #archiveteam [11:31] *** BlueMaxim has quit IRC (Quit: Leaving) [11:51] *** kristian_ has quit IRC (Quit: Leaving) [11:54] https://twitter.com/whatcd/status/799435387634614272 [11:54] "Reports of our database being seized are not factual." [11:55] From the WCD Twitter account. [12:19] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [12:30] *** krazedkat has quit IRC (Ping timeout: 506 seconds) [12:37] *** krazedkat has joined #archiveteam [12:43] *** atomotic has joined #archiveteam [13:54] looks like a TON of shit went down [13:54] PTP, GGN, Deli.sh [13:54] reportedly [14:20] ggn is only temp though [14:20] whew, my site is still up [14:21] *** Stiletto has quit IRC (Read error: Connection reset by peer) [14:22] *** Stiletto has joined #archiveteam [15:03] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [15:06] *** sep332 has quit IRC (Konversation terminated!) [15:43] *** Aranje has joined #archiveteam [15:58] And people ask what to mass download... I think we have some suggestions :) https://www.reddit.com/r/trackers/comments/5dif9s/dont_download_it_later_there_may_not_be_a_later/da5p4hl/ [16:00] it would seem that waffles is down too? [16:00] supposedly since recently? [16:01] since 2/3 weeks apparently [16:01] *** sep332 has joined #archiveteam [16:09] *** atomotic has joined #archiveteam [16:18] *** godane has quit IRC (Quit: Leaving.) [16:18] *** godane has joined #archiveteam [16:33] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [16:35] *** Somebody has joined #archiveteam [16:38] *** Morbus has joined #archiveteam [16:42] waffles has been unstable recently afaict [16:48] *** RichardG has quit IRC (Ping timeout: 255 seconds) [16:54] *** Somebody has quit IRC (Ping timeout: 370 seconds) [17:00] *** WinterFox has quit IRC (Read error: Operation timed out) [17:12] *** RichardG has joined #archiveteam [17:13] so i have some good news and bad news [17:14] good news is i found Japanese station called JRT Radio [17:14] it has mms urls that go back to 2007 [17:15] maybe 2006 [17:15] anyways the bad news is that its real time mms streams [17:15] and each stream is about 2 hours long [17:15] mms://jrtcomet.jrt.co.jp:80/radio/okamotoy/wma/okamotoy070514.wma [17:16] there's software that will download streams like that, isn't there? [17:16] I'm sure I used something a long time ago to do something like that. [17:17] i'm using mplayer [17:17] mms streams were pretty easy to grab iirc. [17:18] its being grab it just takes a very long time [17:18] Do some research, godane. I'm sure there's something that can be done. [17:18] gmms.sourceforge.net possibly [17:18] other tools are available [17:20] I'm saying it's worth a day or two to check out [17:20] Producing .WAV files would be ideal. [17:20] the mms streams come out as asf [17:21] so it will just be asf convert to wav [17:21] based on my understanding of mms streams [17:21] also i have done tons of the streams in real time [17:21] Well, do a few, let me see them up [17:22] *** minus_ has left WeeChat 1.6 [17:23] hmm [17:27] https://web.archive.org/web/*/http://geocities.com/majormms/mms_client-0.0.3.tar.gz [17:28] looks like it just times out [17:31] Ah, Streambox VCR - that was what i used to use. Ages ago, though, back in the Windows 98 days. 'Streambox' today is a different company entirely. [17:32] loads of alternatives, by the look of it, from a quick google 'download mms streams' [17:48] *** vitzli has quit IRC (Quit: Leaving) [17:54] so some good news [17:55] it maybe only a weekly show [18:20] *** luckcolor has quit IRC (Remote host closed the connection) [18:21] *** luckcolor has joined #archiveteam [18:25] *** luckcolor has quit IRC (Read error: Connection reset by peer) [18:26] *** luckcolor has joined #archiveteam [18:28] *** luckcolor has quit IRC (Remote host closed the connection) [18:31] *** luckcolor has joined #archiveteam [18:37] *** ravetcofx has joined #archiveteam [18:39] *** Start_ has joined #archiveteam [18:39] *** Start has quit IRC (Read error: Connection reset by peer) [18:48] *** luckcolor has quit IRC (Read error: Connection reset by peer) [18:52] *** Swizzle has joined #archiveteam [18:56] *** luckcolor has joined #archiveteam [19:01] *** bwn has quit IRC (Ping timeout: 961 seconds) [19:14] *** Swizzle has quit IRC (Read error: Operation timed out) [19:40] *** kristian_ has joined #archiveteam [20:05] *** db48x has joined #archiveteam [20:10] *** yozlet has joined #archiveteam [20:24] *** Start_ is now known as Start [20:24] HI YOZ [20:25] HI SKETCHCOW [20:25] Let me make a channel [20:25] Thank you! [20:25] Gotta be named funny, give me a moment [20:25] what [20:25] OK, who wants to save a bunch of government sites? [20:26] #bureaucracyteam [20:27] I have chosen #CHEETOFLEE [20:27] niiice [20:31] are you a bad enough dude to save the president('s websites) [20:31] http://i0.kym-cdn.com/photos/images/newsfeed/000/033/081/7proxiesbaddude.jpg [20:31] *** maelstrom has joined #archiveteam [20:32] *** bwn has joined #archiveteam [20:50] *** sep332 has quit IRC (Read error: Operation timed out) [20:58] *** kniffy has quit IRC (Ping timeout: 260 seconds) [21:03] *** Swizzle has joined #archiveteam [21:11] *** kniffy has joined #archiveteam [21:16] *** sep332_ is now known as sep332 [21:28] *** Aranje has quit IRC (Ping timeout: 260 seconds) [21:28] *** Aranje has joined #archiveteam [21:40] *** whydomain has joined #archiveteam [21:42] Would someone be able to stick https://modsandmembersblog.wordpress.com/ into ArchiveBot? It's a nice anti-Yahoo site that would be a shame to lose. [21:55] *** Swizzle has quit IRC (Read error: Operation timed out) [22:04] whydomain: sure thing [22:05] it's running at http://dashboard.at.ninjawedding.org/3 [22:07] *** Yoshimura has quit IRC (Remote host closed the connection) [22:14] *** sep332 is now known as sep332_ [22:26] *** kris33 has joined #archiveteam [22:26] Thanks! [22:27] *** BlueMaxim has joined #archiveteam [22:31] *** kris33 has quit IRC (Textual IRC Client: www.textualapp.com) [22:39] *** yozlet has quit IRC (Quit: My MacBook Pro has gone to sleep. ZZZzzz…) [23:21] *** mathau has joined #archiveteam [23:23] Is there a way to append to a warc file? and is this desirable? or should I just make a new warc for each copy? [23:23] each copy of what? [23:23] warcs are a bunch of records stuck together, but there's usually not a reason to append them [23:23] if you have a jillion small warcs, we have a tool called 'megawarc' which will glom them together [23:25] Ok, thanks! [23:25] glad to help! [23:25] A copy of a site, I meant, say a blog I backup one a month [23:26] ah [23:26] it might make sense to just have a backup file per month [23:27] Ok, and if I want to upload to wayback machine on archive.org, I would upload them all? [23:27] Ha ha, boy, NOBODY is discussing vine now that we have an actual apocalyptic scenario [23:28] mathau: yeah, you can put one warc file per item, or you can put them all in the same item [23:28] i would suggest an item per backup [23:29] Ok! Thanks! That was all I needed to hear :) [23:29] :) [23:29] SketchCow: doesn't mean we're not on it :) [23:30] mathau: generally, items should be a collection of files that are "one thing", or "a single work". you can use tags to group items that are related, i.e. different copies of a work [23:30] mathau: also, warc files don't go into the wayback machine unless they've been blessed. you can email info@archive.org for more details there. [23:32] *** BartoCH has quit IRC (Ping timeout: 260 seconds) [23:34] *** mathau has quit IRC (Quit: Page closed) [23:37] *** maelstrom has quit IRC (Remote host closed the connection) [23:44] *** Swizzle has joined #archiveteam [23:46] *** bwn has quit IRC (Ping timeout: 244 seconds)