[00:05] *** Aranje has quit IRC (Ping timeout: 245 seconds)
[00:09] PurpleSym: awesome!
[00:10] looks great
[00:15] let's create a channel for imzy
[00:16] timmc: we'll get the images through the warrior project
[00:20] *** j08nY has quit IRC (Quit: Leaving)
[00:27] gonna be hard to think of a channel name :P imzy ain't an easy one
[00:31] gonezy?
[00:32] Does someone have all the comments yet for imzy? I've got posts and community groups that are listed publicly, but don't want to go regrabbing all the comments if someone's already done so and posted them somewhere...
[00:41] I think PurpleSym got them as part of his API grab: https://archive.org/details/imzy-api-2017-05-26
[00:41] Though not in a format useful for the IA Wayback Machine
[00:59] #donezy?
[01:05] MrRadar: thanks. Mine aren't either.
[01:09] *** HCross2 has quit IRC (Quit: Connection closed for inactivity)
[01:17] *** Sk1d has quit IRC (Ping timeout: 250 seconds)
[01:25] *** Sk1d has joined #archiveteam-bs
[03:13] voltagex: is that the edutainment title with the same name from the Learning Company?
[03:14] PurpleSym: If you're here, I think you might? be missing comments after depth 2 from your imzy grab. looks like api can pull more comments using a from_id param via the /communities/{community}/posts/{post}/comments endpoint. I could be wrong though...
[03:16] *** Odd0002 has quit IRC (Remote host closed the connection)
[03:21] dashcloud: yes
[03:23] I enjoyed that- I may have even finished that one
[03:44] PurpleSym: I stand corrected. My bad. I thought depth would have referred to a reply depth in total, and parent was referring to a replied-to comment.
[03:52] *** ndizzie has quit IRC ()
[03:56] *** REiN^ has quit IRC (Max SendQ exceeded)
[03:58] *** REiN^ has joined #archiveteam-bs
[03:58] *** ploop has quit IRC (Read error: Operation timed out)
[04:04] *** Odd0002 has joined #archiveteam-bs
[04:06] dashcloud: the DOS version is archived, the Windows 95 version is nowhere
[04:19] awesome then
[04:34] *** ploop has joined #archiveteam-bs
[05:10] *** robogoat has joined #archiveteam-bs
[05:15] *** Ravenloft has joined #archiveteam-bs
[05:39] *** Honno__ has quit IRC (Ping timeout: 370 seconds)
[05:53] *** kittymeow has joined #archiveteam-bs
[06:11] I've been reading about how CDs degrade within a few years and DVDs supposedly start around 10, I'm not sure what BluRays are at but I have a bunch of saved stuff from stuff that no longer exists anymore including videos of live events etc (I'm on ADSL broadband and it's not really practical to upload at 100KB/s until I can get fibre) and I've been reading that the best thing to do is
[06:11] keep stuff ...
[06:11] ... on hard disks and change them every few years? But USB external hard disks seem to have a tendency of dying more often than normal hard disks (and easy to get physically damaged by bumps or drops too) and I don't really like them anymore (it also seems a pain to keep backed up since it's so slow compared to something like RAID 1 where it's always done for you constantly), I have
[06:11] been looking ...
[06:11] ... into Network Attached Storage for RAID 1 but I don't really know what I'm doing but I saw that FreeNAS is recommended on a few forums, but it looks like even if I buy a prebuilt one I don't really know enough to make any kind of educated choice even if I should pay extra for read cache or write cache or not ...
[06:19] and I have stuff from a few closed down online games too
[06:23] I think I could afford to buy one NAS thing with the disks in raid 1 but there's such a markup on the price of the hard disks it seems really expensive compared to a normal computer but I guess that's more vulnerable to getting viruses and stuff and probably consumes more power too.. but I don't think I would be able to afford to change the hard disks every few years, if that really
[06:23] is ...
[06:23] ... recommended? (or do the technical stuff of setting them up to work with raid 1 by myself instead of paying for it, sending stuff for warranties with hard disks that aren't empty through couriers or postal services in bumpy trucks seems a bad idea)
[06:35] bmcginty: I *think* that endpoint returns all comments at once, because the number of comments returned matched the number of comments on the website for the posts I checked.
[07:32] *** SHODAN_UI has joined #archiveteam-bs
[07:32] *** n00b228 has joined #archiveteam-bs
[07:36] *** n00b228 has quit IRC (Client Quit)
[08:24] *** HCross2 has joined #archiveteam-bs
[08:55] *** j08nY has joined #archiveteam-bs
[09:06] *** Honno has joined #archiveteam-bs
[09:40] *** RichardG has quit IRC (Read error: Operation timed out)
[11:08] *** zhongfu has quit IRC (Ping timeout: 260 seconds)
[11:08] *** zhongfu has joined #archiveteam-bs
[11:50] *** icedice has joined #archiveteam-bs
[12:09] #whimzy?
[12:10] #flimzy?
[12:13] was serious questions :'(
[12:25] *** icedice has quit IRC (Quit: Leaving)
[12:47] *** BlueMaxim has quit IRC (Read error: Operation timed out)
[12:49] kittymeow: It might help to give a very concise summary of that wall of text. :-)
[13:36] *** Ravenloft has quit IRC (Ping timeout: 260 seconds)
[14:04] *** RichardG has joined #archiveteam-bs
[14:04] *** LastNinja has quit IRC (Remote host closed the connection)
[14:13] *** RichardG has quit IRC (Quit: Keyboard not found, press F1 to continue)
[14:14] "Someone just stumbled into the media problem"
[14:14] *** RichardG has joined #archiveteam-bs
[14:14] Toshiba announced a 128tb SSD by 2018, that slipped by me.
[14:14] Even if the deadline slips, that's extraordinary.
[14:14] *** trs80 has joined #archiveteam-bs
[14:14] Especially if it's not crazy, like sub $500
[14:16] Samsung says 16tb next year.
[14:19] I misread that and thought you said 128 GB
[14:20] so I was confused wondering why they're announcing such a thing
[14:24] samsung already sell enterprise 15TB SSDs (I assume we're talking ssds?)
[14:26] is that more capacity than platter-based drives?
[14:30] yeah
[14:30] 12TB drives are shipping, 14TB are due in a month or two
[14:32] the 15TB ssds are supposedly $10k a piece (no public pricing, if you have to ask etc.)
[14:33] from netapp they're $100k a pop http://www.netapp.com/us/media/pa-price-list.pdf
[14:36] Netapp is not a judgement of price
[14:36] Also I can't believe Netapp is in business
[14:36] still
[14:37] oh yeah, it was more an example of their order of magnitude markup. having managed several netapps, I can understand why, but the reasons are fewer now
[14:37] Way fewer.
[14:37] Almost zero since... what, 2006?
[14:38] The day the netapp person mentioned super secret mode to me
[14:38] ?
[14:39] In the command line of the old server (we had the silver ones, and later the plastic ones) there's a command that drops you down into the good mode
[14:39] the ones with actual commands
[14:39] as in the freebsd layer?
[14:40] Whatever
[14:40] Bear in mind I ended my life as a sysadmin in 2009, walking backwards, fingers up
[14:40] What a great day for the world
[14:41] https://pbs.twimg.com/media/DAuU0ScUAAA9EE_.jpg:large
[14:42] it's actually documented now, called the systemshell https://library.netapp.com/ecmdocs/ECMP1196817/html/system/node/systemshell.html
[14:43] hmm, I guess linus is a project manager now, so he is in the right square of that matrix
[15:46] *** Ravenloft has joined #archiveteam-bs
[16:29] SketchCow : damn if even you don't know then the world is screwed... well at least until stuff gets cheaper :o
[16:31] I loved reading this http://www.archiveteam.org/index.php?title=Posterous/Story
[16:33] I have been looking at this https://www.ixsystems.com/freenas-mini/ and it seems nice but the cost of the hard disks is waaaay higher than a hard disk actually costs, it's marked up so much
[16:33] and if you are meant to change hard disks every few years then it'd be way too expensive to maintain but maybe since it's raid 1 you could just wait until one of the mirrors breaks down with no bad effects
[16:44] *** antomatic has quit IRC (Ping timeout: 245 seconds)
[16:51] *** antomatic has joined #archiveteam-bs
[16:51] *** swebb sets mode: +o antomatic
[16:51] kittymeow: even if you are on ADSL, you could still upload your stuff to the Internet Archive, just *very very slowly*
[16:52] but presuming the stuff is not changing, even if it takes a couple of years to upload, it'll still get there
[17:06] kittymeow: do you have any spare hard drives you can put the data on and then mail it to IA?
[17:37] *** antomati_ has joined #archiveteam-bs
[17:37] *** swebb sets mode: +o antomati_
[17:38] *** antomatic has quit IRC (Ping timeout: 250 seconds)
[17:39] *** antomati_ is now known as antomatic
[17:45] *** antomati_ has joined #archiveteam-bs
[17:45] *** swebb sets mode: +o antomati_
[17:47] *** antomatic has quit IRC (Ping timeout: 250 seconds)
[17:47] *** antomati_ is now known as antomatic
[17:56] *** j08nY has quit IRC (Read error: Operation timed out)
[18:28] *** Ravenloft has quit IRC (Read error: Operation timed out)
[18:28] *** j08nY has joined #archiveteam-bs
[18:45] oh i didn't know you can mail stuff to them that's pretty neat, yeah I think for a lot of people it's easier to use mail by a pigeon than upload on ADSL broadband :D email dvds and blurays or hard disks even (but hard disks are more expensive if it's any reasonable size) hmm maybe flash pens, I never really looked into ultra cheap disposable stuff before
[18:47] the problem with uploading stuff slowly with big files is it uses up all your bandwidth unless you use something like NetBalancer but it's not cheap.... I remember I left computer on for a few days to upload 2gb youtube and when it finished it just error'd, so annoying
[18:49] arkiver: I have some modifications to the deduplicator that avoid loading records.json into memory; this saves some space, but there's still something in deduplicator.py that's eating up too much memory
[18:49] I'll PR the changes when I have access to them again (an hour or so)
[18:49] I wonder if it's payload.read()
[18:51] oh it probably is. warc expects payload to be a string :(
[18:51] that is so not going to fly
[19:20] *** HP has quit IRC (Remote host closed the connection)
[19:54] *** tfgbd_znc has joined #archiveteam-bs
[20:48] godane: how's the mr rogers archiving going?
[21:06] is http://zeppelin.xrtc.net/corp.xrtc.net/shilling.corp.xrtc.net/index.html even being maintained anymore?
[21:07] I'd love to see some more recent stats
[21:34] cc xmc ^
[22:05] ok for me
[22:05] but there are parts missing in my archive
[22:06] about 8 to 12 hours early on May 26
[23:03] *** RichardG_ has joined #archiveteam-bs
[23:05] *** SHODAN_UI has quit IRC (Remote host closed the connection)
[23:09] *** RichardG has quit IRC (Read error: Operation timed out)
[23:15] *** kristian_ has joined #archiveteam-bs
[23:45] *** wabu has quit IRC (Read error: Operation timed out)
[23:49] *** wabu has joined #archiveteam-bs