[00:03] huh, apparently there are still live MUD/MUSH/MUCK servers [00:03] oh yeah [00:03] I wonder if these could be archived [00:03] Gemstone <3 [00:03] somehow [00:03] I just telneted to one [00:05] Yay, band! [00:13] I think the closest one could get to "archiving" a MUD is by doing something like the following [00:13] detect the server it uses, so you know what commands are compatible and general behavior [00:14] walk everywhere, look at everything, try out everything that can provide description of some kind [00:14] maybe try actions next? [00:14] idk I've never played one [00:14] lol [00:16] starting to upload StreamRoot DT fileshttps://archive.org/details/e_history_StreamRoot_DT_0001_1956_01 [00:17] only 161 files out of what should be 167 [00:18] godane: waw, nice [00:21] Any specific do's and dont's when archiving stuff from servers in North Korea? [00:22] its south korean stuff [00:22] stuff like the KCNA [00:22] they have stuff still in dial up video mms streams [00:23] and realmedia if start going back to 1999 to 2000 [00:23] Obv I wont go to agressive on them, but was wondering if there is anything I need to take especially good note of? [00:24] i don't know about kcna stuff [00:24] i know mostly about kbs [01:05] just donated to the telethon [01:13] *** asdf has joined #archiveteam-bs [01:22] *** aaaaaaaaa has joined #archiveteam-bs [01:22] *** swebb sets mode: +o aaaaaaaaa [02:04] *** parker_ has quit IRC (Remote host closed the connection) [02:05] *** parker_ has joined #archiveteam-bs [02:38] *** parker_ has quit IRC (Remote host closed the connection) [02:38] *** parker_ has joined #archiveteam-bs [02:43] *** parker_ has quit IRC (Remote host closed the connection) [02:44] *** parker_ has joined #archiveteam-bs [02:46] *** nd1ddy has quit IRC (Read error: Connection reset by peer) [02:48] *** parker_ has quit IRC (Remote host closed the connection) [02:49] *** parker_ has joined #archiveteam-bs [02:59] *** ndiddy has joined #archiveteam-bs [03:04] *** asdf has quit IRC (Ping timeout: 378 seconds) [03:44] *** godane has quit IRC (Ping timeout: 311 seconds) [03:46] *** godane has joined #archiveteam-bs [03:50] *** DDR has quit IRC (Remote host closed the connection) [03:55] *** godane has quit IRC (Leaving.) [03:55] *** godane has joined #archiveteam-bs [04:28] *** aaaaaaaaa has quit IRC (Leaving) [04:35] so my personal archiving of byte magazine may have helped in finding a "lost" issue [04:35] turns out some one uploaded a zip of byte magazine for 1994-05 [04:39] *** ndiddy has quit IRC (Read error: Connection reset by peer) [04:42] godane: link? [04:47] https://archive.org/details/199405BYTE1905ComponentWare [05:01] I just added an entry for the telethon to MusicBrainz: https://musicbrainz.org/recording/890b435b-afc0-4389-898e-32733a7103c7 [06:30] *** asdf has joined #archiveteam-bs [07:39] *** vitzli has joined #archiveteam-bs [08:03] *** BlueMaxim has quit IRC (Read error: Connection reset by peer) [08:11] *** VADemon has quit IRC (left4dead) [08:19] *** Boppen has quit IRC (Read error: Connection reset by peer) [08:19] *** Boppen has joined #archiveteam-bs [08:37] *** JesseW has quit IRC (Leaving.) [08:59] I love this comment https://www.jwz.org/blog/2015/12/the-internet-archive-telethon-is-live-now/#comment-165199 [08:59] "don't give them money to fix the problem I am talking about" [09:18] *** schbirid has joined #archiveteam-bs [09:25] *** asdf has quit IRC (Ping timeout: 252 seconds) [14:16] *** Muad-Dib has joined #archiveteam-bs [14:32] how can i wget this site http://electriciantraining.tpub.com/ ? wget -m -k -nv -c -e robots=off doesn't seem to grab everything [15:08] *** signius has quit IRC (Ping timeout: 364 seconds) [15:15] *** VADemon has joined #archiveteam-bs [15:21] *** CatButts has joined #archiveteam-bs [15:21] the cat butt will rise [15:22] * CatButts unpacks contents of memory [15:22] hie fie [15:22] *hi [15:23] yo [15:23] I remembering pestering you about some aussie TV show [15:23] weeks ago [15:24] some private tracker forum post [15:24] still nothing [15:24] ah [15:24] as for Nevashut game, tough luck [15:24] I have low hopes for that [15:25] I put a small bounty on it, if I had a better ratio i would put more to get it more attention [15:25] also, I have shitloads of CDs [15:25] and they sit in my cabinet and rot [15:25] hahahahaha [15:25] for years they have [15:25] of what? [15:26] CDs that come with software and gaming magazines [15:26] http://dberkholz.com/2006/09/06/gentoo-lots-of-new-gui-config-tools/ [15:26] failz [15:27] https://s2.wp.com/wp-content/blog-plugins/domains/expiring-domain-alert/images/automattobot-panic-level-high.gif [15:27] beep beep boop boop [15:27] Uh oh! Your blog’s domain dberkholz.com expired 6 days ago! [15:27] any idea what to do with those CDs? [15:28] CatButts: where are you physically? [15:28] country-wise [15:28] Romania [15:28] I have no intention to ship stuff [15:28] at best, I will rip ISOs and scan covers/CD [15:28] CatButts: on Linux? [15:29] I am on windux [15:29] weendoze [15:29] >iso [15:29] right, so ISO alone won't be enough because there may be multi-track CDs, but I don't know of any batch imaging tools for Windows that will produce archival-quality copies [15:29] I will rip one by one manually [15:30] but for this, I will need to find person who is interested in said CDs [15:30] and deals with archival of such a thing [15:30] CatButts: try SketchCo1 :P but he's at the telethon right now [15:31] CatButts: anyhow, if you want to image them yourself, you want a bin/cue, as generated by cdrdao (not sure what tools exist for this on Windows) [15:31] cover and label scans at at least 300DPI, preferably 600DPI [15:31] yeah, I've been told ISO is not exactly ideal format [15:31] DVDs can be safely imaged as ISOs in all cases, as they don't have multi-track features [15:31] (CDs do) [15:31] yeah [15:31] ISO will only work for single-track disks [15:32] it's not always easy to identify whether a disk is single-track or not [15:32] CatButts: fwiw, I wrote this for Linux: https://github.com/joepie91/image-disc/blob/master/image.py - perhaps it may be useful as a reference [15:32] auto-detects disc type and images to appropriate format [15:34] * CatButts vaguely remembers aussie he used to talk to on IRC with [15:34] he might be the answer to my aussie show connundrum [15:36] CatButts: fwiw, this is where your magazine disk uploads will likely end up: https://archive.org/details/coverdiscs [15:36] also, there's also issue of malware with some shareware disks [15:37] lol DOS malware [15:37] wouldn't worry about it too much tbh [15:37] may set off the virus scanner [15:37] if it does, email IA and let them know [15:38] eh, I'm sure the uploads are inspected [15:40] * CatButts rolls on his lazy arse [15:42] hmmm [15:42] just realised the telephon is one [15:42] telethon, is on. even [15:42] or rather, was. [15:45] hmmm [15:46] SketchCo1 be dancin [15:46] yah [15:46] told mah homies to donate [15:46] for what it's worth :/ [15:47] CatButts: there's an automated virus scanner [15:47] CatButts: that might dark items [15:47] because there've been problems with people using IA for malware distribution before [15:47] so anything that gets darked that way will likely not get further attention from IA unless you notify them [15:49] got it [15:50] do I get notified if something I upload gets dorked? [15:51] dorked? [15:51] darked [15:51] I maed a pun [15:51] I go to punitenciary [15:52] dorke:O [15:53] *video of fanatical religious woman yelling DARK SIDE, but having it sound like DORK instead* [15:53] hehe [15:54] it's the one where people come to house to give her a monetary prize for something and she goes nut and rips their check and argues [15:54] in front of her familly [15:54] or a daughter at least [15:57] *** alberto has joined #archiveteam-bs [15:58] no idea what this is but ok :D [15:58] then I will seek it [15:59] https://www.youtube.com/watch?v=bOpva_iit-8 [15:59] probably this [16:00] *** vitzli has quit IRC (Quit: Leaving) [16:00] yep [16:03] I should get off my arse and backup my favourite porns [16:04] well [16:04] not sure wtf just happened [16:04] but something hooked behind something else and then half the contents of my desk flew off [16:04] somehow [16:04] ... my external HDD appears to have survived [16:06] doesn't sound like a good place to store data [16:09] I've heard stories of low reliability regarding external HDDs [16:13] I have internal SATA HDDs I use externally [16:13] though, I have to power off PC to use [16:14] which I don't mind, because I do it once a month [16:14] for partition backup [16:19] joepie91: :/ [16:19] glad the disk survived [16:20] i had a poer blip kill 1 disk in a raid [16:20] fine, fine, got backups, annoying [16:20] start restoring from backups [16:20] second disk had also fried but not quite enough to kill it directly [16:20] only discover after 2 days of restoring XD [16:38] CatButts: yeah, internal HDD + external enclosure [16:38] I don't trust ready-made external HDDs anymore [16:39] I have made a habit of looping my external HDD power cable behind something [16:39] which means it powers off before it hits the ground [16:39] I suspect that that+ carpet on the floor saved irt [16:39] it* [16:41] hmmm nice [16:41] I once dropped a CRT monitor from about 5ft, watched it bounce, still powered on [16:41] why aren't electronics like that anymore :D [16:41] Smiley: well, HDDs never were [16:41] :P [16:42] yah true [16:42] Smiley: and um, my Samsung TFT has been operating with a deep scratch in the screen for... 4 years now? [16:42] as in [16:42] probably several millimeter deep scratch [16:43] Smiley: corner of one of these landed on it: 1288171765_c8fa075e07_logo [16:43] er [16:43] http://www.bouwbinder.nl/typo3temp/GB/690-1-1288171765_c8fa075e07_logo-bouwbinder_773648e9d1_d8a00830a8.jpg [16:44] aside from a brightly lit scratch in the middle of my screen, ~3cm high, there's been absolutely zero quality degradation through the years [16:44] so, robust hardware does still exist :p [17:03] robutts [17:22] heh [17:22] m samsung has a nice scratcch too [17:22] not quite that dewep [17:23] daughter hitting keyboard [17:25] http://thumbs.ebaystatic.com/d/l225/m/mhq6FtMJoG67WzdlPXJ8OMw.jpg [17:25] fell on him [17:25] sry for bad pic :/ [17:51] *** JesseW has joined #archiveteam-bs [17:53] *** CatButts has quit IRC (Here is my journey's end, here is my butt. <k!15b8>) [17:53] *** ndiddy has joined #archiveteam-bs [17:59] *** signius has joined #archiveteam-bs [18:04] *** RichardG has quit IRC (Read error: Connection reset by peer) [18:10] *** RichardG has joined #archiveteam-bs [18:33] another lost byte magazine found not in collection: https://archive.org/details/198610ByteMagazineVol1111InsideTheIBMPC [18:36] this guy saved lots of mid to late 1990s byte magazine: https://archive.org/details/@epobirs [18:47] *** Amitari has joined #archiveteam-bs [18:47] Hey, can anyone here help me with wget? [18:48] *** SN4T14 has quit IRC (Read error: Operation timed out) [18:48] *** SN4T14 has joined #archiveteam-bs [18:49] I'm not familiar enough with wget, sadly. [18:51] Amitari: depends what u need [18:52] I'm mirroring a PhpBB forum, and I read on the wiki that I should save some cookies first. When I try however, I get the message "Remote file exists and could contain further links,but recursion is disabled -- not retrieving." [18:52] but recursion is disabled [18:52] thewres your clue [18:52] I didn't find any info about this on the web though... [18:52] https://www.google.co.uk/search?q=wget+enable+recursion&ie=utf-8&oe=utf-8&gws_rd=cr&ei=_fh2VvmQMsaKPsfvuMgB [18:53] no? [18:53] Oh... [18:53] Sorry for bothering you then... [18:54] tis ok to ask :D [18:54] u get what u need?@ [18:54] -r [18:54] wget .... -r [18:55] Wait, no... [18:55] Do you know how to download PhpBB-forums? [18:55] The example command on the wiki doesn't seem to work for some reason. [18:56] what does the command on the wiki do? (and link please to the wiki page) [18:56] I haven't archived anything personally for long time [18:56] busy with so many things :/ [18:56] http://www.archiveteam.org/index.php?title=PhpBB [18:56] Let me get the results for you on Pastebin... [18:56] ty :) [18:56] now we are getting somewhere :) [18:57] Also I'm about to shower my daughter, so I'll be on and off [18:57] but stick around and someone will attempt to assist you anyway once we have the info we need :) [18:58] Smiley: Here. http://pastebin.com/GLEbVVJP [19:00] ah ok [19:01] So, know what to do? [19:01] I think so [19:01] I don't think you're using bash? [19:01] I don't know to be honest. [19:02] We'll see how things go [19:02] instead of $(date +...) [19:02] oops, sorry [19:02] just use 20151220 [19:02] wrong window [19:02] so http://sloyer.5forum.info20151220 [19:02] for the warcfile and log [19:02] Let me try... [19:02] wait nope [19:03] it is working, as it's crying about the log file o_O [19:03] run this: [19:03] touch http://sloyer.5forum.info20151220.log [19:03] have you tried the phpbb page in the wiki? [19:03] schbirid: this is from it... [19:03] ayyyyy [19:03] :D [19:04] Wait... [19:04] wget: http://sloyer.5forum.info20151220.log: No such file or directory [19:04] wget isnt' creating the log file [19:04] I tried the first thing, same result. [19:04] give me a second [19:05] that's... odd [19:05] man page deffo says it'll create a log file if one doesn't exist [19:05] ah [19:05] Amitari: can you pastebin the output of ls -lah ./ [19:06] stop [19:06] start from the beginning [19:06] "Remote file exists and could contain further links,but recursion is disabled -- not retrieving." [19:06] means nothing here [19:06] the first wget is ONLY to get a cookie [19:07] yesh I told him to turn on -r for that [19:07] oph wait [19:07] hmm :/ [19:07] * Smiley bows out [19:07] Daughter ready for shower [19:07] Uh, I'm not sure if I want to pastebin the outpus of ls on my /home directory... [19:07] this forum does not even need a cookie [19:07] oh then we don't need that command :D [19:07] >:D [19:08] So, anything I could do? [19:09] Amitari: i'd always suggest to use a specific directory to play around in, not your home :) i use a ramdisk [19:10] try simply the second wget line from http://www.archiveteam.org/index.php?title=PhpBB and remove the " --keep-session-cookies" bit if you want [19:10] err [19:10] Oh, alright. [19:10] the " --keep-session-cookies --load-cookies=COOKIEFILE" part [19:10] I'll try. [19:10] if you dont care about being nice, remove the "-w 1" part [19:11] I get the problems with the log file again.. [19:12] what name did you specify for the log file? you must not use the http:// in it [19:13] wget -m -a sloyer.5forum.info_$(date +%Y%m%d).log -e robots=off -nv --adjust-extension --convert-links --page-requisites --reject-regex='(\?p=|&p=|mode=reply|view=|search.php|/abuse\?)' --warc-file=sloyer.5forum.info_$(date +%Y%m%d) --warc-cdx http://sloyer.5forum.info/ [19:13] Ah, now it works! [19:13] :) [19:13] It still gives that error about timestamps, but it still works. [19:13] Thank you! [19:14] It looks like it doesn't archive the images though... [19:14] "WARC output does not work with timestamping, timestamping will be disabled."? no problem, that's just a warning, you can totally ignore it [19:14] Yeah, I know. [19:14] it just means that it will download everything [19:14] external images are not included, no :\ [19:15] Can I make it include them? [19:15] iirc i never succeeded but did the grab first, then grepped the files for external urls and grabbed them later :\ [19:16] wpull probably does it well: https://wpull.readthedocs.org/en/master/usage.html [19:16] omg [19:16] --span-hosts-allow linked-pages,page-requisites [19:16] chfoo i love you [19:17] So, if I add that to the command, it will download the external images? [19:18] if you use wpull(!) i guess this would do --span-hosts-allow page-requisites [19:19] hrm, wpull is broken for me :( [19:20] Nevermind, the text is plenty. [19:22] *** Amitari has quit IRC (Leaving) [19:27] i hope she was satisfied :\ [19:38] *** schbirid has quit IRC (Quit: Leaving) [19:50] *** brayden_ has quit IRC (Read error: Connection reset by peer) [19:50] *** brayden has joined #archiveteam-bs [19:50] *** swebb sets mode: +o brayden [20:11] *** schbirid has joined #archiveteam-bs [20:19] *** JesseW has quit IRC (Leaving.) [20:25] *** alberto has quit IRC (Ping timeout: 250 seconds) [20:25] *** JesseW has joined #archiveteam-bs [20:32] https://www.indiegogo.com/projects/skrolli-international-edition#/ [20:38] *** JesseW has quit IRC (Leaving.) [21:02] *** xXx_ndidd has joined #archiveteam-bs [21:08] *** Coderjoe has quit IRC (Read error: Connection reset by peer) [21:09] *** ndiddy has quit IRC (Read error: Operation timed out) [21:14] *** Coderjoe has joined #archiveteam-bs [21:27] https://www.youtube.com/watch?v=o-nJpaCXL0k [21:33] *** schbirid has quit IRC (Quit: Leaving) [21:56] *** JesseW has joined #archiveteam-bs [22:26] *** JesseW has quit IRC (Leaving.) [22:44] *** closure has joined #archiveteam-bs [22:44] *** midas sets mode: +o closure [23:05] *** err3 has joined #archiveteam-bs [23:05] hey [23:05] what happened with the telethon? [23:29] *** RichardG_ has joined #archiveteam-bs [23:29] *** RichardG has quit IRC (Read error: Connection reset by peer) [23:34] it finished