[00:07] *** JesseW has joined #archiveteam-bs [00:29] *** JesseW has quit IRC (Ping timeout: 370 seconds) [00:30] *** tomwsmf-a has quit IRC (Read error: Operation timed out) [01:15] *** VADemon has quit IRC (Read error: Connection reset by peer) [01:35] *** tomwsmf-a has joined #archiveteam-bs [02:26] *** JesseW has joined #archiveteam-bs [02:51] *** BlueMaxim has quit IRC (Quit: Leaving) [02:54] *** ndiddy has quit IRC (Read error: Connection reset by peer) [03:34] added the two WARC tools mentioned earlier [03:55] *** dan- has quit IRC (Ping timeout: 260 seconds) [04:10] *** dan- has joined #archiveteam-bs [04:23] *** toad1 has quit IRC (Read error: Operation timed out) [04:37] *** hawc145 has joined #archiveteam-bs [04:41] *** Sk1d has quit IRC (Ping timeout: 250 seconds) [04:43] *** HCross has quit IRC (Ping timeout: 370 seconds) [04:48] *** Sk1d has joined #archiveteam-bs [04:52] *** BlueMaxim has joined #archiveteam-bs [05:17] *** dashcloud has quit IRC (Read error: Operation timed out) [05:21] *** dashcloud has joined #archiveteam-bs [06:01] *** Honno has joined #archiveteam-bs [06:39] *** BlueMaxim has quit IRC (Read error: Operation timed out) [06:41] *** BlueMaxim has joined #archiveteam-bs [07:15] *** Honno has quit IRC (Read error: Operation timed out) [07:37] *** JesseW has quit IRC (Ping timeout: 370 seconds) [08:02] *** metalcamp has joined #archiveteam-bs [08:06] *** schbirid has joined #archiveteam-bs [08:15] *** Madthias has joined #archiveteam-bs [10:07] *** BlueMaxim has quit IRC (Quit: Leaving) [10:08] *** tomwsmf-a has quit IRC (Read error: Operation timed out) [12:34] *** Honno has joined #archiveteam-bs [12:35] *** Honno_ has joined #archiveteam-bs [12:41] *** Honno has quit IRC (Read error: Operation timed out) [13:13] *** powerKitt has joined #archiveteam-bs [13:28] *** powerKitt has quit IRC () [13:54] *** Madthias has quit IRC () [15:05] *** hawc145 is now known as HCross [15:23] I am not sure that there is a "typical" use for desktop drives, though my brother probably comes close (Letting Windows do it's power management thing on a mostly gaming machine) [15:25] I have taken to putting semi-rigid packing foam under some of my machines (to absorb local vibration). I run my drives way past warranty though. [16:11] *** zgrant has joined #archiveteam-bs [16:12] *** zgrant has quit IRC (Client Quit) [16:21] *** JesseW has joined #archiveteam-bs [16:35] Best way to go about getting http://www.bbc.co.uk/iwonder - thinking a healthy application of youtube-dl and phantomjs [16:38] HCross does normal !ao work :P? [16:38] maybe !a --phatomjs [16:38] luckcolor, need to get all the articles behind it etc etc [16:38] its part of the BBC's attempt at going full yahoo [16:39] yeah [16:39] i will have a run i suppose [16:39] shall bring another instance of archivebot up [16:39] was gonna throw it in my grab-site instance [16:39] ok that works too [16:40] :P [16:40] will checkout [16:43] *** JesseW has quit IRC (Ping timeout: 370 seconds) [16:44] yesh it does i can read the webpage [16:44] the only thing tha probably doesn't work of bcc is the streams [16:45] ill see what grab-site makes of it all [16:45] does grabsite support phatomjs and youtube-dl? [16:47] *** metalcamp has quit IRC (Quit: Bye) [16:47] yeah, you can pass arguments to wpull [16:48] ok didn't know [16:51] HCross it has already finished [16:51] yeah, for some reason [16:52] *** metalcamp has joined #archiveteam-bs [16:52] umm luckcolor - it crashed my server [17:02] *** metal_cam has joined #archiveteam-bs [17:05] *** metalcamp has quit IRC (Ping timeout: 244 seconds) [17:12] *** metal_cam has quit IRC (Quit: Bye) [17:16] *** goekesmi_ has joined #archiveteam-bs [17:19] *** goekesmi has quit IRC (Ping timeout: 260 seconds) [17:30] The whole thing? [17:32] I have had a flaky on-board LAN flake out when receiving "lots" of data. Lukily for me, I was using a serial cable to control the computer remotely, so knew exactly why it was not working. [17:35] (It popped up and error on the console) I fixed it by installing a RTL8139 card in the machine. [17:36] nah, seems SSH fell over [17:38] *** schbirid has quit IRC (Read error: Connection refused) [17:44] *** superkuh has quit IRC (Remote host closed the connection) [17:58] *** schbirid has joined #archiveteam-bs [18:30] *** metalcamp has joined #archiveteam-bs [18:38] *** superkuh has joined #archiveteam-bs [18:59] *** bzc6p has joined #archiveteam-bs [18:59] *** swebb sets mode: +o bzc6p [19:00] *** tomwsmf-a has joined #archiveteam-bs [19:04] IA has a classy 502 error page: http://kepfeltoltes.hu/160518/IA_busy_www.kepfeltoltes.hu_.png [19:04] Underlying GIF: https://ia601500.us.archive.org/about/busy.gif [19:11] interesting, and it's mirrored to every server [19:11] e.g. https://ia701505.us.archive.org/about/busy.gif [19:13] *** tomwsmf-a has quit IRC (Read error: Operation timed out) [19:18] Indeed, the outgoing link seems to be quite saturated ATM. [19:21] *** logchfoo4 starts logging #archiveteam-bs at Wed May 18 19:21:33 2016 [19:21] *** logchfoo4 has joined #archiveteam-bs [19:41] *** logchfoo1 starts logging #archiveteam-bs at Wed May 18 19:41:02 2016 [19:41] *** logchfoo1 has joined #archiveteam-bs [19:58] test [19:58] hi logchfoo1 [19:58] hi, chfoo [19:58] ok, i got to replace this stupid chat logger [19:59] i think someone was messing around with it [20:01] chfool [20:02] i'm really tired of commenting out code and trying to patch it [20:03] the html output is very nice; don't change that part if possible [20:03] it's written in bash and i had to patch out a directory traversal vulnerability [20:04] ooh yikes [20:09] ick [20:09] the html output is written in bash? [20:10] sure, why not [20:10] fast.com is giving me numbers all over hte place, lol [20:12] xmc: directory traversal vulnerabilities, for one [20:12] :P [20:14] *** tomwsmf-a has joined #archiveteam-bs [20:16] *** hook54321 has joined #archiveteam-bs [20:16] how about this bot for logging? [20:17] do a /ctcp Chii version [20:18] the only thing is that it phones home to my friend's VPS if you let it scrape the of a URL, however you can disable that functionality [20:18] <ranma> titlesnarfer.js [20:37] *** fie has joined #archiveteam-bs [20:38] *** fie_ has quit IRC (Quit: Leaving) [20:57] *** remsen has quit IRC (ZNC 1.6.2 - http://znc.in) [20:57] *** remsen has joined #archiveteam-bs [21:09] *** JW_work has quit IRC (Read error: Operation timed out) [21:11] *** bzc6p has left [21:17] *** JW_work has joined #archiveteam-bs [21:21] <Frogging> real programmers write their IRC bots in C [21:30] <MrRadar> And subsequently get owned by a buffer overflow error :P [21:30] <MrRadar> C has its places but writing code that deals primarily with networking and strings is not one of them [21:30] <hook54321> Is their a way to search everything the archive team has archived? [21:32] <MrRadar> You can look on the wiki or browse the IA collection: https://archive.org/details/archiveteam [21:32] <MrRadar> This IA search query might also be useful: https://archive.org/search.php?query=subject%3Aarchiveteam [21:33] *** tomwsmf-a has quit IRC (Read error: Operation timed out) [21:34] <MrRadar> There's also the archivebot viewer at http://archive.fart.website/archivebot/viewer/ [21:34] <MrRadar> That will show you which URLs people have requested be archived through in #archivebot [21:36] <hook54321> What about the actual content? [21:39] *** ndiddy has joined #archiveteam-bs [21:40] <JW_work> hook54321: Without downloading it locally, no, I don't think there is full-text search available. [21:40] <godane> so i found out that RTE Player keeps there f4m files [21:40] *** fie has quit IRC (Quit: Leaving) [21:41] <MrRadar> hook54321: There's no way to search through the content since it is literally hundreds of terabytes worth of data [21:41] <godane> example: http://cdn.rasset.ie/hds-vod/2015/1127/20151127_rteone-latelate-thelatelat_cl10498592_10499420_260_drm_/manifest.f4m [21:41] <JW_work> godane: what is that an example of? [21:41] <JW_work> ah, missed your comment above [21:42] <JW_work> MrRadar: I wouldn't say "no way" — if you wanted to pay enough to google or amazon, I'm pretty sure they could run a search over all of it. [21:43] <MrRadar> It would probably take quite a while to index through [21:43] <MrRadar> Since you'd have to pull everything from the IA [21:44] <hook54321> Isn't their going to eventually going to be a search engine for the wayback machine? [21:44] <JW_work> maybe [21:44] *** schbirid has quit IRC (Quit: Leaving) [21:44] <hook54321> I thought I saw an article somewhere about it [21:44] <JW_work> MrRadar: yeah, I wouldn't say it would be *quick* [21:45] <MrRadar> That would be nice, but it would be on the scale of creating a full commercial search engine with the added dimension of time [21:45] <MrRadar> (re a search engine) [21:46] <hook54321> Let me see if I can find the article, just a sec. [21:49] <hook54321> http://gizmodo.com/the-wayback-machine-is-getting-a-search-engine-1739099940 [21:49] <MrRadar> Huh, I hadn't seen that [21:50] <hook54321> Do you think they have started testing it yet? [21:51] <MrRadar> If they are I haven't heard of it [21:51] <MrRadar> What are you trying to find? [21:52] <hook54321> Eh. I guess not anything specific. But their are probably lots of things I would search for if it were already available. [21:52] <hook54321> How many admins does Archive.org have? [21:57] <MrRadar> Probably many of the people listed on this page are: https://archive.org/about/bios.php [21:57] <MrRadar> AFAIK the only one who hangs out here is Jason Scott (aka SketchCow) [21:58] <SketchCow> <3 [22:00] <hook54321> SketchCow, are all of the Archive.org admins official staff that are local or are their some volunteers? [22:00] <SketchCow> Mostly [22:00] <SketchCow> Some people are admin of their collections. [22:03] <hook54321> Ah, ok. Do you know if Archive.org has access to kinda rare German psychology research papers from World War 2? [22:06] <SketchCow> No idea. [22:07] <godane> SketchCow: you may get some old RTE News [22:07] <hook54321> RTE? [22:08] <godane> RTE News [22:08] <MrRadar> hook54321: If they were in a book the IA scanned they have a copy via that book scan. Also if someone had posted themon the web it's possible they may have been archived via the Wayback Machine (though that would require knowing a URL) [22:08] <MrRadar> Otherwise someone may have uploaded them directly, but that's even less likely than the other two [22:09] <godane> i will say the rtmpdump of the stream is weird [22:09] <godane> the video file is 35 minutes when i should be 29 minutes [22:18] <hook54321> The specific paper I'm looking for is: "Das psychisch abnorme Kind" (Written by Hans Asperger in 1938) [22:21] <hook54321> It's been referenced in books, but I haven't been able to find a copy of it. I emailed one of the authors that vaguely talked about it in one of their books, turns out they just mentioned someone else referencing it. So I emailed that person who told me to email this other person. Then I emailed that person but never got a response. 😐 [22:22] <MrRadar> Have you tried searching Google Books? [22:22] <hook54321> For the original article? [22:22] <MrRadar> Yeah; though you're probably more likely to find citations [22:22] <hook54321> I think I have, but I'll try again. [22:23] <HCross> Just took a look at Google Books, seems to just be references [22:23] <hook54321> So it obviously exists somewhere if people are referencing it, right? [22:25] <hook54321> Could someone be trying to keep the article for just themselves and a few other people?... [22:26] *** tomwsmf-a has joined #archiveteam-bs [22:27] <MrRadar> Perhaps. Do you know anything about its original publication? Like whether it was published in an academic journal? [22:29] <hook54321> http://link.springer.com/referenceworkentry/10.1007%2F978-1-4419-1698-3_1847 [22:30] <hook54321> "His first published paper in this area was not the celebrated 1944 paper but “Das psychisch abnorme Kind,” which appeared in the Wiener Klinischen Wochenzeitschrift in 1938 (Asperger, 1938). This was th ..." [22:30] <MrRadar> That's a good lead [22:31] <MrRadar> If you have access to a university library you should search for that journal in academic databases [22:31] <MrRadar> Though it's so old it may not appear in them [22:35] <hook54321> Like JSTOR and stuff like that? [22:35] <MrRadar> Yeah [22:35] <MrRadar> (Though I am by no means an expert at using those kind of systems; I last touched them when I graduated 5 years ago) [22:40] <hook54321> I have access to some of them, would they have the name of the article or would I need to find the specific issue it was published in? [22:41] <MrRadar> Try searching for the article and if that doesn't work for the journal by name I guess? [22:41] *** metalcamp has quit IRC (Ping timeout: 244 seconds) [22:41] <MrRadar> Based on the references section of that article page you linked earlier it looks like the article you're looking for is in issue(? volume?) 51 [22:42] <hook54321> Is their a way to find out who owned that academic journal? [22:43] <MrRadar> I'm not sure [22:47] <MrRadar> I'm going AFK for a while [22:49] <JW_work> hook54321: I presume you've seen this: http://wrongplanet.net/forums/viewtopic.php?t=236213 [22:51] <hook54321> Yeah, saw that. I even messaged the person that claimed to have a copy of it. She said it was in her attic somewhere and that she isn't able to get it from there. [22:53] <hook54321> Hans Aspergers' daughter is still alive, but I haven't been able to find her email address anywhere... [22:58] <hook54321> I found her address though, do any of you live or know someone who lives in Switzerland? [22:59] <JW_work> I'd focus more on the journal first [22:59] <hook54321> She might have a copy of it though, right? [23:15] *** tomwsmf-a has quit IRC (Read error: Operation timed out) [23:15] *** tomwsmf-a has joined #archiveteam-bs