#archiveteam-bs 2016-05-18,Wed

↑back Search

Time Nickname Message
00:07 🔗 JesseW has joined #archiveteam-bs
00:29 🔗 JesseW has quit IRC (Ping timeout: 370 seconds)
00:30 🔗 tomwsmf-a has quit IRC (Read error: Operation timed out)
01:15 🔗 VADemon has quit IRC (Read error: Connection reset by peer)
01:35 🔗 tomwsmf-a has joined #archiveteam-bs
02:26 🔗 JesseW has joined #archiveteam-bs
02:51 🔗 BlueMaxim has quit IRC (Quit: Leaving)
02:54 🔗 ndiddy has quit IRC (Read error: Connection reset by peer)
03:34 🔗 JesseW added the two WARC tools mentioned earlier
03:55 🔗 dan- has quit IRC (Ping timeout: 260 seconds)
04:10 🔗 dan- has joined #archiveteam-bs
04:23 🔗 toad1 has quit IRC (Read error: Operation timed out)
04:37 🔗 hawc145 has joined #archiveteam-bs
04:41 🔗 Sk1d has quit IRC (Ping timeout: 250 seconds)
04:43 🔗 HCross has quit IRC (Ping timeout: 370 seconds)
04:48 🔗 Sk1d has joined #archiveteam-bs
04:52 🔗 BlueMaxim has joined #archiveteam-bs
05:17 🔗 dashcloud has quit IRC (Read error: Operation timed out)
05:21 🔗 dashcloud has joined #archiveteam-bs
06:01 🔗 Honno has joined #archiveteam-bs
06:39 🔗 BlueMaxim has quit IRC (Read error: Operation timed out)
06:41 🔗 BlueMaxim has joined #archiveteam-bs
07:15 🔗 Honno has quit IRC (Read error: Operation timed out)
07:37 🔗 JesseW has quit IRC (Ping timeout: 370 seconds)
08:02 🔗 metalcamp has joined #archiveteam-bs
08:06 🔗 schbirid has joined #archiveteam-bs
08:15 🔗 Madthias has joined #archiveteam-bs
10:07 🔗 BlueMaxim has quit IRC (Quit: Leaving)
10:08 🔗 tomwsmf-a has quit IRC (Read error: Operation timed out)
12:34 🔗 Honno has joined #archiveteam-bs
12:35 🔗 Honno_ has joined #archiveteam-bs
12:41 🔗 Honno has quit IRC (Read error: Operation timed out)
13:13 🔗 powerKitt has joined #archiveteam-bs
13:28 🔗 powerKitt has quit IRC ()
13:54 🔗 Madthias has quit IRC ()
15:05 🔗 hawc145 is now known as HCross
15:23 🔗 phillipsj I am not sure that there is a "typical" use for desktop drives, though my brother probably comes close (Letting Windows do it's power management thing on a mostly gaming machine)
15:25 🔗 phillipsj I have taken to putting semi-rigid packing foam under some of my machines (to absorb local vibration). I run my drives way past warranty though.
16:11 🔗 zgrant has joined #archiveteam-bs
16:12 🔗 zgrant has quit IRC (Client Quit)
16:21 🔗 JesseW has joined #archiveteam-bs
16:35 🔗 HCross Best way to go about getting http://www.bbc.co.uk/iwonder - thinking a healthy application of youtube-dl and phantomjs
16:38 🔗 luckcolor HCross does normal !ao work :P?
16:38 🔗 luckcolor maybe !a --phatomjs
16:38 🔗 HCross luckcolor, need to get all the articles behind it etc etc
16:38 🔗 HCross its part of the BBC's attempt at going full yahoo
16:39 🔗 luckcolor yeah
16:39 🔗 luckcolor i will have a run i suppose
16:39 🔗 luckcolor shall bring another instance of archivebot up
16:39 🔗 HCross was gonna throw it in my grab-site instance
16:39 🔗 luckcolor ok that works too
16:40 🔗 luckcolor :P
16:40 🔗 luckcolor will checkout
16:43 🔗 JesseW has quit IRC (Ping timeout: 370 seconds)
16:44 🔗 luckcolor yesh it does i can read the webpage
16:44 🔗 luckcolor the only thing tha probably doesn't work of bcc is the streams
16:45 🔗 HCross ill see what grab-site makes of it all
16:45 🔗 luckcolor does grabsite support phatomjs and youtube-dl?
16:47 🔗 metalcamp has quit IRC (Quit: Bye)
16:47 🔗 HCross yeah, you can pass arguments to wpull
16:48 🔗 luckcolor ok didn't know
16:51 🔗 luckcolor HCross it has already finished
16:51 🔗 HCross yeah, for some reason
16:52 🔗 metalcamp has joined #archiveteam-bs
16:52 🔗 HCross umm luckcolor - it crashed my server
17:02 🔗 metal_cam has joined #archiveteam-bs
17:05 🔗 metalcamp has quit IRC (Ping timeout: 244 seconds)
17:12 🔗 metal_cam has quit IRC (Quit: Bye)
17:16 🔗 goekesmi_ has joined #archiveteam-bs
17:19 🔗 goekesmi has quit IRC (Ping timeout: 260 seconds)
17:30 🔗 phillipsj The whole thing?
17:32 🔗 phillipsj I have had a flaky on-board LAN flake out when receiving "lots" of data. Lukily for me, I was using a serial cable to control the computer remotely, so knew exactly why it was not working.
17:35 🔗 phillipsj (It popped up and error on the console) I fixed it by installing a RTL8139 card in the machine.
17:36 🔗 HCross nah, seems SSH fell over
17:38 🔗 schbirid has quit IRC (Read error: Connection refused)
17:44 🔗 superkuh has quit IRC (Remote host closed the connection)
17:58 🔗 schbirid has joined #archiveteam-bs
18:30 🔗 metalcamp has joined #archiveteam-bs
18:38 🔗 superkuh has joined #archiveteam-bs
18:59 🔗 bzc6p has joined #archiveteam-bs
18:59 🔗 swebb sets mode: +o bzc6p
19:00 🔗 tomwsmf-a has joined #archiveteam-bs
19:04 🔗 bzc6p IA has a classy 502 error page: http://kepfeltoltes.hu/160518/IA_busy_www.kepfeltoltes.hu_.png
19:04 🔗 bzc6p Underlying GIF: https://ia601500.us.archive.org/about/busy.gif
19:11 🔗 JW_work interesting, and it's mirrored to every server
19:11 🔗 JW_work e.g. https://ia701505.us.archive.org/about/busy.gif
19:13 🔗 tomwsmf-a has quit IRC (Read error: Operation timed out)
19:18 🔗 bzc6p Indeed, the outgoing link seems to be quite saturated ATM.
19:21 🔗 logchfoo4 starts logging #archiveteam-bs at Wed May 18 19:21:33 2016
19:21 🔗 logchfoo4 has joined #archiveteam-bs
19:41 🔗 logchfoo1 starts logging #archiveteam-bs at Wed May 18 19:41:02 2016
19:41 🔗 logchfoo1 has joined #archiveteam-bs
19:58 🔗 chfoo test
19:58 🔗 chfoo hi logchfoo1
19:58 🔗 phuzion hi, chfoo
19:58 🔗 chfoo ok, i got to replace this stupid chat logger
19:59 🔗 chfoo i think someone was messing around with it
20:01 🔗 xmc chfool
20:02 🔗 chfoo i'm really tired of commenting out code and trying to patch it
20:03 🔗 JW_work the html output is very nice; don't change that part if possible
20:03 🔗 chfoo it's written in bash and i had to patch out a directory traversal vulnerability
20:04 🔗 phuzion ooh yikes
20:09 🔗 JW_work ick
20:09 🔗 JW_work the html output is written in bash?
20:10 🔗 xmc sure, why not
20:10 🔗 atrocity fast.com is giving me numbers all over hte place, lol
20:12 🔗 JW_work xmc: directory traversal vulnerabilities, for one
20:12 🔗 xmc :P
20:14 🔗 tomwsmf-a has joined #archiveteam-bs
20:16 🔗 hook54321 has joined #archiveteam-bs
20:16 🔗 ranma how about this bot for logging?
20:17 🔗 ranma do a /ctcp Chii version
20:18 🔗 ranma the only thing is that it phones home to my friend's VPS if you let it scrape the <title> of a URL, however you can disable that functionality
20:18 🔗 ranma titlesnarfer.js
20:37 🔗 fie has joined #archiveteam-bs
20:38 🔗 fie_ has quit IRC (Quit: Leaving)
20:57 🔗 remsen has quit IRC (ZNC 1.6.2 - http://znc.in)
20:57 🔗 remsen has joined #archiveteam-bs
21:09 🔗 JW_work has quit IRC (Read error: Operation timed out)
21:11 🔗 bzc6p has left
21:17 🔗 JW_work has joined #archiveteam-bs
21:21 🔗 Frogging real programmers write their IRC bots in C
21:30 🔗 MrRadar And subsequently get owned by a buffer overflow error :P
21:30 🔗 MrRadar C has its places but writing code that deals primarily with networking and strings is not one of them
21:30 🔗 hook54321 Is their a way to search everything the archive team has archived?
21:32 🔗 MrRadar You can look on the wiki or browse the IA collection: https://archive.org/details/archiveteam
21:32 🔗 MrRadar This IA search query might also be useful: https://archive.org/search.php?query=subject%3Aarchiveteam
21:33 🔗 tomwsmf-a has quit IRC (Read error: Operation timed out)
21:34 🔗 MrRadar There's also the archivebot viewer at http://archive.fart.website/archivebot/viewer/
21:34 🔗 MrRadar That will show you which URLs people have requested be archived through in #archivebot
21:36 🔗 hook54321 What about the actual content?
21:39 🔗 ndiddy has joined #archiveteam-bs
21:40 🔗 JW_work hook54321: Without downloading it locally, no, I don't think there is full-text search available.
21:40 🔗 godane so i found out that RTE Player keeps there f4m files
21:40 🔗 fie has quit IRC (Quit: Leaving)
21:41 🔗 MrRadar hook54321: There's no way to search through the content since it is literally hundreds of terabytes worth of data
21:41 🔗 godane example: http://cdn.rasset.ie/hds-vod/2015/1127/20151127_rteone-latelate-thelatelat_cl10498592_10499420_260_drm_/manifest.f4m
21:41 🔗 JW_work godane: what is that an example of?
21:41 🔗 JW_work ah, missed your comment above
21:42 🔗 JW_work MrRadar: I wouldn't say "no way" — if you wanted to pay enough to google or amazon, I'm pretty sure they could run a search over all of it.
21:43 🔗 MrRadar It would probably take quite a while to index through
21:43 🔗 MrRadar Since you'd have to pull everything from the IA
21:44 🔗 hook54321 Isn't their going to eventually going to be a search engine for the wayback machine?
21:44 🔗 JW_work maybe
21:44 🔗 schbirid has quit IRC (Quit: Leaving)
21:44 🔗 hook54321 I thought I saw an article somewhere about it
21:44 🔗 JW_work MrRadar: yeah, I wouldn't say it would be *quick*
21:45 🔗 MrRadar That would be nice, but it would be on the scale of creating a full commercial search engine with the added dimension of time
21:45 🔗 MrRadar (re a search engine)
21:46 🔗 hook54321 Let me see if I can find the article, just a sec.
21:49 🔗 hook54321 http://gizmodo.com/the-wayback-machine-is-getting-a-search-engine-1739099940
21:49 🔗 MrRadar Huh, I hadn't seen that
21:50 🔗 hook54321 Do you think they have started testing it yet?
21:51 🔗 MrRadar If they are I haven't heard of it
21:51 🔗 MrRadar What are you trying to find?
21:52 🔗 hook54321 Eh. I guess not anything specific. But their are probably lots of things I would search for if it were already available.
21:52 🔗 hook54321 How many admins does Archive.org have?
21:57 🔗 MrRadar Probably many of the people listed on this page are: https://archive.org/about/bios.php
21:57 🔗 MrRadar AFAIK the only one who hangs out here is Jason Scott (aka SketchCow)
21:58 🔗 SketchCow <3
22:00 🔗 hook54321 SketchCow, are all of the Archive.org admins official staff that are local or are their some volunteers?
22:00 🔗 SketchCow Mostly
22:00 🔗 SketchCow Some people are admin of their collections.
22:03 🔗 hook54321 Ah, ok. Do you know if Archive.org has access to kinda rare German psychology research papers from World War 2?
22:06 🔗 SketchCow No idea.
22:07 🔗 godane SketchCow: you may get some old RTE News
22:07 🔗 hook54321 RTE?
22:08 🔗 godane RTE News
22:08 🔗 MrRadar hook54321: If they were in a book the IA scanned they have a copy via that book scan. Also if someone had posted themon the web it's possible they may have been archived via the Wayback Machine (though that would require knowing a URL)
22:08 🔗 MrRadar Otherwise someone may have uploaded them directly, but that's even less likely than the other two
22:09 🔗 godane i will say the rtmpdump of the stream is weird
22:09 🔗 godane the video file is 35 minutes when i should be 29 minutes
22:18 🔗 hook54321 The specific paper I'm looking for is: "Das psychisch abnorme Kind" (Written by Hans Asperger in 1938)
22:21 🔗 hook54321 It's been referenced in books, but I haven't been able to find a copy of it. I emailed one of the authors that vaguely talked about it in one of their books, turns out they just mentioned someone else referencing it. So I emailed that person who told me to email this other person. Then I emailed that person but never got a response. 😐
22:22 🔗 MrRadar Have you tried searching Google Books?
22:22 🔗 hook54321 For the original article?
22:22 🔗 MrRadar Yeah; though you're probably more likely to find citations
22:22 🔗 hook54321 I think I have, but I'll try again.
22:23 🔗 HCross Just took a look at Google Books, seems to just be references
22:23 🔗 hook54321 So it obviously exists somewhere if people are referencing it, right?
22:25 🔗 hook54321 Could someone be trying to keep the article for just themselves and a few other people?...
22:26 🔗 tomwsmf-a has joined #archiveteam-bs
22:27 🔗 MrRadar Perhaps. Do you know anything about its original publication? Like whether it was published in an academic journal?
22:29 🔗 hook54321 http://link.springer.com/referenceworkentry/10.1007%2F978-1-4419-1698-3_1847
22:30 🔗 hook54321 "His first published paper in this area was not the celebrated 1944 paper but “Das psychisch abnorme Kind,” which appeared in the Wiener Klinischen Wochenzeitschrift in 1938 (Asperger, 1938). This was th ..."
22:30 🔗 MrRadar That's a good lead
22:31 🔗 MrRadar If you have access to a university library you should search for that journal in academic databases
22:31 🔗 MrRadar Though it's so old it may not appear in them
22:35 🔗 hook54321 Like JSTOR and stuff like that?
22:35 🔗 MrRadar Yeah
22:35 🔗 MrRadar (Though I am by no means an expert at using those kind of systems; I last touched them when I graduated 5 years ago)
22:40 🔗 hook54321 I have access to some of them, would they have the name of the article or would I need to find the specific issue it was published in?
22:41 🔗 MrRadar Try searching for the article and if that doesn't work for the journal by name I guess?
22:41 🔗 metalcamp has quit IRC (Ping timeout: 244 seconds)
22:41 🔗 MrRadar Based on the references section of that article page you linked earlier it looks like the article you're looking for is in issue(? volume?) 51
22:42 🔗 hook54321 Is their a way to find out who owned that academic journal?
22:43 🔗 MrRadar I'm not sure
22:47 🔗 MrRadar I'm going AFK for a while
22:49 🔗 JW_work hook54321: I presume you've seen this: http://wrongplanet.net/forums/viewtopic.php?t=236213
22:51 🔗 hook54321 Yeah, saw that. I even messaged the person that claimed to have a copy of it. She said it was in her attic somewhere and that she isn't able to get it from there.
22:53 🔗 hook54321 Hans Aspergers' daughter is still alive, but I haven't been able to find her email address anywhere...
22:58 🔗 hook54321 I found her address though, do any of you live or know someone who lives in Switzerland?
22:59 🔗 JW_work I'd focus more on the journal first
22:59 🔗 hook54321 She might have a copy of it though, right?
23:15 🔗 tomwsmf-a has quit IRC (Read error: Operation timed out)
23:15 🔗 tomwsmf-a has joined #archiveteam-bs

irclogger-viewer