#archiveteam 2015-04-05,Sun

↑back Search

Time Nickname Message
00:07 πŸ”— primus has quit IRC (Ping timeout: 512 seconds)
00:10 πŸ”— chfoo SketchCow: for the items "2015_ovi_store_panic" and "2015_ovi_store_panic_2", the CDX files don't seem to be generated. could you check up on that?
00:10 πŸ”— primus has joined #archiveteam
00:18 πŸ”— primus104 has joined #archiveteam
00:22 πŸ”— Start anyone know if there's a python script for scraping reddit.com/domain/ ?
00:25 πŸ”— yipdw !a http://www.reddit.com/r/subreddit --ignore-sets=reddit
00:25 πŸ”— yipdw or https://github.com/ludios/grab-site as a distributaable
00:30 πŸ”— Start i'm more interested in just getting the urls from a specific domain, for example reddit.com/domain/layervault.com
00:32 πŸ”— ohhdemgir has quit IRC (Quit: Leaving)
00:53 πŸ”— kyan has joined #archiveteam
00:56 πŸ”— beardicus has quit IRC (Quit: My MacBook Pro has gone to sleep. ZZZzzz…)
01:07 πŸ”— RichardG has joined #archiveteam
01:08 πŸ”— Wizardcry has quit IRC (Read error: Operation timed out)
01:19 πŸ”— RichardG has quit IRC (Quit: No keyboard found, press F1 to continue)
01:20 πŸ”— RichardG has joined #archiveteam
01:27 πŸ”— joepie91_ Start: there's a node.js module named reddit-stream that might do what you want
01:34 πŸ”— NovaKing_ has quit IRC (Read error: Operation timed out)
01:34 πŸ”— Selanda has quit IRC (Read error: Operation timed out)
01:35 πŸ”— lytv has quit IRC (Read error: Operation timed out)
01:35 πŸ”— cadbury_ has quit IRC (Read error: Operation timed out)
01:35 πŸ”— aNthraXx has quit IRC (Read error: Operation timed out)
01:35 πŸ”— caber has quit IRC (Read error: Operation timed out)
01:36 πŸ”— lytv has joined #archiveteam
01:36 πŸ”— Coderjoe has quit IRC (Read error: Operation timed out)
01:37 πŸ”— Coderjoe has joined #archiveteam
01:37 πŸ”— caber has joined #archiveteam
01:40 πŸ”— brayden has quit IRC (Read error: Operation timed out)
01:40 πŸ”— caber has quit IRC (Read error: Operation timed out)
01:41 πŸ”— Selanda has joined #archiveteam
01:42 πŸ”— Coderjoe has quit IRC (Read error: Operation timed out)
01:51 πŸ”— primus104 has quit IRC (Leaving.)
01:53 πŸ”— Coderjoe has joined #archiveteam
01:59 πŸ”— caber has joined #archiveteam
02:01 πŸ”— cadbury_ has joined #archiveteam
02:03 πŸ”— aNthraXx has joined #archiveteam
02:12 πŸ”— NovaKing_ has joined #archiveteam
02:44 πŸ”— Start arkiver: we should be able to begin layervault in the next few days
02:44 πŸ”— Start i've discovered some sequential api urls
02:45 πŸ”— Start i'd recommend having layervault.com and news.layervault.com (designer news) as separate warrior projects, as they are completely different sites
03:05 πŸ”— brayden has joined #archiveteam
03:19 πŸ”— primus has quit IRC (ircd.choopa.net irc.eversible.com)
03:19 πŸ”— BlueMaxim has quit IRC (ircd.choopa.net irc.eversible.com)
03:19 πŸ”— SN4T14_ has quit IRC (ircd.choopa.net irc.eversible.com)
03:19 πŸ”— Emcy has quit IRC (ircd.choopa.net irc.eversible.com)
03:19 πŸ”— Mayonaise has quit IRC (ircd.choopa.net irc.eversible.com)
03:19 πŸ”— rejon has quit IRC (ircd.choopa.net irc.eversible.com)
03:19 πŸ”— Rickster has quit IRC (ircd.choopa.net irc.eversible.com)
03:19 πŸ”— ryan_ has quit IRC (ircd.choopa.net irc.eversible.com)
03:19 πŸ”— xmc has quit IRC (ircd.choopa.net irc.eversible.com)
03:19 πŸ”— Sue_ has quit IRC (ircd.choopa.net irc.eversible.com)
03:19 πŸ”— yipdw has quit IRC (ircd.choopa.net irc.eversible.com)
03:19 πŸ”— dcmorton has quit IRC (ircd.choopa.net irc.eversible.com)
03:19 πŸ”— marnold has quit IRC (ircd.choopa.net irc.eversible.com)
03:19 πŸ”— ersi has quit IRC (ircd.choopa.net irc.eversible.com)
03:19 πŸ”— slash` has quit IRC (ircd.choopa.net irc.eversible.com)
03:19 πŸ”— Famicoman has quit IRC (ircd.choopa.net irc.eversible.com)
03:19 πŸ”— eprillios has quit IRC (ircd.choopa.net irc.eversible.com)
03:19 πŸ”— Cameron_D has quit IRC (ircd.choopa.net irc.eversible.com)
03:21 πŸ”— SN4T14 has joined #archiveteam
03:23 πŸ”— primus has joined #archiveteam
03:23 πŸ”— BlueMaxim has joined #archiveteam
03:23 πŸ”— SN4T14_ has joined #archiveteam
03:23 πŸ”— Mayonaise has joined #archiveteam
03:23 πŸ”— rejon has joined #archiveteam
03:23 πŸ”— Rickster has joined #archiveteam
03:23 πŸ”— ryan_ has joined #archiveteam
03:23 πŸ”— xmc has joined #archiveteam
03:23 πŸ”— Sue_ has joined #archiveteam
03:23 πŸ”— yipdw has joined #archiveteam
03:23 πŸ”— dcmorton has joined #archiveteam
03:23 πŸ”— marnold has joined #archiveteam
03:23 πŸ”— ersi has joined #archiveteam
03:23 πŸ”— slash` has joined #archiveteam
03:23 πŸ”— Famicoman has joined #archiveteam
03:23 πŸ”— Cameron_D has joined #archiveteam
03:23 πŸ”— irc.eversible.com sets mode: +oooo xmc dcmorton ersi Cameron_D
03:23 πŸ”— swebb sets mode: +o xmc
03:23 πŸ”— swebb sets mode: +o ersi
03:25 πŸ”— Wolfie has quit IRC (Read error: Connection reset by peer)
03:26 πŸ”— dcmorton has quit IRC (Excess Flood)
03:26 πŸ”— dcmorton has joined #archiveteam
03:27 πŸ”— Famicoman has quit IRC (Remote host closed the connection)
03:27 πŸ”— ersi has quit IRC (Read error: Connection reset by peer)
03:27 πŸ”— ersi has joined #archiveteam
03:27 πŸ”— swebb sets mode: +o ersi
03:30 πŸ”— SN4T14_ has quit IRC (Ping timeout: 512 seconds)
03:35 πŸ”— eprillios has joined #archiveteam
03:36 πŸ”— Famicoman has joined #archiveteam
03:37 πŸ”— fiatjaf has left undefined
04:16 πŸ”— SketchCow chfoo: Restarted - let's see if it derives
04:18 πŸ”— Infreq has joined #archiveteam
04:21 πŸ”— chazchaz_ has quit IRC (Remote host closed the connection)
04:22 πŸ”— chazchaz_ has joined #archiveteam
04:33 πŸ”— svchfoo2 has quit IRC (Quit: Closing)
04:36 πŸ”— svchfoo2 has joined #archiveteam
06:06 πŸ”— garyrh I'm writing up a grab project for blingee.
06:06 πŸ”— garyrh (channel name suggestion: #jankee)
06:18 πŸ”— mistym has joined #archiveteam
06:45 πŸ”— JMC has quit IRC (Ping timeout: 370 seconds)
07:18 πŸ”— scyther has joined #archiveteam
07:32 πŸ”— signius Is there any ETA on when the Google Code Grab is likely to start
07:33 πŸ”— dashcloud has quit IRC (Read error: Operation timed out)
08:07 πŸ”— primus104 has joined #archiveteam
08:24 πŸ”— schbirid has joined #archiveteam
08:50 πŸ”— mistym has quit IRC (Remote host closed the connection)
08:58 πŸ”— BlueMaxim has quit IRC (Ping timeout: 512 seconds)
08:59 πŸ”— BlueMaxim has joined #archiveteam
09:44 πŸ”— Ymgve has joined #archiveteam
09:45 πŸ”— habi has joined #archiveteam
09:47 πŸ”— habi has left
10:02 πŸ”— signius has quit IRC (Ping timeout: 306 seconds)
10:14 πŸ”— signius has joined #archiveteam
10:22 πŸ”— schbirid could someone fully archive https://www.reddit.com/r/IAmA/comments/31esm0/iama_95_year_old_german_women_from_a_village_in/ ? it's wonderful
10:31 πŸ”— Smiley archivebot no good for it??
11:09 πŸ”— BlueMaxim I imagine that loading all the comments would be the problem
11:17 πŸ”— BlueMaxim has quit IRC (Read error: Connection reset by peer)
11:34 πŸ”— SimpBrain has joined #archiveteam
11:48 πŸ”— primus has quit IRC (Read error: Connection timed out)
11:49 πŸ”— primus has joined #archiveteam
11:59 πŸ”— dashcloud has joined #archiveteam
12:32 πŸ”— Ara_ has joined #archiveteam
12:35 πŸ”— philpem has joined #archiveteam
12:38 πŸ”— Ara__ has quit IRC (Ping timeout: 492 seconds)
12:45 πŸ”— Ara__ has joined #archiveteam
12:51 πŸ”— Ara_ has quit IRC (Ping timeout: 492 seconds)
12:53 πŸ”— Ara_ has joined #archiveteam
12:54 πŸ”— Ara__ has quit IRC (Ping timeout: 492 seconds)
13:10 πŸ”— SimpBrain got a new server ready to pile archiveteam data on to. Intel(R) Core(TM) i7-2600 CPU @ 3.40GHz (Cores 8), 2 x 3TB hdd's. 16GB Ram
13:29 πŸ”— Ara_ has quit IRC (Ping timeout: 492 seconds)
13:35 πŸ”— monod has joined #archiveteam
13:42 πŸ”— mietek Would someone be able to help recover lost files related to the Charity programming language? https://github.com/mietek/charity-language/issues/1
13:43 πŸ”— mietek I’ve manually archived the Charity website (http://pll.cpsc.ucalgary.ca/charity1/www/home.html) while it’s still available, fixing broken links and restoring papers from other locations: https://github.com/mietek/charity-language
13:43 πŸ”— mietek There are, however, some files which I cannot find
13:45 πŸ”— mietek I’ve also made a full mirror of the website, which includes many TeX source files, and unrelated papers β€” if anyone is interested, I can upload the tarball somewhere.
13:46 πŸ”— mietek It’s 160MB compressed
13:46 πŸ”— arkiver you can always upload those archived to the Internet Archive
13:46 πŸ”— arkiver they'd happily store it for :)
13:49 πŸ”— mietek Didn’t know they take tarballs
13:50 πŸ”— arkiver they take any kind of file
14:01 πŸ”— antomatic has quit IRC ()
14:03 πŸ”— Wolfie has joined #archiveteam
14:05 πŸ”— antomatic has joined #archiveteam
14:08 πŸ”— primus104 has quit IRC (Leaving.)
14:38 πŸ”— bzc6p has joined #archiveteam
14:39 πŸ”— bzc6p has left
14:50 πŸ”— Peetz0r_ has joined #archiveteam
14:50 πŸ”— Peetz0r has quit IRC (Read error: Connection reset by peer)
14:58 πŸ”— Ara_ has joined #archiveteam
15:13 πŸ”— monod has quit IRC (Ping timeout: 512 seconds)
15:18 πŸ”— SimpBrai1 has joined #archiveteam
15:25 πŸ”— SimpBrain has quit IRC (Ping timeout: 512 seconds)
15:31 πŸ”— Infreq has quit IRC ()
15:46 πŸ”— signius has quit IRC (Quit: Leaving)
15:47 πŸ”— signius has joined #archiveteam
15:48 πŸ”— signius has quit IRC (Client Quit)
15:48 πŸ”— signius has joined #archiveteam
16:40 πŸ”— monod has joined #archiveteam
16:40 πŸ”— balrog mietek: what happened to ftp.cpsc.ucalgary.ca?
16:41 πŸ”— mietek balrog: good question
16:41 πŸ”— balrog has anyone asked the university?
16:41 πŸ”— mietek Probably overzealous IT departments
16:41 πŸ”— mietek Note the Calgary pages block IA
16:41 πŸ”— balrog (asked as a programmer/researcher)
16:41 πŸ”— mietek I’ve contacted the main researcher behind the project; no response yet
16:42 πŸ”— balrog :/ ok
16:42 πŸ”— mietek I’m now working down the list of people associated with the project
16:42 πŸ”— mietek But they’re all long gone from the university
16:42 πŸ”— primus104 has joined #archiveteam
16:42 πŸ”— mietek It really pisses me off that universities delete people’s home pages
16:42 πŸ”— mietek It should be a crime to do that
16:43 πŸ”— balrog http://pll.cpsc.ucalgary.ca/charity1/www/home.html does seem still to be up
16:43 πŸ”— balrog and that server has no robots.txt
16:43 πŸ”— mietek I know. I pasted that above :)
16:43 πŸ”— mietek That server is pretty badly set up, so you can actually browse the entire hierarchy
16:43 πŸ”— mietek And so I was able to recover almost all of their papers
16:44 πŸ”— mietek Home pages were hosted on e.g. http://web.archive.org/web/*/pages.cpsc.ucalgary.ca/%7Espoonerd/
16:44 πŸ”— balrog and there's no robots.txt there either
16:44 πŸ”— balrog this is an issue with IA where it doesn't refresh if the robots.txt is removed, apparently :/
16:45 πŸ”— mietek I’m holding out hope that IA crawls even if it’s blocked
16:45 πŸ”— mietek And just silently collects the data
16:45 πŸ”— mietek For the future
16:45 πŸ”— habi has joined #archiveteam
16:45 πŸ”— balrog afaik IA does not
16:45 πŸ”— mietek :(
16:46 πŸ”— habi has left
16:46 πŸ”— xmc archivebot does!
16:47 πŸ”— mietek Was it around in 1997?
16:47 πŸ”— xmc no.
16:50 πŸ”— mietek Do you have any tips for locating people?
16:50 πŸ”— mietek https://github.com/mietek/charity-language/blob/master/doc/pdf/2003-zeng-an-implementation-of-charity.pdf
16:51 πŸ”— mietek Min Zeng, Calgary MSc 2003
16:51 πŸ”— habi1 has joined #archiveteam
16:51 πŸ”— mietek Actually, that’s probably easy.
16:55 πŸ”— Ara_ has quit IRC (Ping timeout: 240 seconds)
16:58 πŸ”— habi1 has left
17:04 πŸ”— mistym has joined #archiveteam
17:28 πŸ”— Wizardcry has joined #archiveteam
17:53 πŸ”— monod has quit IRC (Ping timeout: 512 seconds)
17:56 πŸ”— Wizardcry has quit IRC (Read error: Operation timed out)
18:02 πŸ”— appledash has quit IRC (Read error: Connection reset by peer)
18:12 πŸ”— Ara_ has joined #archiveteam
18:19 πŸ”— rolfb has joined #archiveteam
18:21 πŸ”— aliz has joined #archiveteam
18:29 πŸ”— rolfb has quit IRC (Leaving...)
19:25 πŸ”— garyrh has quit IRC (Write error: Broken pipe)
19:28 πŸ”— useretail has quit IRC (hub.se irc.ac.za)
19:35 πŸ”— garyrh has joined #archiveteam
19:36 πŸ”— lytv has quit IRC (Ping timeout: 265 seconds)
19:37 πŸ”— Start arkiver: once we've started with friendfeed, we'll be able to start layervault
19:37 πŸ”— Start i found a way of grabbing everything sequentially through their api
19:39 πŸ”— lytv has joined #archiveteam
19:40 πŸ”— arkiver Start: awesome
19:40 πŸ”— arkiver looks like I can safely fully start the grab of friendfeed tonight, which means less work on that
19:40 πŸ”— arkiver then I'll get on layervault
19:47 πŸ”— Rickster has quit IRC (Quit: ZNC - http://znc.in)
19:48 πŸ”— SN4T14_ has joined #archiveteam
19:51 πŸ”— Rickster has joined #archiveteam
19:52 πŸ”— Mayonaise has quit IRC (Ping timeout: 512 seconds)
19:53 πŸ”— SN4T14 has quit IRC (Ping timeout: 306 seconds)
20:39 πŸ”— Mayonaise has joined #archiveteam
20:40 πŸ”— godane has quit IRC (Read error: Operation timed out)
20:47 πŸ”— svchfoo2 has quit IRC (Ping timeout: 240 seconds)
20:52 πŸ”— svchfoo2 has joined #archiveteam
21:06 πŸ”— SimpBrai1 has quit IRC (Quit: Leaving)
21:08 πŸ”— Deewiant has joined #archiveteam
21:11 πŸ”— aaaaaaaaa has joined #archiveteam
21:13 πŸ”— godane has joined #archiveteam
21:39 πŸ”— Peetz0r_ is now known as Peetz0r
21:56 πŸ”— scyther has quit IRC (Read error: Connection reset by peer)
22:15 πŸ”— wtron has joined #archiveteam
22:16 πŸ”— BlueMaxim has joined #archiveteam
22:21 πŸ”— mistym has quit IRC (Remote host closed the connection)
22:51 πŸ”— arkiver Start: can we talk in ~10 hours about the findings you got from layerfault?
22:51 πŸ”— arkiver we'll be starting a discover tomorrow for that
22:52 πŸ”— Start ok
23:15 πŸ”— mahadri has joined #archiveteam
23:58 πŸ”— Atluxity hmmm.. I just had a fantasy about a archive warrior using html5, websockets etc... All one would need to participate would be to visit a warrior-url with a modern browser, and then it would do the job from there
23:58 πŸ”— Ara_ has quit IRC (Ping timeout: 240 seconds)
23:59 πŸ”— philpem has quit IRC (Ping timeout: 260 seconds)
23:59 πŸ”— mistym has joined #archiveteam

irclogger-viewer