#archiveteam 2016-06-07,Tue

↑back Search

Time Nickname Message
00:03 🔗 ris has quit IRC ()
00:28 🔗 antomati_ has joined #archiveteam
00:28 🔗 swebb sets mode: +o antomati_
00:30 🔗 antomatic has quit IRC (Read error: Operation timed out)
00:38 🔗 BlueMaxim has quit IRC (Quit: Leaving)
00:40 🔗 jspiros has quit IRC (Read error: Connection reset by peer)
00:40 🔗 jspiros has joined #archiveteam
00:55 🔗 JesseW has joined #archiveteam
01:10 🔗 r3c0d3x has quit IRC (Ping timeout: 260 seconds)
01:39 🔗 philpem has quit IRC (Ping timeout: 260 seconds)
01:46 🔗 r3c0d3x has joined #archiveteam
02:07 🔗 dashcloud SketchCow: any chance you're able to change the collection & type for this? https://archive.org/details/dayyouwereborn Should be a playable Win3.1 title, but I messed up. Thanks!
02:22 🔗 ploop_ has joined #archiveteam
02:26 🔗 ploop has quit IRC (Ping timeout: 633 seconds)
02:55 🔗 kcaj has quit IRC (Ping timeout: 250 seconds)
02:55 🔗 d_rebel has quit IRC (Ping timeout: 250 seconds)
02:55 🔗 Fletcher_ has quit IRC (Ping timeout: 250 seconds)
02:55 🔗 logchfoo4 has quit IRC (Ping timeout: 250 seconds)
02:57 🔗 logchfoo1 starts logging #archiveteam at Tue Jun 07 02:57:29 2016
02:57 🔗 logchfoo1 has joined #archiveteam
02:58 🔗 kcaj has joined #archiveteam
03:00 🔗 dashcloud has joined #archiveteam
03:01 🔗 Gfy has joined #archiveteam
03:08 🔗 Stilett0 has quit IRC ()
03:09 🔗 xXx_ndidd has joined #archiveteam
03:10 🔗 vtyl has joined #archiveteam
03:14 🔗 fie_ has joined #archiveteam
03:18 🔗 fie has quit IRC (Ping timeout: 370 seconds)
03:19 🔗 lytv has quit IRC (Read error: Operation timed out)
03:22 🔗 ndiddy has quit IRC (Read error: Operation timed out)
03:27 🔗 koon has joined #archiveteam
03:32 🔗 xhdr has joined #archiveteam
03:44 🔗 espes__ has joined #archiveteam
03:45 🔗 Fletcher_ has joined #archiveteam
03:46 🔗 Deewiant has joined #archiveteam
04:16 🔗 Sk1d has joined #archiveteam
05:02 🔗 BlueMaxim has joined #archiveteam
05:24 🔗 consarnit has joined #archiveteam
05:26 🔗 consarnit hey all!
05:26 🔗 consarnit Can I have the wiki signup password?
05:26 🔗 consarnit WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD
05:26 🔗 consarnit in case there's a bot..
05:27 🔗 consarnit Or, alternately, could somebody start a page putting https://seene.co/ on deathwatch?
05:27 🔗 consarnit It's a weird little creative network for 3D scans, just got acquired by SnapChat, no product updates since 2015
05:28 🔗 dxrt hey, yahoosucks is the password
05:28 🔗 consarnit Seems like it won't last much longer
05:28 🔗 consarnit takk
05:36 🔗 consarnit looks pretty scrapeable
05:36 🔗 consarnit undocumented api but their web renderer uses one
05:37 🔗 consarnit .oemodel files
05:37 🔗 consarnit which I think are proprietary
05:37 🔗 consarnit ex https://d2qkfprjkxv2r7.cloudfront.net/uploads/scene/model/16e40b69-1834-456e-b729-ac5fc08bacee/scene.oemodel
05:37 🔗 consarnit oh but sweet there is already a FOSS viewer
05:37 🔗 consarnit https://github.com/detunized/seene-viewer
05:38 🔗 consarnit so ya
05:38 🔗 consarnit should be a pretty do-able job
05:38 🔗 consarnit I don't know what your process is though
05:38 🔗 consarnit do you have a scraper farm that I can like write a job for?
05:42 🔗 HCross2 If it's small #archivebot
05:44 🔗 consarnit Looks like there are maybe 500,000 users, lets say avg 20 pics items per user?
05:44 🔗 consarnit Probably quite less than that
05:44 🔗 consarnit Is that "small"?
05:44 🔗 consarnit I have no context
05:45 🔗 consarnit I've written lots of pythony scrapers before but IDK how you guys plan your attacks - is there a wiki page on writing Tracker jobs?
05:48 🔗 philpem has joined #archiveteam
05:52 🔗 JesseW That's probably small, yeah.
05:53 🔗 JesseW We have two basic processes -- #archivebot and #warrior jobs.
05:54 🔗 JesseW #archivebot is a set of donated servers that can manually-triggered spiderings of sites (and one-level deep external links) which then get automatically uploaded to the Internet Archive, and (generally) added to the Wayback Machine.
05:55 🔗 consarnit oh nice!
05:55 🔗 consarnit #ab would probably work for a small social/media network right?
05:56 🔗 consarnit how do I schedule that?
05:56 🔗 JesseW The #warrior is a VM, run by a few hundred people (you could be one, too!) that runs custom scripts (generally all written by our hard-working and generally amazing member named arkiver) to handle bigger or more rush jobs.
05:57 🔗 JesseW Join the #archivebot channel on this network -- that's where the bot is commanded from.
05:58 🔗 JesseW Initially you can just trigger specific (non-recursive) jobs, but if you suggest other ones, there are generally people available to trigger them for you. And if you stay around for a while, you'll likely get granted permission to do so yourself.
05:58 🔗 JesseW You can see what is currently being worked on at this dashboard: http://dashboard.at.ninjawedding.org/beta
05:59 🔗 JesseW (that's actually the beta version, but I like it a lot better than the other one)
05:59 🔗 consarnit great domain
06:00 🔗 JesseW yep, a lot of the domains used for archiveteam stuff are ... entertaining.
06:11 🔗 xmc lots of personal domains mostly
06:12 🔗 xmc woop woop woop off-topic siren
06:12 🔗 xmc --> #archiveteam-bs
06:28 🔗 Honno has joined #archiveteam
06:48 🔗 WinterFox has joined #archiveteam
07:22 🔗 schbirid has joined #archiveteam
07:24 🔗 Baljem_ has joined #archiveteam
07:24 🔗 Baljem has quit IRC (Ping timeout: 370 seconds)
07:35 🔗 Cameron_D has quit IRC (Ping timeout: 370 seconds)
07:39 🔗 maseck has quit IRC (Read error: Operation timed out)
07:41 🔗 Cameron_D has joined #archiveteam
07:41 🔗 maseck has joined #archiveteam
07:44 🔗 dxrt has quit IRC (Excess Flood)
07:46 🔗 dxrt has joined #archiveteam
07:46 🔗 dxrt- sets mode: +o dxrt
07:58 🔗 JesseW has quit IRC (Ping timeout: 370 seconds)
08:05 🔗 Emcy_ has joined #archiveteam
08:05 🔗 consarnit has quit IRC (Remote host closed the connection)
08:13 🔗 rduser has quit IRC (Ping timeout: 370 seconds)
08:14 🔗 jut has joined #archiveteam
08:14 🔗 rduser has joined #archiveteam
08:18 🔗 Emcy has quit IRC (Read error: Operation timed out)
08:21 🔗 atomotic has joined #archiveteam
08:35 🔗 Honno_ has joined #archiveteam
08:41 🔗 fie has joined #archiveteam
08:43 🔗 fie has quit IRC (Remote host closed the connection)
08:43 🔗 fie has joined #archiveteam
08:44 🔗 fie_ has quit IRC (Ping timeout: 244 seconds)
08:47 🔗 arkiver3 has joined #archiveteam
08:48 🔗 Honno has quit IRC (Read error: Operation timed out)
08:56 🔗 W1nterFox has joined #archiveteam
08:57 🔗 WinterFox has quit IRC (Ping timeout: 1208 seconds)
09:04 🔗 arkiver3 has quit IRC (Ping timeout: 244 seconds)
09:05 🔗 consarnit has joined #archiveteam
09:09 🔗 ariscop has quit IRC (Leaving)
09:09 🔗 consarnit has quit IRC (Ping timeout: 244 seconds)
09:14 🔗 SN4T14 has quit IRC (Ping timeout: 370 seconds)
09:21 🔗 SN4T14 has joined #archiveteam
09:22 🔗 fie has quit IRC (Quit: Leaving)
09:27 🔗 fie has joined #archiveteam
09:32 🔗 SilSte has joined #archiveteam
10:00 🔗 midas https://torrentfreak.com/takedown-staydown-would-be-a-disaster-internet-archive-warns-160607/
10:02 🔗 ariscop has joined #archiveteam
10:47 🔗 Honno__ has joined #archiveteam
10:56 🔗 SketchCow -----------------------------------------------------
10:56 🔗 SketchCow A LITTLE BIRD TOLD ME TWEET TWEET GOOGLE GROUPS GONE WITHIN A YEAR
10:56 🔗 SketchCow -----------------------------------------------------
10:57 🔗 Honno_ has quit IRC (Read error: Operation timed out)
10:58 🔗 W1nterFox has quit IRC (Read error: Operation timed out)
10:59 🔗 SketchCow So... plan accordingly
11:00 🔗 SketchCow dashcloud: That thing's a broken mess
11:05 🔗 dashcloud SketchCow: thanks- I'll take a look at it.
11:05 🔗 Emcy has joined #archiveteam
11:08 🔗 Honno has joined #archiveteam
11:09 🔗 PurpleSym At least we can start with a list of groups discovered in 2011.
11:10 🔗 PurpleSym -> https://archive.org/details/archiveteam-googlegroups?&sort=-publicdate
11:12 🔗 SketchCow I think there's fundamental issues with the item. I got it to sort of boot and it was DLL city
11:18 🔗 Emcy_ has quit IRC (Read error: Operation timed out)
11:20 🔗 Honno__ has quit IRC (Read error: Operation timed out)
11:28 🔗 WinterFox has joined #archiveteam
11:30 🔗 Stiletto has joined #archiveteam
11:34 🔗 dcmorton has quit IRC (Ping timeout: 370 seconds)
11:36 🔗 atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
11:36 🔗 dcmorton has joined #archiveteam
11:36 🔗 swebb sets mode: +o dcmorton
11:57 🔗 klg_ has joined #archiveteam
11:57 🔗 klg has quit IRC (Ping timeout: 370 seconds)
11:58 🔗 n00bLurke has joined #archiveteam
12:07 🔗 n00bLurke has quit IRC (n00bLurke)
12:07 🔗 RichardG has quit IRC (Read error: Connection reset by peer)
12:29 🔗 dcmorton has quit IRC (Ping timeout: 370 seconds)
12:32 🔗 BartoCH has quit IRC (Ping timeout: 260 seconds)
12:33 🔗 atomotic has joined #archiveteam
12:34 🔗 dcmorton has joined #archiveteam
12:34 🔗 swebb sets mode: +o dcmorton
12:39 🔗 BartoCH has joined #archiveteam
12:50 🔗 Aranje has quit IRC (Ping timeout: 260 seconds)
12:51 🔗 VADemon has joined #archiveteam
13:00 🔗 BlueMaxim has quit IRC (Quit: Leaving)
13:01 🔗 Aranje has joined #archiveteam
13:12 🔗 WinterFox has quit IRC (Remote host closed the connection)
13:22 🔗 BartoCH has quit IRC (Ping timeout: 260 seconds)
13:29 🔗 n00bLurke has joined #archiveteam
13:29 🔗 BartoCH has joined #archiveteam
13:37 🔗 BartoCH has quit IRC (Quit: WeeChat 1.5)
13:38 🔗 BartoCH has joined #archiveteam
14:20 🔗 atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
14:29 🔗 hawc145 has quit IRC (Ping timeout: 370 seconds)
14:32 🔗 hawc145 has joined #archiveteam
14:34 🔗 jut_ has joined #archiveteam
14:37 🔗 jut has quit IRC (Read error: Operation timed out)
14:40 🔗 Start has quit IRC (Quit: Disconnected.)
14:43 🔗 HCross2 has quit IRC (Ping timeout: 260 seconds)
14:44 🔗 sigkell_ has quit IRC (Ping timeout: 260 seconds)
14:44 🔗 sigkell_ has joined #archiveteam
14:55 🔗 SN4T14 has quit IRC (Ping timeout: 370 seconds)
14:55 🔗 SN4T14 has joined #archiveteam
14:57 🔗 HCross2 has joined #archiveteam
15:15 🔗 VADemon has quit IRC (Ping timeout: 250 seconds)
15:26 🔗 VADemon has joined #archiveteam
15:27 🔗 Cameron_D has quit IRC (Ping timeout: 370 seconds)
15:27 🔗 Cameron_D has joined #archiveteam
15:32 🔗 Start has joined #archiveteam
15:48 🔗 JesseW has joined #archiveteam
16:03 🔗 Aranje has quit IRC (Quit: Three sheets to the wind)
16:04 🔗 sivoais_ has joined #archiveteam
16:04 🔗 sivoais has quit IRC (Ping timeout: 370 seconds)
16:07 🔗 Start has quit IRC (Quit: Disconnected.)
16:10 🔗 Aranje has joined #archiveteam
16:13 🔗 JesseW has quit IRC (Ping timeout: 370 seconds)
16:19 🔗 Start has joined #archiveteam
16:20 🔗 Start has quit IRC (Client Quit)
16:34 🔗 twrist has joined #archiveteam
16:48 🔗 GLaDOS has quit IRC (Read error: Operation timed out)
16:48 🔗 twrist is now known as GLaDOS
17:09 🔗 consarnit has joined #archiveteam
17:24 🔗 hawc145 is now known as HCross
17:36 🔗 Simpbra1 has quit IRC (Ping timeout: 370 seconds)
17:38 🔗 Cameron_D has quit IRC (Ping timeout: 370 seconds)
17:38 🔗 Cameron_D has joined #archiveteam
17:41 🔗 RichardG has joined #archiveteam
17:53 🔗 Simpbra1 has joined #archiveteam
18:13 🔗 consarnit has quit IRC ()
18:22 🔗 Start has joined #archiveteam
18:30 🔗 Tomcat_ has joined #archiveteam
18:47 🔗 klg_ is now known as klg
19:01 🔗 winr5r has quit IRC (Read error: Operation timed out)
19:07 🔗 Start has quit IRC (Quit: Disconnected.)
19:09 🔗 Simpbra1 has quit IRC (Read error: Operation timed out)
19:11 🔗 Start has joined #archiveteam
19:14 🔗 jut has joined #archiveteam
19:16 🔗 jut_ has quit IRC (Read error: Operation timed out)
19:18 🔗 winr4r has joined #archiveteam
19:18 🔗 ranma is now known as madpent
19:19 🔗 madpent is now known as ranma
19:21 🔗 Simpbra1 has joined #archiveteam
19:23 🔗 jut has quit IRC (Quit: Leaving)
19:26 🔗 atomotic has joined #archiveteam
19:40 🔗 Start has quit IRC (Quit: Disconnected.)
19:56 🔗 maseck_ has joined #archiveteam
20:02 🔗 atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
20:04 🔗 Honno has quit IRC (Ping timeout: 492 seconds)
20:07 🔗 maseck has quit IRC (Ping timeout: 1208 seconds)
20:24 🔗 Tomcat_ has quit IRC (Remote host closed the connection)
20:36 🔗 VADemon has quit IRC (Quit: left4dead)
20:36 🔗 schbirid has quit IRC (Quit: Leaving)
20:49 🔗 ariscop has quit IRC (Quit: Leaving)
21:05 🔗 tomwsmf-a has joined #archiveteam
21:07 🔗 pikhq has quit IRC (Ping timeout: 506 seconds)
21:16 🔗 n00bLurke has quit IRC (n00bLurke)
21:23 🔗 pikhq has joined #archiveteam
21:24 🔗 fie has quit IRC (Ping timeout: 244 seconds)
21:26 🔗 schbirid has joined #archiveteam
21:28 🔗 ariscop has joined #archiveteam
21:35 🔗 ris has joined #archiveteam
21:48 🔗 arkiver Let's get https://seene.co/ and google groups
21:48 🔗 arkiver :D
21:52 🔗 arkiver seene.co indeed looks pretty doable
21:58 🔗 schbirid has quit IRC (Quit: Leaving)
22:22 🔗 Pudsey has joined #archiveteam
22:23 🔗 Pudsey Any word on the robots.txt issue with the blip archive? You could access it yesterday by adding www. to blip.tv but now even that gives robots.txt
22:28 🔗 Ravenloft has joined #archiveteam
22:39 🔗 JW_work1 arkiver: I think we got seene.co via archivebot yesterday.
22:39 🔗 arkiver all of it?
22:39 🔗 arkiver https://seene.co/u/zettlerm/
22:39 🔗 arkiver https://seene.co/s/nXH5qs/
22:39 🔗 arkiver for example
22:40 🔗 JW_work1 well, we'll need to wait till it posts to IA to check, but I think we got those, yes.
22:41 🔗 Pudsey has quit IRC (Remote host closed the connection)
23:02 🔗 Start has joined #archiveteam
23:06 🔗 ris has quit IRC ()
23:58 🔗 xmc has quit IRC (Read error: Operation timed out)

irclogger-viewer