[00:03] *** ris has quit IRC () [00:28] *** antomati_ has joined #archiveteam [00:28] *** swebb sets mode: +o antomati_ [00:30] *** antomatic has quit IRC (Read error: Operation timed out) [00:38] *** BlueMaxim has quit IRC (Quit: Leaving) [00:40] *** jspiros has quit IRC (Read error: Connection reset by peer) [00:40] *** jspiros has joined #archiveteam [00:55] *** JesseW has joined #archiveteam [01:10] *** r3c0d3x has quit IRC (Ping timeout: 260 seconds) [01:39] *** philpem has quit IRC (Ping timeout: 260 seconds) [01:46] *** r3c0d3x has joined #archiveteam [02:07] SketchCow: any chance you're able to change the collection & type for this? https://archive.org/details/dayyouwereborn Should be a playable Win3.1 title, but I messed up. Thanks! [02:22] *** ploop_ has joined #archiveteam [02:26] *** ploop has quit IRC (Ping timeout: 633 seconds) [02:55] *** kcaj has quit IRC (Ping timeout: 250 seconds) [02:55] *** d_rebel has quit IRC (Ping timeout: 250 seconds) [02:55] *** Fletcher_ has quit IRC (Ping timeout: 250 seconds) [02:55] *** logchfoo4 has quit IRC (Ping timeout: 250 seconds) [02:57] *** logchfoo1 starts logging #archiveteam at Tue Jun 07 02:57:29 2016 [02:57] *** logchfoo1 has joined #archiveteam [02:58] *** kcaj has joined #archiveteam [03:00] *** dashcloud has joined #archiveteam [03:01] *** Gfy has joined #archiveteam [03:08] *** Stilett0 has quit IRC () [03:09] *** xXx_ndidd has joined #archiveteam [03:10] *** vtyl has joined #archiveteam [03:14] *** fie_ has joined #archiveteam [03:18] *** fie has quit IRC (Ping timeout: 370 seconds) [03:19] *** lytv has quit IRC (Read error: Operation timed out) [03:22] *** ndiddy has quit IRC (Read error: Operation timed out) [03:27] *** koon has joined #archiveteam [03:32] *** xhdr has joined #archiveteam [03:44] *** espes__ has joined #archiveteam [03:45] *** Fletcher_ has joined #archiveteam [03:46] *** Deewiant has joined #archiveteam [04:16] *** Sk1d has joined #archiveteam [05:02] *** BlueMaxim has joined #archiveteam [05:24] *** consarnit has joined #archiveteam [05:26] hey all! [05:26] Can I have the wiki signup password? [05:26] WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD [05:26] in case there's a bot.. [05:27] Or, alternately, could somebody start a page putting https://seene.co/ on deathwatch? [05:27] It's a weird little creative network for 3D scans, just got acquired by SnapChat, no product updates since 2015 [05:28] hey, yahoosucks is the password [05:28] Seems like it won't last much longer [05:28] takk [05:36] looks pretty scrapeable [05:36] undocumented api but their web renderer uses one [05:37] .oemodel files [05:37] which I think are proprietary [05:37] ex https://d2qkfprjkxv2r7.cloudfront.net/uploads/scene/model/16e40b69-1834-456e-b729-ac5fc08bacee/scene.oemodel [05:37] oh but sweet there is already a FOSS viewer [05:37] https://github.com/detunized/seene-viewer [05:38] so ya [05:38] should be a pretty do-able job [05:38] I don't know what your process is though [05:38] do you have a scraper farm that I can like write a job for? [05:42] If it's small #archivebot [05:44] Looks like there are maybe 500,000 users, lets say avg 20 pics items per user? [05:44] Probably quite less than that [05:44] Is that "small"? [05:44] I have no context [05:45] I've written lots of pythony scrapers before but IDK how you guys plan your attacks - is there a wiki page on writing Tracker jobs? [05:48] *** philpem has joined #archiveteam [05:52] That's probably small, yeah. [05:53] We have two basic processes -- #archivebot and #warrior jobs. [05:54] #archivebot is a set of donated servers that can manually-triggered spiderings of sites (and one-level deep external links) which then get automatically uploaded to the Internet Archive, and (generally) added to the Wayback Machine. [05:55] oh nice! [05:55] #ab would probably work for a small social/media network right? [05:56] how do I schedule that? [05:56] The #warrior is a VM, run by a few hundred people (you could be one, too!) that runs custom scripts (generally all written by our hard-working and generally amazing member named arkiver) to handle bigger or more rush jobs. [05:57] Join the #archivebot channel on this network -- that's where the bot is commanded from. [05:58] Initially you can just trigger specific (non-recursive) jobs, but if you suggest other ones, there are generally people available to trigger them for you. And if you stay around for a while, you'll likely get granted permission to do so yourself. [05:58] You can see what is currently being worked on at this dashboard: http://dashboard.at.ninjawedding.org/beta [05:59] (that's actually the beta version, but I like it a lot better than the other one) [05:59] great domain [06:00] yep, a lot of the domains used for archiveteam stuff are ... entertaining. [06:11] lots of personal domains mostly [06:12] woop woop woop off-topic siren [06:12] --> #archiveteam-bs [06:28] *** Honno has joined #archiveteam [06:48] *** WinterFox has joined #archiveteam [07:22] *** schbirid has joined #archiveteam [07:24] *** Baljem_ has joined #archiveteam [07:24] *** Baljem has quit IRC (Ping timeout: 370 seconds) [07:35] *** Cameron_D has quit IRC (Ping timeout: 370 seconds) [07:39] *** maseck has quit IRC (Read error: Operation timed out) [07:41] *** Cameron_D has joined #archiveteam [07:41] *** maseck has joined #archiveteam [07:44] *** dxrt has quit IRC (Excess Flood) [07:46] *** dxrt has joined #archiveteam [07:46] *** dxrt- sets mode: +o dxrt [07:58] *** JesseW has quit IRC (Ping timeout: 370 seconds) [08:05] *** Emcy_ has joined #archiveteam [08:05] *** consarnit has quit IRC (Remote host closed the connection) [08:13] *** rduser has quit IRC (Ping timeout: 370 seconds) [08:14] *** jut has joined #archiveteam [08:14] *** rduser has joined #archiveteam [08:18] *** Emcy has quit IRC (Read error: Operation timed out) [08:21] *** atomotic has joined #archiveteam [08:35] *** Honno_ has joined #archiveteam [08:41] *** fie has joined #archiveteam [08:43] *** fie has quit IRC (Remote host closed the connection) [08:43] *** fie has joined #archiveteam [08:44] *** fie_ has quit IRC (Ping timeout: 244 seconds) [08:47] *** arkiver3 has joined #archiveteam [08:48] *** Honno has quit IRC (Read error: Operation timed out) [08:56] *** W1nterFox has joined #archiveteam [08:57] *** WinterFox has quit IRC (Ping timeout: 1208 seconds) [09:04] *** arkiver3 has quit IRC (Ping timeout: 244 seconds) [09:05] *** consarnit has joined #archiveteam [09:09] *** ariscop has quit IRC (Leaving) [09:09] *** consarnit has quit IRC (Ping timeout: 244 seconds) [09:14] *** SN4T14 has quit IRC (Ping timeout: 370 seconds) [09:21] *** SN4T14 has joined #archiveteam [09:22] *** fie has quit IRC (Quit: Leaving) [09:27] *** fie has joined #archiveteam [09:32] *** SilSte has joined #archiveteam [10:00] https://torrentfreak.com/takedown-staydown-would-be-a-disaster-internet-archive-warns-160607/ [10:02] *** ariscop has joined #archiveteam [10:47] *** Honno__ has joined #archiveteam [10:56] ----------------------------------------------------- [10:56] A LITTLE BIRD TOLD ME TWEET TWEET GOOGLE GROUPS GONE WITHIN A YEAR [10:56] ----------------------------------------------------- [10:57] *** Honno_ has quit IRC (Read error: Operation timed out) [10:58] *** W1nterFox has quit IRC (Read error: Operation timed out) [10:59] So... plan accordingly [11:00] dashcloud: That thing's a broken mess [11:05] SketchCow: thanks- I'll take a look at it. [11:05] *** Emcy has joined #archiveteam [11:08] *** Honno has joined #archiveteam [11:09] At least we can start with a list of groups discovered in 2011. [11:10] -> https://archive.org/details/archiveteam-googlegroups?&sort=-publicdate [11:12] I think there's fundamental issues with the item. I got it to sort of boot and it was DLL city [11:18] *** Emcy_ has quit IRC (Read error: Operation timed out) [11:20] *** Honno__ has quit IRC (Read error: Operation timed out) [11:28] *** WinterFox has joined #archiveteam [11:30] *** Stiletto has joined #archiveteam [11:34] *** dcmorton has quit IRC (Ping timeout: 370 seconds) [11:36] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [11:36] *** dcmorton has joined #archiveteam [11:36] *** swebb sets mode: +o dcmorton [11:57] *** klg_ has joined #archiveteam [11:57] *** klg has quit IRC (Ping timeout: 370 seconds) [11:58] *** n00bLurke has joined #archiveteam [12:07] *** n00bLurke has quit IRC (n00bLurke) [12:07] *** RichardG has quit IRC (Read error: Connection reset by peer) [12:29] *** dcmorton has quit IRC (Ping timeout: 370 seconds) [12:32] *** BartoCH has quit IRC (Ping timeout: 260 seconds) [12:33] *** atomotic has joined #archiveteam [12:34] *** dcmorton has joined #archiveteam [12:34] *** swebb sets mode: +o dcmorton [12:39] *** BartoCH has joined #archiveteam [12:50] *** Aranje has quit IRC (Ping timeout: 260 seconds) [12:51] *** VADemon has joined #archiveteam [13:00] *** BlueMaxim has quit IRC (Quit: Leaving) [13:01] *** Aranje has joined #archiveteam [13:12] *** WinterFox has quit IRC (Remote host closed the connection) [13:22] *** BartoCH has quit IRC (Ping timeout: 260 seconds) [13:29] *** n00bLurke has joined #archiveteam [13:29] *** BartoCH has joined #archiveteam [13:37] *** BartoCH has quit IRC (Quit: WeeChat 1.5) [13:38] *** BartoCH has joined #archiveteam [14:20] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [14:29] *** hawc145 has quit IRC (Ping timeout: 370 seconds) [14:32] *** hawc145 has joined #archiveteam [14:34] *** jut_ has joined #archiveteam [14:37] *** jut has quit IRC (Read error: Operation timed out) [14:40] *** Start has quit IRC (Quit: Disconnected.) [14:43] *** HCross2 has quit IRC (Ping timeout: 260 seconds) [14:44] *** sigkell_ has quit IRC (Ping timeout: 260 seconds) [14:44] *** sigkell_ has joined #archiveteam [14:55] *** SN4T14 has quit IRC (Ping timeout: 370 seconds) [14:55] *** SN4T14 has joined #archiveteam [14:57] *** HCross2 has joined #archiveteam [15:15] *** VADemon has quit IRC (Ping timeout: 250 seconds) [15:26] *** VADemon has joined #archiveteam [15:27] *** Cameron_D has quit IRC (Ping timeout: 370 seconds) [15:27] *** Cameron_D has joined #archiveteam [15:32] *** Start has joined #archiveteam [15:48] *** JesseW has joined #archiveteam [16:03] *** Aranje has quit IRC (Quit: Three sheets to the wind) [16:04] *** sivoais_ has joined #archiveteam [16:04] *** sivoais has quit IRC (Ping timeout: 370 seconds) [16:07] *** Start has quit IRC (Quit: Disconnected.) [16:10] *** Aranje has joined #archiveteam [16:13] *** JesseW has quit IRC (Ping timeout: 370 seconds) [16:19] *** Start has joined #archiveteam [16:20] *** Start has quit IRC (Client Quit) [16:34] *** twrist has joined #archiveteam [16:48] *** GLaDOS has quit IRC (Read error: Operation timed out) [16:48] *** twrist is now known as GLaDOS [17:09] *** consarnit has joined #archiveteam [17:24] *** hawc145 is now known as HCross [17:36] *** Simpbra1 has quit IRC (Ping timeout: 370 seconds) [17:38] *** Cameron_D has quit IRC (Ping timeout: 370 seconds) [17:38] *** Cameron_D has joined #archiveteam [17:41] *** RichardG has joined #archiveteam [17:53] *** Simpbra1 has joined #archiveteam [18:13] *** consarnit has quit IRC () [18:22] *** Start has joined #archiveteam [18:30] *** Tomcat_ has joined #archiveteam [18:47] *** klg_ is now known as klg [19:01] *** winr5r has quit IRC (Read error: Operation timed out) [19:07] *** Start has quit IRC (Quit: Disconnected.) [19:09] *** Simpbra1 has quit IRC (Read error: Operation timed out) [19:11] *** Start has joined #archiveteam [19:14] *** jut has joined #archiveteam [19:16] *** jut_ has quit IRC (Read error: Operation timed out) [19:18] *** winr4r has joined #archiveteam [19:18] *** ranma is now known as madpent [19:19] *** madpent is now known as ranma [19:21] *** Simpbra1 has joined #archiveteam [19:23] *** jut has quit IRC (Quit: Leaving) [19:26] *** atomotic has joined #archiveteam [19:40] *** Start has quit IRC (Quit: Disconnected.) [19:56] *** maseck_ has joined #archiveteam [20:02] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [20:04] *** Honno has quit IRC (Ping timeout: 492 seconds) [20:07] *** maseck has quit IRC (Ping timeout: 1208 seconds) [20:24] *** Tomcat_ has quit IRC (Remote host closed the connection) [20:36] *** VADemon has quit IRC (Quit: left4dead) [20:36] *** schbirid has quit IRC (Quit: Leaving) [20:49] *** ariscop has quit IRC (Quit: Leaving) [21:05] *** tomwsmf-a has joined #archiveteam [21:07] *** pikhq has quit IRC (Ping timeout: 506 seconds) [21:16] *** n00bLurke has quit IRC (n00bLurke) [21:23] *** pikhq has joined #archiveteam [21:24] *** fie has quit IRC (Ping timeout: 244 seconds) [21:26] *** schbirid has joined #archiveteam [21:28] *** ariscop has joined #archiveteam [21:35] *** ris has joined #archiveteam [21:48] Let's get https://seene.co/ and google groups [21:48] :D [21:52] seene.co indeed looks pretty doable [21:58] *** schbirid has quit IRC (Quit: Leaving) [22:22] *** Pudsey has joined #archiveteam [22:23] Any word on the robots.txt issue with the blip archive? You could access it yesterday by adding www. to blip.tv but now even that gives robots.txt [22:28] *** Ravenloft has joined #archiveteam [22:39] arkiver: I think we got seene.co via archivebot yesterday. [22:39] all of it? [22:39] https://seene.co/u/zettlerm/ [22:39] https://seene.co/s/nXH5qs/ [22:39] for example [22:40] well, we'll need to wait till it posts to IA to check, but I think we got those, yes. [22:41] *** Pudsey has quit IRC (Remote host closed the connection) [23:02] *** Start has joined #archiveteam [23:06] *** ris has quit IRC () [23:58] *** xmc has quit IRC (Read error: Operation timed out)