[00:03] *** antomatic has joined #archiveteam-bs [00:05] *** antomati_ has quit IRC (Read error: Operation timed out) [00:38] *** GE has quit IRC (Remote host closed the connection) [00:52] *** BlueMaxim has quit IRC (Read error: Operation timed out) [02:26] so i found a youtube user saved a ton of WABC and WCBS from march 1993 6pm news broadcast [02:52] * Somebody2 is grumpy about UNESCO's Open Access policy... [02:53] They license all the *text* they write under CC BY-SA -- but they don't *distribute* files consisting of only that text. [02:54] Instead, they distribute files *ALSO* including various graphics and images from others, *NOT* licensed under any open license. [02:54] Thereby making it impossible for others to legally mirror what they distribute! [02:59] *** Silvan has joined #archiveteam-bs [03:00] *** SilSte has quit IRC (Read error: Operation timed out) [03:18] *** ndiddy has quit IRC () [03:29] Or at least, highly impractical. [04:01] yes, impossible to use the license they automatically grant [04:10] *** bwn has quit IRC (Read error: Operation timed out) [04:11] *** pizzaiolo has quit IRC (Remote host closed the connection) [04:13] *** bwn has joined #archiveteam-bs [04:35] *** Sk1d has quit IRC (Ping timeout: 194 seconds) [04:41] *** Sk1d has joined #archiveteam-bs [05:43] *** Aranje has quit IRC (Quit: Three sheets to the wind) [06:09] Yik Yak is probably going to shut down soon! We need to find a way to archive stuff from it soon. [06:10] that's that silly app for posting anonymous notes about your local area right? [06:11] Yeah. They also have a Regional section and some other stuff. [06:23] *** Nyx has quit IRC (Ping timeout: 260 seconds) [06:38] *** BlueMaxim has joined #archiveteam-bs [06:56] *** BlueMaxim has quit IRC (Read error: Operation timed out) [06:56] *** BlueMaxim has joined #archiveteam-bs [07:14] *** Nyx has joined #archiveteam-bs [07:26] *** odemg has quit IRC (Remote host closed the connection) [07:28] *** schbirid has joined #archiveteam-bs [07:42] *** odemg has joined #archiveteam-bs [07:43] *** JAA has joined #archiveteam-bs [07:47] *** GE has joined #archiveteam-bs [08:03] *** Jonison has joined #archiveteam-bs [08:17] *** Honno has joined #archiveteam-bs [08:18] *** schbirid2 has joined #archiveteam-bs [08:22] *** schbirid has quit IRC (Read error: Operation timed out) [08:57] *** username1 has joined #archiveteam-bs [09:02] *** schbirid2 has quit IRC (Read error: Operation timed out) [09:16] *** schbirid2 has joined #archiveteam-bs [09:19] *** username1 has quit IRC (Read error: Operation timed out) [09:21] so i'm manually grab the RedEye Magazine [09:22] from chicago tribune [09:37] *** username1 has joined #archiveteam-bs [09:43] *** schbirid2 has quit IRC (Read error: Operation timed out) [09:56] *** schbirid2 has joined #archiveteam-bs [10:00] *** username1 has quit IRC (Read error: Operation timed out) [10:00] *** edsu has joined #archiveteam-bs [10:01] *** pnJay has quit IRC (Leaving) [10:02] *** GE has quit IRC (Remote host closed the connection) [10:24] *** username1 has joined #archiveteam-bs [10:27] *** schbirid2 has quit IRC (Read error: Operation timed out) [10:51] *** JAA has quit IRC (Quit: Page closed) [10:53] *** schbirid2 has joined #archiveteam-bs [10:57] *** username1 has quit IRC (Read error: Operation timed out) [11:15] so i found a way to grab the RedEye Magazine [11:16] i just had to login using facebook then run httpfox when downloading a pdf [11:16] you can some api.readoz.com urls [11:17] the first /download/ url has the authorization data in it [11:28] *** icedice has joined #archiveteam-bs [12:11] so i found a way to grab all the readoz.com ids for a channel [12:14] *** username1 has joined #archiveteam-bs [12:18] *** schbirid2 has quit IRC (Read error: Operation timed out) [12:37] *** odemg has quit IRC (Remote host closed the connection) [12:40] *** schbirid2 has joined #archiveteam-bs [12:45] *** username1 has quit IRC (Read error: Operation timed out) [12:53] *** pizzaiolo has joined #archiveteam-bs [13:08] *** username1 has joined #archiveteam-bs [13:11] *** schbirid2 has quit IRC (Read error: Operation timed out) [13:14] *** odemg has joined #archiveteam-bs [13:16] *** odemg has quit IRC (Remote host closed the connection) [13:27] *** schbirid2 has joined #archiveteam-bs [13:30] *** username1 has quit IRC (Read error: Operation timed out) [13:31] *** icedice has quit IRC (Quit: Leaving) [13:45] *** kniffy has quit IRC (Ping timeout: 240 seconds) [13:48] *** username1 has joined #archiveteam-bs [13:51] *** schbirid2 has quit IRC (Read error: Operation timed out) [13:56] *** pnJay has joined #archiveteam-bs [14:00] *** kniffy has joined #archiveteam-bs [14:07] *** BlueMaxim has quit IRC (Read error: Operation timed out) [14:10] *** odemg has joined #archiveteam-bs [14:24] *** schbirid2 has joined #archiveteam-bs [14:27] *** username1 has quit IRC (Read error: Operation timed out) [14:29] *** tuluut has joined #archiveteam-bs [14:40] *** Nume has joined #archiveteam-bs [14:40] hello~ [14:41] [17:35] I guess give webarchiveplayer some time [17:35] 140 GB is big [17:36] you can also browse it in the wayback machine [14:41] hi [14:41] so about this [14:41] #archiveteam is more for announcements [14:41] oh I see [14:41] Nume: I might be able get the stories for you [14:41] arkiver: this is #archiveteam-bs [14:41] yes [14:42] (see #archiveteam) [14:42] Aoede, really? [14:42] I would be so grateful [14:42] oh, nvm [14:48] *** username1 has joined #archiveteam-bs [14:52] *** schbirid2 has quit IRC (Read error: Operation timed out) [14:57] Nume, where did you get the WARC file? [14:59] archive.org [15:01] I was more checking to see if I was the uploader, in which case I could help you out. :) [15:01] Also note that any WARC file properly uploaded on archive.org will show up in the WayBack Machine. [15:02] (unless robots.txt or otherwise excluded, right?) [15:03] (or am I mistaken about that?) [15:03] Correct. robots.txt will exclude. [15:04] I got this [15:04] Aoede already agreeded to help me find the stories I needed, but thank you a lot for your help as well! [15:05] I'm more asking for my own benefit, but yeah [15:05] I might be but wrong but I think robots.txt blocked fanfiction [15:05] if that's the correct term even, I am a bit confused by this whole web archive thing ^^ [15:10] *** schbirid2 has joined #archiveteam-bs [15:13] *** username1 has quit IRC (Read error: Operation timed out) [15:14] r/PLACE snapshots are now added: https://archive.org/details/PLACE-SNAPSHOTS [15:14] yooooo. that's cool [15:15] I am also working on a datafile that tracks diffs, since a lot of devs/artists are using the snapshots for cool things. [15:19] very cool. Reminded me of Drawball [15:19] diffs would be awesome [15:19] Any idea if reddit plans to release the raw data? [15:20] No, but I have 10 second snapshots being uploaded next that diff data was drawn from. [15:20] So that is probably the best we are going to get. [15:20] ah i see, awesome [15:21] hook54321: I asked around a little and unfortunately Yik Yak does not have a public API we could use for archiving them. Going to have to find another way. [15:32] Woah, wait, what's going on with yik yak? [15:33] Also, don't messages on Yik Yak disappear quickly anyway? It's good to grab a snapshot but I'm not sure how much there is there to archive. [15:38] *** username1 has joined #archiveteam-bs [15:41] *** schbirid2 has quit IRC (Read error: Operation timed out) [15:48] *** GE has joined #archiveteam-bs [16:03] *** schbirid2 has joined #archiveteam-bs [16:07] *** username1 has quit IRC (Read error: Operation timed out) [16:20] they disappear after a few days, and they're only visible a mile or so from where they were posted [16:21] Plus they stripped the anonymity. [16:21] Not touching that with a 100 ft pole. [16:21] *** xmc sets mode: +oooo midas HCross2 Lord_Nigh Sanqui [16:21] *** xmc sets mode: +oooo yipdw balrog arkiver swebb [16:21] *** swebb sets mode: +o DFJustin [16:21] *** swebb sets mode: +o SadDM [16:21] *** swebb sets mode: +o antomatic [16:21] *** swebb sets mode: +o brayden [16:21] *** swebb sets mode: +o edsu [16:21] *** xmc sets mode: +oooo chfoo chazchaz godane DFJustin [16:21] *** xmc sets mode: +o schbirid2 [16:22] (I hope your automatic op scripts check host and not just username. :P) [16:22] that wasn't an auto op, that was me scrolling thru the userlist [16:22] so ... not really [16:22] I was refering to swebb. :) [16:23] oh, yeah, swebb's do [16:31] *** username1 has joined #archiveteam-bs [16:35] *** schbirid2 has quit IRC (Read error: Operation timed out) [16:39] Snapshot diffs uploaded. https://archive.org/details/PLACE-SNAPSHOT-DIFFS [16:40] kool [16:41] Once the archivebot jobs finish I think we are 100% on archival for r/PLACE. [16:45] joepie91: my NorthHosts hardware is finally on the way back [17:00] *** schbirid2 has joined #archiveteam-bs [17:02] *** username1 has quit IRC (Read error: Operation timed out) [17:13] HCross2: could have gone worse [17:13] :p [17:14] Microsoft is shutting down literally all of their research division projects. First it was so.cl, now it is their open source repos. [17:14] Jesus. [17:15] joepie91: had a bit of a go at Jon.. and he's packaged and sent it all for free [17:32] rocode: awesome :) [17:35] *** hook54321 has quit IRC (Ping timeout: 244 seconds) [17:36] *** tammy_ has quit IRC (Ping timeout: 244 seconds) [17:37] *** tammy_ has joined #archiveteam-bs [17:46] *** hook54321 has joined #archiveteam-bs [17:54] nightpool: Yik Yak is shutting down [17:54] supposedly [17:57] *** odemg has quit IRC (Remote host closed the connection) [17:58] K4k: As far as i know, they haven't officially said that is. [17:59] It's been in the rumor mill for ~6 months at least. [18:02] *** hook54321 has quit IRC (Ping timeout: 244 seconds) [18:04] *** tuluut has quit IRC (Ping timeout: 244 seconds) [18:04] *** tuluut has joined #archiveteam-bs [18:04] *** JAA has joined #archiveteam-bs [18:07] *** hook54321 has joined #archiveteam-bs [18:08] K4k: There's a web interface [18:12] https://www.yikyak.com/yak/R/581b8281266910ccd6282f43cc10f [18:13] the wayback machine either doesn't save or doesn't show the replies. archive.is doesn't either. [18:20] SFF.net has updated their robots.txt to allow our archives to be browsed on the wayback machine. We should be good now. [18:43] *** pizzaiolo has quit IRC (Ping timeout: 245 seconds) [18:47] *** pizzaiolo has joined #archiveteam-bs [18:53] *** odemg has joined #archiveteam-bs [18:54] *** Nume has left [19:00] *** ndiddy has joined #archiveteam-bs [19:03] *** JensRex has quit IRC (Remote host closed the connection) [19:04] *** JensRex has joined #archiveteam-bs [19:13] *** username1 has joined #archiveteam-bs [19:18] *** schbirid2 has quit IRC (Read error: Operation timed out) [19:20] rocode: also just caused sublime to baloon to 8.5GB of RAM use before it crashed [19:21] wtf is researchgate [19:27] *** wowaname has joined #archiveteam-bs [19:30] rocode: ResearchGate is a social networking site for scientists and researchers to share papers, ask and answer questions, and find collaborators [19:31] scientific data silo :( https://www.researchgate.net/ [19:31] All you need to sign up is a .edu email address or an invite [19:31] silo? [19:31] i have a .edu email address :) [19:31] thanks to my alma mater never retiring them [19:31] hook54321, looks like different context according to https://en.wikipedia.org/wiki/Sci-Hub [19:32] nah, rg hosts papers as well [19:32] but you cannot scrape much [19:32] *** schbirid2 has joined #archiveteam-bs [19:32] xmc: what's an alma mater? [19:32] fuck this isp [19:32] the school i went to, hook54321 [19:32] might be just an american term [19:33] I'm in the US... [19:33] https://en.wikipedia.org/wiki/Alma_mater [19:33] username1: iirc you can access PDFs even if you aren't logged in. [19:33] it's a common term. [19:33] However you need to have the link [19:34] to the pdf [19:34] yes maybe [19:34] not wanna discuss, sorry [19:35] *** username1 has quit IRC (Read error: Operation timed out) [19:35] rocode: I don't see anything in the scihub article about ResearchGate [19:36] I am so confused at this point I am just going to stop. [19:38] sci-hub = site to pirate papers [19:38] rg = "social network" for researchers, including huge paper collection [19:38] sci-hub was showing they had much more papers than rg [19:38] EOS [19:38] for authors to share their papers [19:38] oh [19:39] people upload lots of not-their-own papers [19:39] *** GE has quit IRC (Remote host closed the connection) [19:39] on rg or sci-hub? [19:39] rg [19:40] I thought that it doesn't let people do that [19:40] *** dzl has joined #archiveteam-bs [19:40] heh. someone responded to my request to their article with this: "Sorry, I am unable to share my full-text because I don't know if I have permission to." [19:43] Science! [19:45] *** PyrEx has joined #archiveteam-bs [19:47] *** GE has joined #archiveteam-bs [20:10] Luckily, open access journals are becoming more and more common. [20:19] *** schbirid2 has quit IRC (Quit: Leaving) [20:27] *** Honno has quit IRC (Quit: Leaving) [20:33] *** pnJay has quit IRC (Quit: Leaving) [20:43] *** sep332 has joined #archiveteam-bs [20:45] *** Jonison has quit IRC (Read error: Connection reset by peer) [20:45] *** sep332_ has quit IRC (Read error: Operation timed out) [20:55] *** icedice has joined #archiveteam-bs [21:00] *** pnJay has joined #archiveteam-bs [21:11] *** odemg has quit IRC (Remote host closed the connection) [21:12] *** odemg has joined #archiveteam-bs [21:30] MLKSHK is live. http://archiveteam.org/index.php?title=MLKSHK [21:30] Warriors needed etc. [21:31] JensRex, I will race you to a TB on the mlkshk project! :) [21:31] although i have a headstart <_< [21:32] Any idea yet how many threads are safe? [21:32] They don't seem to be throttling. [21:33] im running 10 on all my warriors so far. No problems so far [21:36] * JAA queues Mr. Burns's "egg salad". [22:05] I was moaning about gmane.org the other day, so FWIW: it looks like their archives are alive and well if you access news.gmane.org via NNTP, it's just the web front end that's not usable. [22:05] (This may be news to nobody) [22:06] excellent, i assumed as much but didn't bother to check [22:10] *** odemg has quit IRC (Remote host closed the connection) [22:13] *** odemg has joined #archiveteam-bs [22:15] *** JAA has quit IRC (Quit: Page closed) [22:16] rocode: channel? [22:16] #totheyard [22:17] *** odemg has quit IRC (Remote host closed the connection) [22:30] *** odemg has joined #archiveteam-bs [22:42] *** GE has quit IRC (Remote host closed the connection) [22:45] *** pizzaiol1 has joined #archiveteam-bs [22:52] *** pizzaiolo has quit IRC (Remote host closed the connection) [23:00] *** BlueMaxim has joined #archiveteam-bs [23:34] *** pizzaiol1 has quit IRC (Read error: Operation timed out)