[00:07] *** Aranje has joined #archiveteam-bs [00:17] *** GLaDOS has quit IRC (Write error: Broken pipe) [00:17] *** GLaDOS has joined #archiveteam-bs [00:40] *** superkuh has joined #archiveteam-bs [00:50] *** Start has joined #archiveteam-bs [01:18] looks like abc.net.au/news/2013 urls is at 68459 in wayback [01:41] *** RichardG has quit IRC (Ping timeout: 370 seconds) [02:00] sitemap urls of abc.net.au/news/2006 are all saved now [02:00] *** kristian_ has quit IRC (Quit: Leaving) [02:25] *** Dragon has quit IRC (Quit: Page closed) [03:20] *** Whopper has quit IRC (Ping timeout: 260 seconds) [03:20] *** Whopper has joined #archiveteam-bs [04:11] *** zhongfu has quit IRC (Ping timeout: 260 seconds) [04:12] *** zhongfu has joined #archiveteam-bs [04:12] eesh, why does Amazon Music need 200% CPU on a Macbook Air just to show a radial gradient [04:12] what the hell is this thing doing [04:16] *** zhongfu has quit IRC (Ping timeout: 260 seconds) [04:21] *** zhongfu has joined #archiveteam-bs [04:22] *** Sk1d has quit IRC (Ping timeout: 250 seconds) [04:22] typical amazon [04:24] maybe it's built from concurrent microservices that each render a single pixel of the gradient [04:24] Amazon Elastic Graphics Processing [04:25] might as well reinvent GPUs [04:26] *** zhongfu has quit IRC (Ping timeout: 260 seconds) [04:29] *** Sk1d has joined #archiveteam-bs [04:32] *** zhongfu has joined #archiveteam-bs [04:34] *** brayden_ is now known as brayden [04:58] Web pages consume more resources than Crysis [05:05] i'm uploading 5 episodes of Anti-Gravity Room [05:06] 4 episodes are from 1997 and on is from 1996 [05:10] *** Start has quit IRC (Read error: Connection reset by peer) [05:10] *** Start has joined #archiveteam-bs [05:23] *** Sk1d has quit IRC (Ping timeout: 194 seconds) [05:29] *** Sk1d has joined #archiveteam-bs [06:11] heh, the @archiveteam notifications feed is slowly filling with vines [06:19] *** GE has joined #archiveteam-bs [06:23] *** Start_ has joined #archiveteam-bs [06:23] *** Start has quit IRC (Read error: Connection reset by peer) [07:04] *** Yoshimura has quit IRC (Ping timeout: 255 seconds) [07:13] *** VADemon has quit IRC (Quit: left4dead) [07:35] *** paul_lelo has quit IRC (Ping timeout: 260 seconds) [07:36] *** paul_lelo has joined #archiveteam-bs [08:35] i'm uploading abc.net.au/news/2007 urls [08:36] also here are Anti Gravity Room episodes i have from 1997 : https://archive.org/details/Anti-Gravity_Room-1997-5-episodes [08:39] *** hawc145 is now known as HCross [09:07] *** SilSte has joined #archiveteam-bs [09:10] *** SilSte has quit IRC (Client Quit) [09:12] *** SilSte has joined #archiveteam-bs [09:16] *** SilSte has quit IRC (Client Quit) [09:23] *** SilSte has joined #archiveteam-bs [09:31] *** Kksmkrn has joined #archiveteam-bs [09:33] *** SilSte has quit IRC (Remote host closed the connection) [09:35] *** SilSte has joined #archiveteam-bs [09:46] *** Kksmkrn has quit IRC (Quit: Oh. I see.) [09:47] *** SilSte has quit IRC (Quit: No Ping reply in 180 seconds.) [09:48] *** SilSte has joined #archiveteam-bs [09:50] *** Kksmkrn has joined #archiveteam-bs [09:51] *** SilSte has quit IRC (Client Quit) [09:52] *** SilSte has joined #archiveteam-bs [09:58] *** Yoshimura has joined #archiveteam-bs [09:58] *** SilSte has quit IRC (Quit: No Ping reply in 180 seconds.) [10:07] *** SilSte has joined #archiveteam-bs [10:37] joepie91: if the script for grabbing vine are running and people can run a warrior [10:37] a warrior with the vine project* [10:37] maybe we can send a mail to tweakers about it? [10:37] we also had something on tweakers for hyves [10:37] https://tweakers.net/nieuws/117311/giphy-komt-met-tool-om-vine-videos-om-te-zetten-naar-gif.html [10:38] *** SilSte has quit IRC (Quit: No Ping reply in 180 seconds.) [10:39] *** SilSte has joined #archiveteam-bs [10:56] *** RichardG has joined #archiveteam-bs [10:57] *** SilSte has quit IRC (Quit: No Ping reply in 180 seconds.) [10:57] *** SilSte has joined #archiveteam-bs [11:15] *** yeoldetoa has joined #archiveteam-bs [11:20] *** RichardG has quit IRC (Ping timeout: 370 seconds) [11:22] *** SilSte has quit IRC (Quit: No Ping reply in 180 seconds.) [11:23] *** SilSte has joined #archiveteam-bs [11:36] *** BlueMaxim has quit IRC (Quit: Leaving) [11:55] *** RichardG has joined #archiveteam-bs [12:09] *** SilSte has quit IRC (Quit: No Ping reply in 180 seconds.) [12:09] *** SilSte has joined #archiveteam-bs [12:21] *** GE has quit IRC (Ping timeout: 255 seconds) [12:27] *** SilSte has quit IRC (Quit: No Ping reply in 180 seconds.) [12:27] *** SilSte has joined #archiveteam-bs [12:39] *** RichardG has quit IRC (Ping timeout: 244 seconds) [12:41] *** RichardG has joined #archiveteam-bs [12:43] *** SilSte has quit IRC (Remote host closed the connection) [12:43] *** SilSte has joined #archiveteam-bs [12:47] *** SilSte has quit IRC (Client Quit) [12:50] *** SilSte has joined #archiveteam-bs [13:01] *** SilSte has quit IRC (Quit: No Ping reply in 180 seconds.) [13:03] *** SilSte has joined #archiveteam-bs [13:11] *** SilSte has quit IRC (Quit: No Ping reply in 180 seconds.) [13:19] *** SilSte has joined #archiveteam-bs [13:30] *** SilSte has quit IRC (Remote host closed the connection) [13:33] *** SilSte has joined #archiveteam-bs [13:35] *** GE has joined #archiveteam-bs [13:38] *** SilSte has quit IRC (Remote host closed the connection) [13:42] arkiver: there's a button on the site for submitting news tips [13:44] Someone should attempt to put out unformal standard for robots.txt for archival purposes. [13:44] Like... allowing servers to advise concurrency, delay times, etc. [13:48] *** SilSte has joined #archiveteam-bs [13:57] *** SilSte has quit IRC (Quit: No Ping reply in 180 seconds.) [13:58] *** Yoshimura has quit IRC (Remote host closed the connection) [14:02] *** Yoshimura has joined #archiveteam-bs [14:16] *** SilSte has joined #archiveteam-bs [14:20] *** SilSte has quit IRC (Quit: No Ping reply in 180 seconds.) [14:27] *** SilSte has joined #archiveteam-bs [14:36] *** SilSte has quit IRC (Remote host closed the connection) [14:37] *** SilSte has joined #archiveteam-bs [14:46] *** SilSte has quit IRC (Quit: No Ping reply in 180 seconds.) [14:47] *** SilSte has joined #archiveteam-bs [14:51] *** SilSte has quit IRC (Client Quit) [14:54] *** SilSte has joined #archiveteam-bs [14:58] *** SilSte has quit IRC (Quit: No Ping reply in 180 seconds.) [14:58] *** SilSte has joined #archiveteam-bs [15:06] *** SilSte has quit IRC (Quit: No Ping reply in 180 seconds.) [15:07] *** SilSte has joined #archiveteam-bs [15:14] *** SilSte has quit IRC (Quit: No Ping reply in 180 seconds.) [15:18] *** SilSte has joined #archiveteam-bs [15:30] *** SilSte has quit IRC (Quit: No Ping reply in 180 seconds.) [15:31] *** BartoCH has quit IRC (Ping timeout: 260 seconds) [15:37] *** paul_lelo has quit IRC () [16:03] *** BartoCH has joined #archiveteam-bs [16:21] *** BartoCH has quit IRC (Ping timeout: 260 seconds) [16:25] *** BartoCH has joined #archiveteam-bs [16:35] *** kristian_ has joined #archiveteam-bs [16:46] arkiver: so vine urls that are tweeted at @archiveteam are going to be fed into warrior? [16:46] They are going to be fed somewhere. [16:47] Most likely to discovery, which qualifies them automatically for the fetch run. [16:48] edsu: Also #vinewhine [16:51] nice, ok [16:53] would it be useful to start collecting all tweets that have a vine.co link? [16:53] or would that be too noisy? [16:54] theres a bot doing it I think [17:08] *** VADemon has joined #archiveteam-bs [17:11] oh, cool -- i was going to offer to help with that, but i guess it is well in hand, nice work! [17:12] *** VADemon_ has joined #archiveteam-bs [17:15] *** Honno has joined #archiveteam-bs [17:18] *** VADemon has quit IRC (Read error: Operation timed out) [17:27] edsu: I don't think we have a bot runnig yet that collects all tweets containing a vine video [17:27] just those that amention @archiveteam [17:27] so I think that would be very welcome [17:27] (to also collect all tweets mentioning a vine) [17:27] yipdw: is this correct? ^ [17:40] Is there a place where all older tweets are available? [17:40] you can get at the last week through the search api [17:40] I could parse realtime stuff, but have no idea where to get old. [17:41] Yeah, but what I mentioned earlier. URL shorteners. [17:41] here's a quick thing i put together that will look every 60 seconds for tweets that have links to vine.co, write them to stdout [17:42] https://gist.github.com/edsu/88bb252cae8731a17a503d401bba48c4 [17:43] twitter's search api intrdouced a url parameter fairly recently that is useful for this [17:44] Yeah, I just read the docs [17:44] https://www.hitchhq.com/twitter/activities/57f24975224ae0ce476cbb18#change-detail-1 [17:44] oh, cool [17:44] I hate having to register to get API keys. [17:44] yeah... [17:45] edsu: Why not just filter:vine ? [17:46] i think you could do that [17:46] the volume right now is quite high, so it might be harder to keep on top of, with the rate limits [17:47] seeing about 4000 vine.co urls per minute [17:47] not unique though [17:47] Also since_id [17:48] yes, my script uses since_id [17:49] here's some urls from running for a few minutes https://gist.github.com/edsu/5cd28ec059ca6d35c7a12c0e52610595 [17:49] They changed twitter look. its terrible. [17:49] first number is the number of times it was mentioned [17:49] Let's move to PM? [17:50] sure [18:19] *** chungo has joined #archiveteam-bs [18:19] *** chungo has left [18:24] edsu: yes, that's correct [18:24] I don't want to collect all tweets, partially because we can't [18:25] unless the firehose API changed [18:25] which I guess it did [19:10] firehose hasn't changed, but you can use the search api with the url parameter to find them, as long as the volume doesn't get so high you can keep up with the rate limits [19:11] ah [19:13] upper limit is like 1.7M tweets per day [19:13] which might be enough to stay on top of the vines [19:13] They got 15 minute windows though [19:14] Sure, it should. Not even that many published daily, else there would be 350million vines per year [19:58] *** ndizzle has joined #archiveteam-bs [20:00] *** ndizzle has quit IRC (Read error: Connection reset by peer) [20:00] *** ndiddy has quit IRC (Read error: Connection reset by peer) [20:01] *** ndizzle has joined #archiveteam-bs [20:04] *** ndizzle has quit IRC (Client Quit) [20:04] *** ndiddy has joined #archiveteam-bs [20:09] *** ndizzle has joined #archiveteam-bs [20:11] *** jrwr has joined #archiveteam-bs [20:16] *** ndiddy has quit IRC (Read error: Operation timed out) [20:32] *** kyounko has quit IRC (Ping timeout: 260 seconds) [20:34] *** kyounko has joined #archiveteam-bs [20:37] *** RichardG has quit IRC (Ping timeout: 633 seconds) [21:06] *** BartoCH has quit IRC (Ping timeout: 260 seconds) [21:07] *** BartoCH has joined #archiveteam-bs [21:07] *** RichardG has joined #archiveteam-bs [21:11] *** mistym has joined #archiveteam-bs [21:13] *** Yoshimura has quit IRC (Remote host closed the connection) [21:21] *** Yoshimura has joined #archiveteam-bs [21:33] *** RichardG_ has joined #archiveteam-bs [21:33] *** RichardG has quit IRC (Ping timeout: 255 seconds) [21:37] *** RichardG_ has quit IRC (Ping timeout: 250 seconds) [21:41] *** sep332 has quit IRC (konversation out) [21:47] so i'm uploading 118k urls for abc.net.au/news/2008 [21:48] looks like there are 404 errors from incomplete urls in this year [21:48] but luckly there is only 20 of them [21:48] the rest are 404 image urls that are grabbed by --mirror command [21:55] *** RichardG has joined #archiveteam-bs [21:57] *** BlueMaxim has joined #archiveteam-bs [22:01] *** RichardG_ has joined #archiveteam-bs [22:05] *** RichardG has quit IRC (Ping timeout: 370 seconds) [22:10] *** RichardG_ has quit IRC (Read error: Operation timed out) [22:33] *** Yoshimura has quit IRC (Ping timeout: 255 seconds) [23:07] *** GE has quit IRC (Quit: zzz) [23:23] *** Honno has quit IRC (Read error: Operation timed out) [23:24] *** Guest56 has joined #archiveteam-bs [23:24] *** Guest56 has quit IRC (Client Quit) [23:28] *** etudier has joined #archiveteam-bs [23:34] *** kristian_ has quit IRC (Quit: Leaving) [23:50] *** ndizzle is now known as ndiddy [23:56] *** BartoCH has quit IRC (Ping timeout: 260 seconds)