[00:04] *** signius has joined #archiveteam [01:08] [18:29] [18:34:54] Is there something other than the WaybackMachine I can check for removed/expired pastebins? [01:08] do you have a date? [01:08] or a keyword that it contains [01:27] *** mistym has joined #archiveteam [01:47] *** primus104 has quit IRC (Leaving.) [02:07] *** mst_ has joined #archiveteam [02:33] *** Ymgve has quit IRC () [03:09] http://techcrunch.com/2015/03/08/game-over/ [03:16] *** Jonimus has joined #archiveteam [03:19] *** Froggypwn has quit IRC (Read error: Connection reset by peer) [03:24] *** Froggypwn has joined #archiveteam [03:41] is there some kind of archiving going on of blendswap? [03:42] they apparently introduced paid plans a while ago, and now free accounts are limited to 200MB/mo... [03:44] *** xmc has quit IRC (Ping timeout: 512 seconds) [03:51] *** xmc has joined #archiveteam [03:51] *** swebb sets mode: +o xmc [03:56] *** Froggypwn has quit IRC (Read error: Connection reset by peer) [03:57] *** mistym has quit IRC (Remote host closed the connection) [03:57] *** Froggypwn has joined #archiveteam [04:02] *** mst_ has quit IRC (Quit: bye) [04:23] *** mistym has joined #archiveteam [04:44] *** Froggypwn has quit IRC (Read error: Connection reset by peer) [04:44] *** Froggypwn has joined #archiveteam [05:16] *** mistym has quit IRC (Remote host closed the connection) [05:50] *** mistym has joined #archiveteam [06:02] *** Marco__ has joined #archiveteam [06:03] *** Marco__ has quit IRC (Client Quit) [06:15] *** codinghor has joined #archiveteam [06:16] *** codinghor has quit IRC (Client Quit) [06:35] *** mistym has quit IRC (Remote host closed the connection) [07:55] *** test_ has quit IRC (Quit: Page closed) [07:59] *** primus104 has joined #archiveteam [08:26] *** underscor has quit IRC (Ping timeout: 370 seconds) [08:26] *** underscor has joined #archiveteam [08:26] *** swebb sets mode: +o underscor [08:38] joepie91_: thanks, I don't need it any more (redid the contents) [09:28] xmc: nothing that won't need anything special from me [09:29] *** rolfb has joined #archiveteam [10:46] *** BlueMaxim has quit IRC (Read error: Connection reset by peer) [11:32] *** Ymgve has joined #archiveteam [11:32] *** dashcloud has quit IRC (Read error: Connection reset by peer) [11:35] *** dashcloud has joined #archiveteam [12:17] *** primus104 has quit IRC (Leaving.) [12:43] *** primus104 has joined #archiveteam [12:51] *** rolfb_ has joined #archiveteam [12:56] *** Ruairi_CO has joined #archiveteam [13:00] *** RuairiCOL has quit IRC (Ping timeout: 370 seconds) [13:06] *** rolfb has quit IRC (Remote host closed the connection) [13:18] *** sankin has joined #archiveteam [13:50] *** primus104 has quit IRC (Leaving.) [13:52] *** rolfb_ has quit IRC (Leaving...) [13:53] *** dserodio has quit IRC (Excess Flood) [13:54] *** dashcloud has quit IRC (Read error: Connection reset by peer) [13:57] *** dserodio has joined #archiveteam [14:02] *** dashcloud has joined #archiveteam [14:11] *** rolfb has joined #archiveteam [14:18] *** Emcy has joined #archiveteam [14:20] *** Emcy_ has quit IRC (Ping timeout: 370 seconds) [14:27] *** Start has quit IRC (Disconnected.) [14:28] *** dserodio has quit IRC (Read error: Operation timed out) [14:30] *** Emcy_ has joined #archiveteam [14:32] *** dserodio has joined #archiveteam [14:37] *** Emcy has quit IRC (Ping timeout: 512 seconds) [14:37] *** Emcy_ has quit IRC (Ping timeout: 370 seconds) [14:49] *** mistym has joined #archiveteam [14:53] ok [14:56] *** mistym has quit IRC (Remote host closed the connection) [15:03] *** Start has joined #archiveteam [15:12] *** mistym has joined #archiveteam [15:14] *** rolfb has quit IRC (Leaving...) [15:33] *** is- has quit IRC (Quit: leaving) [15:57] *** Start_ has joined #archiveteam [15:57] *** Start has quit IRC (Read error: Connection reset by peer) [16:02] *** robink has quit IRC (Ping timeout: 492 seconds) [16:09] *** Start_ has quit IRC (Ping timeout: 370 seconds) [16:10] *** robink has joined #archiveteam [16:14] *** Start has joined #archiveteam [16:16] *** robink has quit IRC (Remote host closed the connection) [16:16] *** Start has quit IRC (Read error: Connection reset by peer) [16:17] *** Start has joined #archiveteam [16:22] *** signius has quit IRC (Ping timeout: 306 seconds) [16:22] *** mistym has quit IRC (Remote host closed the connection) [16:31] *** primus104 has joined #archiveteam [16:35] *** signius has joined #archiveteam [16:40] *** abartov has joined #archiveteam [16:46] *** Start has quit IRC (Disconnected.) [16:57] *** Start has joined #archiveteam [17:00] *** Start has quit IRC (Read error: Connection reset by peer) [17:04] *** Start has joined #archiveteam [17:08] *** dashcloud has quit IRC (Read error: Operation timed out) [17:11] *** dashcloud has joined #archiveteam [17:16] *** lag2 has joined #archiveteam [17:17] *** chfoo has quit IRC (Ping timeout: 512 seconds) [17:20] *** chfoo has joined #archiveteam [17:30] *** sankin1 has joined #archiveteam [17:31] *** sankin has quit IRC (Read error: Operation timed out) [17:34] *** rolfb has joined #archiveteam [17:34] *** rolfb has quit IRC (Client Quit) [17:37] *** sankin has joined #archiveteam [17:39] *** sankin1 has quit IRC (Read error: Operation timed out) [17:45] *** Start_ has joined #archiveteam [17:47] *** Start has quit IRC (Read error: Connection reset by peer) [17:47] *** Start_ is now known as Start [17:49] *** sankin has quit IRC (Ping timeout: 600 seconds) [17:49] *** dashcloud has quit IRC (Read error: Operation timed out) [17:49] *** sankin has joined #archiveteam [17:50] *** dashcloud has joined #archiveteam [18:03] *** sankin has quit IRC (Read error: Operation timed out) [18:17] *** Jonimus has quit IRC (Ping timeout: 370 seconds) [18:21] *** bzc6p has joined #archiveteam [18:22] The tracker's Munin graphs don't work since today [18:22] http://zeppelin.xrtc.net/corp.xrtc.net/shilling.corp.xrtc.net/index.html#archiveteam [18:25] Also, swebb's (?) IRC logs don't work either since Feb 24 [18:25] (The latter my have already been reported; if so, sorry for the duplicate) [18:25] *may [18:25] bzc6p: chfoo has logs too [18:25] I know [18:26] bzc6p: thanks for reminding me. it rebooted. i think nginx doesn't start on boot. fixed now. [18:26] * xmc bad sysadmin [18:26] Thanks. [18:26] Never mind [18:26] np [18:27] *** sankin has joined #archiveteam [18:29] *** bzc6p has left [18:37] *** dashcloud has quit IRC (Read error: Operation timed out) [18:37] *** Start has quit IRC (Disconnected.) [18:41] *** dashcloud has joined #archiveteam [18:42] *** Start has joined #archiveteam [18:43] *** Rotab has quit IRC (Read error: Connection reset by peer) [18:43] *** Start has quit IRC (Read error: Connection reset by peer) [18:43] *** Start has joined #archiveteam [18:50] *** Rotab has joined #archiveteam [18:55] bzc6p: Oops. Sorry. I'll fix it now. [18:56] *** atlogbot has joined #archiveteam [18:57] bzc6p: Fixed. Thanks for the heads up. [19:19] *** Emcy has joined #archiveteam [19:20] *** Start has quit IRC (Disconnected.) [19:36] *** Start has joined #archiveteam [19:37] *** dashcloud has quit IRC (Read error: Operation timed out) [19:40] *** dashcloud has joined #archiveteam [19:49] *** techapj has quit IRC (Quit: Page closed) [20:03] *** Start has quit IRC (Disconnected.) [20:05] *** dashcloud has quit IRC (Read error: Operation timed out) [20:09] *** mistym has joined #archiveteam [20:10] *** dashcloud has joined #archiveteam [20:12] *** Start has joined #archiveteam [20:14] *** Start has quit IRC (Client Quit) [20:19] *** gibigiana has quit IRC (Ping timeout: 265 seconds) [20:19] *** gibigiana has joined #archiveteam [20:20] I'm going to be trying to clear out FOS of projects again, mostly because a few have puffed up. (Many have not.) [20:21] testflight, for example [20:30] Yes, testflight is finished [20:35] *** cbb has joined #archiveteam [20:44] *** mistym has quit IRC (Remote host closed the connection) [20:58] *** sankin has quit IRC (Leaving.) [20:59] *** mistym has joined #archiveteam [21:05] *** mistym has quit IRC (Remote host closed the connection) [21:06] *** mistym has joined #archiveteam [21:16] *** dashcloud has quit IRC (Read error: Operation timed out) [21:20] *** dashcloud has joined #archiveteam [21:30] *** khaoohs_ has quit IRC (Ping timeout: 606 seconds) [21:41] *** cbb has quit IRC (Quit: cbb) [21:44] *** Jonimus has joined #archiveteam [21:47] *** Nertsy has joined #archiveteam [21:55] *** schbirid has joined #archiveteam [21:55] just a quick yes/no question: there are tools to extract the "files" from a warc.gz, correct? [21:56] yes [21:56] thanks [22:00] *** schbirid has quit IRC (Client Quit) [22:12] *** mistym has quit IRC (Remote host closed the connection) [22:12] *** cbb has joined #archiveteam [22:17] *** Start has joined #archiveteam [22:21] *** mistym has joined #archiveteam [22:21] *** Sk1d has joined #archiveteam [22:51] *** rolfb has joined #archiveteam [22:51] *** lag2 has quit IRC (Ping timeout: 512 seconds) [22:51] *pokes SketchCow* [23:08] i guess i just missed [23:28] Say what you want to say instead of pinging, if you have something to you want to say :) [23:28] IRC class 101 [23:29] ersi: yeah, i know, but he emailed me some days ago hoping he would be here at the same time as me [23:30] and here I am staying up past my bedtime ... ;-) [23:30] Burp [23:30] Why is it important I be on here versus e-mail? [23:30] this way we can misunderstand each other more quickly [23:30] SketchCow: i don't know, you said you hoped to find me here [23:30] i'm ok with email :) [23:30] [00:30] this way we can misunderstand each other more quickly [23:30] framing that one [23:31] Well, all of these maniacs have as much access, skill and power as I can bring to the table. [23:31] SketchCow: it's about Gitorious [23:31] I know. [23:31] Oh I know [23:31] I'm already sensing that this will be way more confusing than email :-) [23:32] There are two possibilities here. [23:32] 1. Pack up all the "Gitorious Data" and send it to me on a drive. [23:32] 2. Let some ad-hoc gang of the Archive Team scour Gitorious and pull it into the Wayback. [23:32] you don't want us to massage it into nice IA items for you? [23:33] Not particularly, no. I'd rather a big-ass tar file. [23:33] well, I guess option 1 is best then [23:34] SketchCow: why would you not prefer individual items? [23:34] I would love, while we're here, to hear the earth shattering, justifiable reason for shutting down gitorious just because another company acquired it. [23:34] Spoiler: There is none [23:34] amen [23:34] Extra bonus if you use one of the following two phrases. [23:35] it's not shutting down because it was acquired [23:35] 1. "Let me speak to you privately and I'll explain." (money, money, money, money and money) [23:35] 2. "It's a very good reason" (No) [23:35] * joepie91_ grabs the bingo sheet [23:35] *snerk* [23:36] You could make it static, and then let it slowly drift into transition into other services. [23:36] I mean, that's not hard. [23:36] xmc: Individual items? One per git project, loaded into IA, where it's not really going to make sense? No. [23:36] I'd rather get a big fat tar and then some genius can do "the right thing" [23:36] what do you mean static? we're hosting git repositories [23:36] all right [23:37] Make them read-only [23:37] All of them [23:37] People will leave, and migrate, to be sure [23:38] somewhat unrelatedly, i've got a repo in my github account that i converted from a darcs repo that wayback indexed like ten years ago https://github.com/chronomex/bencode-erlang [23:39] Holy moly [23:39] what about the ones that are "dead", and wouldn't migrate but still have historic. well, may have, historic value? [23:40] rolfb: right. why would you turn off the urls that they're at. [23:40] *** the_fox has joined #archiveteam [23:40] Those are exactly the ones that perfect to be static [23:40] And stay up at that domain [23:40] mmh, smashing tombstones~ [23:40] who would keep it running? [23:41] rolfb: for a static copy of something, "keeping it running" doesn't really mean much more than replacing a HDD when it does [23:41] dies * [23:41] *** OtherFox has quit IRC (Ping timeout: 606 seconds) [23:41] and inserting money into dns [23:42] joepie91_: and bandwidth [23:42] :) [23:42] rolfb: bandwidth is cheap [23:42] (money, money, money, money and money) [23:42] i'll buy the domainname off of you for whatever the transfer fee is and we can make sure things stay around [23:43] Anyway, rolfb, you got your chat with me. E-mail me about sending a drive with all the data. [23:44] SketchCow: thanks, i'll look into the ideas presented here as well [23:44] If you want to stick around and get poked in the cute little nose, feel free. [23:44] rolfb: realistically, depending on your upstreams, a datacenter is going to charge you anywhere from $0.30 - $5 per TB of traffic [23:44] But if you do.... [23:44] advice, do not try and educate Archive Team on "the way the world works" [23:44] not including bandwidth/uplink [23:44] SketchCow: thanks, I won't. I'm not proud of this [23:44] usually around $0.50-$5 per mbps on the cheaper end, on 95th percentile billing [23:45] in other words: you can run a static copy off pocket money [23:45] *** dashcloud has quit IRC (Read error: Operation timed out) [23:45] i'm more than happy to run a static copy off pocket money [23:47] xmc: we average 60 mbps on the 95h percentile, guessing it will be higher as people are migrating [23:48] 95th, even [23:48] that'll almost certainly drop off after the first month or two [23:48] probably [23:49] so that's 18.5TB/mo assuming 60mbps [23:49] roughly [23:49] if my calc is correct [23:49] that's not a lot [23:50] you could even run that off a random high-bandwidth VPS, really [23:50] at an arbitrary VPS provider [23:50] rolfb: how much data is it? in GB [23:50] iirc 700ish? [23:51] it's 4.5 TB [23:51] ok [23:51] and i assume you've gc'd your repos [23:51] yes, we run gc regularly [23:51] rolfb: xmc: moment [23:52] *** dashcloud has joined #archiveteam [23:53] rolfb: xmc: http://www.soyoustart.com/en/disk/ [23:53] 250mbps unmetered if I am not mistaken [23:54] yes, budget box, but more than enough for a static copy [23:54] hrm [23:54] or https://www.hetzner.de/en/hosting/produkte_rootserver/px70 [23:54] 30TB traffic/mo [23:54] 2x4TB disk [23:54] not sure if they have bigger plamns [23:54] plans * [23:54] * xmc nods [23:55] https://www.hetzner.de/en/hosting/produktmatrix/rootserver-produktmatrix-sx [23:55] cheapest one comes with 30TB traffic also [23:55] 4x4TB SATA, 30TB traffic, 69 euro/mo [23:55] specifically https://www.hetzner.de/en/hosting/produkte_rootserver/sx60 [23:56] ok [23:56] anyway, yeah [23:56] and they charge 1.39euro/TB overage [23:56] not too bad at all [23:56] indeed [23:57] anyway, rolfb, if you're willing to hand over control of the domain and stuff to me, i'd be more than happy to run it indefinitely [23:57] more possible plans at https://robot.your-server.de/order/market/sortcol/hd/sorttype/up/page/1#386343 [23:57] and now i'm going to go back to work so i can attempt to earn the money to do more ridiculous computer things [23:57] xmc: i've gotten an OK to hand over domain and data, but with one caveat [23:57] oh? [23:58] xmc: lol, that's how life works isn't it [23:58] :p [23:58] [curiosity intensifies] [23:58] heh [23:58] ^ [23:58] i'm really sorry [23:58] secret sauce deployed [23:58] :) [23:59] the request is that it's read-only and that there's a notice about switching/migrating to gitlab [23:59] sure thing [23:59] i'm contemplating an apache instance with a bunch of bare repos in it that support being cloned from [23:59] xmc: you sure apache is the best option for that? [23:59] and a banner at the top of the directory index [23:59] i have no idea [23:59] maybe nginx