[00:02] there's like 90 people in this channel. [00:03] also if i worked for yahoo id stay quiet about it :/ [00:04] Flash uploader can't do 2GB :( [00:06] Guys, you can get a virtual a VM for 2 hours or so with a fast connection and lots of disk space with root access [00:06] http://www.microsoft.com/en-us/server-cloud/support/learning-center/virtual-labs.aspx [00:06] Just take advantage of MS ;P [00:07] Some of these labs will give you like 5 VMs [00:08] what is alot of diskspace? [00:09] The one I tried last night had like 40GH [00:09] 40GB* [00:09] But like 5 machines with that [00:09] There might be labs that are longer than 2 hours, not sure [00:09] Some of the Hyper-V labs might contain a linux too [00:09] not to break your bubble [00:10] Disk Usage: 263.14GB/459.16GB (57.31%) [00:10] thats my smallest disk atm [00:10] Disk space likely depends on the lab [00:10] The VM ones might let you use as much as you feel like [00:10] I'll check now [00:11] and everybody who grabs anything on his own system kinda calculates that it might take abit longer than 2 hours [00:11] I assume they delete everything shortly after your session ends [00:11] but some let you extend them for a few hours [00:11] yeah, for ftps sure [00:11] websites too [00:11] There is a Linux VPS service that gives you 60 minutes of 7 days if you give them an email [00:12] or* [00:12] Let me try to find it [00:13] Serverlove will give you 15 GBP of free time to play with: http://www.microsoft.com/en-us/server-cloud/support/learning-center/virtual-labs.aspx [00:13] You should be able to use that for at least a few weeks or month if you are careful about resource usage (use a slow processor and little ram) [00:14] gah [00:14] wrong url [00:14] http://www.serverlove.com/free-trial/ [00:15] personally ill pass on them, maybe usable for project grabs but for long lasting grabs ill pass on them [00:16] Oh, sure [00:16] I was just giving options for people who are cheap [00:16] Windows Azure gives you a 
1month trial here: http://azure.microsoft.com/en-us/pricing/free-trial/?rnd=1 [00:17] Some of these may need Credit cards or a phone number [00:44] *** toad1 has joined #archiveteam-bs [00:45] *** LordNigh2 has joined #archiveteam-bs [00:48] *** Lord_Nigh has quit IRC (Ping timeout: 272 seconds) [00:48] *** LordNigh2 is now known as Lord_Nigh [01:06] This one has about 427GB between 5 VMs and 1 SAN [01:07] https://vlabs.holsystems.com/vlabs/technet?eng=VLabs&auth=none&src=microsoft.holsystems.com&altadd=true&labid=10871 [01:20] *** torvik has joined #archiveteam-bs [01:36] okay, this is kind of useless [01:36] they limit your speed [02:15] So I think I figured out the YouTube messages export mystery. It seems that the CSV export only exports what YouTube calls "Personal Messages", rather than the entire contents of a person's Messages Inbox. So, because I haven't received any "personal messages", when I clicked the "Download your old messages and contacts" link (presented to me above my full Messages Inbox), I got an empty file. As far as I can tell, there is no way to [02:15] export the other contents of my Messages Inbox, short of saving the web pages using the browser. Grr. Also it looks like they've announced a deletion date for them: "Old messages and contacts will be available until Dec 1st.". [02:48] *** mistym has joined #archiveteam-bs [02:48] *** primus104 has quit IRC (Leaving.) [03:08] *** schbirid2 has quit IRC (Read error: Operation timed out) [03:08] *** schbirid2 has joined #archiveteam-bs [03:52] *** mistym has quit IRC (Remote host closed the connection) [04:03] *** LordNigh2 has joined #archiveteam-bs [04:04] so i uploaded another 300+ pdfs of Times News [04:05] i'm waiting on the jobs to go down to like 50 before uploading more stuff [04:08] godane: Did you upload them seperately or all together? 
[04:10] *** Lord_Nigh has quit IRC (Ping timeout: 600 seconds) [04:10] *** LordNigh2 is now known as Lord_Nigh [04:19] seperately [04:32] *** ex-parro1 has quit IRC (Leaving.) [04:52] *** aaaaaaaaa has quit IRC (Leaving) [05:25] *** phuzion_ has quit IRC (Read error: Operation timed out) [05:25] *** sirkov has joined #archiveteam-bs [05:27] *** phuzion has joined #archiveteam-bs [05:37] That sounds painful [05:38] Did you write descriptions for everything? [05:38] or script it? [06:21] *** bsmith093 has quit IRC (Read error: Operation timed out) [06:41] tfgbd: we're still not entirely sure whether godane is human :) [06:41] scary efficiency [06:50] godane is neither human or machine [06:50] he is simply...godane [07:15] arkiver: can you link me a WARC you made (a small one, preferably) [07:40] *** primus104 has joined #archiveteam-bs [07:42] joepie91: sure, I'll give you a test warc from a project [07:47] Halo test warc: https://www.filepicker.io/api/file/9K65Lv5jRriJz4xIjP05 [07:47] Qwiki test warc: https://www.filepicker.io/api/file/ecdHUOVXSenVCpSowijC [07:48] 28.3 MB and 69.4 MB [07:59] *** primus105 has joined #archiveteam-bs [08:06] *** primus104 has quit IRC (Read error: Operation timed out) [08:12] *** xtr-201 has quit IRC (Read error: Connection reset by peer) [08:14] *** xtr-201 has joined #archiveteam-bs [08:16] arkiver: thanks - are these made with heritrix or wget-warc? [08:17] wget-warc [08:17] alright, happen to have a heritrix one laying around? [08:17] I can also supply you with some heritrix warc's if you want [08:17] ah yes [08:17] that'd be good :P [08:17] coming up in a bit [08:20] *** xtr-201 has quit IRC (Read error: Connection reset by peer) [08:22] *** xtr-201 has joined #archiveteam-bs [08:26] *** primus105 has quit IRC (Leaving.) 
[08:27] joepie91: http://www.onetimebox.org/box/LAfZXNNZA6xm7i2wo [08:27] 5 warc's from a heritrix crawl of technet.microsoft.com, ~100 MB each [08:27] Heritrix 3.3.0 [08:28] joepie91: https://engineering.groupon.com/2014/misc/gnome-foundation-and-groupon-product-names/ [08:29] *** xtr-201 has quit IRC (Read error: Connection reset by peer) [08:30] *** xtr-201 has joined #archiveteam-bs [08:30] arkiver: thanks [08:31] midas: yup [08:31] they wont use the name [08:32] UPDATE: After additional conversations with the open source community and the Gnome Foundation, we have decided to abandon our pending trademark applications for "Gnome." We will choose a new name for our product going forward. [08:32] the real reason: after getting hammered by every techsite in the world we wont try to abuse the name gnome [08:34] *** xtr-201 has quit IRC (Read error: Connection reset by peer) [08:37] *** xtr-201 has joined #archiveteam-bs [08:42] *** xtr-201 has quit IRC (Read error: Connection reset by peer) [08:43] hm is it possible to add ignores while wget is already running? [08:44] *** xtr-201 has joined #archiveteam-bs [08:46] oh lua it is [08:55] wget-lua you mean? 
[08:56] yeah [08:56] midas ^ [08:57] as far as I know it's not possible to add ignores to the lua file when wget-lua is running [08:57] but it might be possible though somehow [08:57] chfoo would know it better then me [08:57] but I don't think it's possible [08:58] *** Jonimus has quit IRC (Read error: Operation timed out) [08:59] hm [09:02] midasL my torrent is almost running a week on IA https://catalogd.archive.org/log/351937074 [09:03] after that week it will probably stop and create a resume file [09:03] I'll be able to see if it rexumes the torrent from the resume file if I redirive it [09:03] If it doesn't that sucks :/ [09:04] yeah that would be a major issue :p [09:04] *** xtr-201 has quit IRC (Read error: Connection reset by peer) [09:05] Percent Done: 2.1% Peers: ^ 0 kB/s to 0, v 546 kB/s from 1, of 1 (Ratio: 0.0) (0s idle) [09:05] in 17 hours :p [09:07] [09:32] the real reason: after getting hammered by every techsite in the world we wont try to abuse the name gnome [09:07] basically [09:07] *** xtr-201 has joined #archiveteam-bs [09:07] as I said on Twitter, I suspect they realized that the majority of their 'partners' are small businesses [09:07] and they can't actually depend on corporate apathy here [09:08] i just cant figure out how someone decided it was a good name to go with [09:14] *** Jonimus has joined #archiveteam-bs [09:15] *** xtr-201 has quit IRC (Read error: Connection reset by peer) [09:17] *** xtr-201 has joined #archiveteam-bs [09:31] midas: honestly? I very strongly doubt this was a case of ignorance [10:15] *** godane has quit IRC (Ping timeout: 633 seconds) [10:31] *** godane has joined #archiveteam-bs [10:58] *** BlueMaxim has quit IRC (Quit: Leaving) [12:00] yay, ramnode's response to my complaint was essentially "yeah no we have issues and we have no idea how to fix them" [12:18] lolwut? 
[12:24] *** Famicoman has joined #archiveteam-bs [12:24] *** swebb sets mode: +o Famicoman [12:24] *** midas sets mode: +o Famicoman [12:36] *** bsmith093 has joined #archiveteam-bs [13:44] hey Famicoman [13:44] i'm very close to 50k items in my inbox [13:50] *** primus104 has joined #archiveteam-bs [13:53] Kazzy: well that's a bit problematic.. any more details? [13:53] * joepie91 is ready to assign blame to openvz [13:53] *** primus104 has quit IRC (Client Quit) [13:53] joepie91, not an openvz issue, was going on with their KVM's too. It's a network-wide issue in NL apparently.. not sure why it only affects some nodes [13:54] nick basically told me to check twitter for updates, and the alternative was moving to seattle etc, which isn't ideal at all [14:02] some one added warc.gz files to my cbsnews.com videos: https://archive.org/details/cbsnews.com-video-2003-12-25 [14:02] what the hell? [14:03] did i dc? [14:05] anyone else psyched about philea today? :) [14:05] another item with warc.gz added: https://archive.org/details/cbsnews.com-video-2003-12-26 [14:06] ximm@archive.org is the guy [14:07] hm? not seeing a warc file? [14:07] schbirid2, kind of, I'm unable to watch the stream / keep up to date at the minute though [14:07] https://archive.org/download/cbsnews.com-video-2003-12-26/IA-FOC-cbsnews.com-20140812225007-00004.warc.gz [14:08] oh yeah now i see godane [14:08] Kazzy: livestreamer "http://new.livestream.com/accounts/362/events/3544091" best [14:08] :) [14:09] are they wayback warcs maybe godane ? because when i try to grab them im getting a The item is not available due to issues with the item's content. 
[14:09] you have to log in to download anything from podcasts collection for some reason [14:09] schbirid2, ~1mbit tethering, while trying to irc, skype and browse the web [14:10] i am logged in :/ [14:10] I don't have 4g yet, so I can't do much [14:10] Kazzy: =) [14:10] midas: i can download the file [14:10] schbirid2: http://25.media.tumblr.com/179479eed571ab7fcfc6e4e5336d7461/tumblr_msmtnjdNiX1qlukpso1_400.gif [14:11] maybe because you uploaded it [14:11] (the vid, not the warc) [14:12] schbirid2: https://twitter.com/NASA/status/532528944143413248/photo/1 [14:12] *** sankin has joined #archiveteam-bs [14:24] midas: ESA > NASA! [14:24] imperialist pig [14:25] hey, dutchy here, ESA was located near my work :P [14:25] my old work that is [14:26] i sent a email to the guy that uploaded those warc.gz [14:26] and i just grabbed nasa because they had that picture on their twitterfeed [14:26] schbirid2: and im hoping, praying, begging it to land safely on the comet :p [14:27] the US had too many firsts in space [14:29] :> [14:29] Kazzy: have they confirmed whether it's an issue with their own equipment or with the upstream? [14:29] (upstream being dataplace) [14:29] perhaps I should walk by the datacenter, they're right across the water :P [14:30] hey SketchCow [14:30] I didn't ask, but I'm assuming it's an issue with their equipment, joepie91 [14:30] I'd assume a decent datacenter wouldn't have issues for ' a couple of days ' without throwing new hardware at it [14:31] ehhhh. :P [14:31] looks like one person at IA uploaded warc.gz to my existing items [14:31] * joepie91 thinks back to a... certain... incident with leaseweb [14:31] well [14:31] maybe then, but they'd have passed off blame a while ago if it wasn't their problem [14:32] idk, they're usually fairly 'quiet' in support tickets [14:32] might want to ask them on IRC [14:32] will probably get you a more direct answer [14:32] :P [14:33] joepie91: across the water from you, im betting you are to blame! 
;) [14:34] looks another item with warc.gz: https://archive.org/details/cbsnews.com-video-2004-01-01 [14:34] http://dat.serveert.me.uk/p/sunet almost done \o/ [14:34] thats really going to bother me today [14:36] godane: might just be an accident [14:36] godane: i once found a bug that allowed me to upload to other people's items [14:36] midas: hehe [14:36] got a t-shirt, maybe you will get one too ;) [14:36] ok [14:36] really though, they're in alblasserdam [14:36] I'm in Dordrecht [14:36] err sweater, not shirt [14:36] look it up on google maps :P [14:36] (my ping to them is, what, 6ms? via AMS-IX though) [14:37] schbirid2: you get an IA sweater for reporting vulns?! [14:37] but it still will bother me today [14:37] it shouldn't be there [14:38] i wonder what the biggest file on IA is right now [14:40] joepie91: i did. it was pretty massive (the vuln) [14:41] hm what shall i make for dinner [14:42] thinking pasta [14:44] schbirid2: pentesting time! lol [14:44] also [14:44] perhaps we should update archivebot to support this internet, too: http://www.bbc.com/earth/story/20141111-plants-have-a-hidden-internet [14:45] :D [14:56] *** primus104 has joined #archiveteam-bs [15:25] lol [15:28] live rosetta landing! [15:28] http://www.nasa.gov/multimedia/nasatv/index.html [15:39] schbirid2: did something go wrong? or is he just mad or something? [15:40] oh nevermind [15:47] *** mistym has joined #archiveteam-bs [15:49] whu [15:55] *** aaaaaaaaa has joined #archiveteam-bs [16:01] http://xkcd1446.org [16:16] *** sankin has quit IRC (Leaving.) 
[16:17] *** sankin has joined #archiveteam-bs [16:29] *** kvieta has quit IRC (Read error: Operation timed out) [16:29] *** kvieta has joined #archiveteam-bs [16:42] *** mistym has quit IRC (Remote host closed the connection) [16:57] *** SadDM has quit IRC (Read error: Operation timed out) [17:59] *** godane has quit IRC (ircd.choopa.net irc.mzima.net) [17:59] *** torvik has quit IRC (ircd.choopa.net irc.mzima.net) [17:59] *** lysobit has quit IRC (ircd.choopa.net irc.mzima.net) [17:59] *** robink has quit IRC (ircd.choopa.net irc.mzima.net) [17:59] *** amerrykan has quit IRC (ircd.choopa.net irc.mzima.net) [17:59] *** RedType has quit IRC (ircd.choopa.net irc.mzima.net) [17:59] *** Baljem_ has quit IRC (ircd.choopa.net irc.mzima.net) [17:59] *** cloudmons has quit IRC (ircd.choopa.net irc.mzima.net) [18:23] *** godane has joined #archiveteam-bs [18:23] *** torvik has joined #archiveteam-bs [18:23] *** lysobit has joined #archiveteam-bs [18:23] *** robink has joined #archiveteam-bs [18:23] *** amerrykan has joined #archiveteam-bs [18:23] *** RedType has joined #archiveteam-bs [18:23] *** Baljem_ has joined #archiveteam-bs [18:23] *** cloudmons has joined #archiveteam-bs [18:23] *** midas sets mode: +o Baljem_ [18:25] *** dashcloud has quit IRC (Read error: Operation timed out) [18:26] *** dashcloud has joined #archiveteam-bs [19:14] *** RedType has quit IRC (Quit: kill dash nine) [19:15] *** RedType has joined #archiveteam-bs [19:34] *** bauruine has quit IRC (Ping timeout: 265 seconds) [19:39] *** bauruine has joined #archiveteam-bs [20:15] *** Katrina20 has joined #archiveteam-bs [20:16] *** Katrina20 has quit IRC (Read error: Connection reset by peer) [20:28] *** lrkj has quit IRC (Ping timeout: 272 seconds) [21:52] *** sankin has quit IRC (Leaving.) [22:17] * midas plays archiveteam themesong [22:17] * midas is listening to Can't Stop Won't Stop - Up and Away (feat. 
June) [22:18] btw, song has _nothing_ to do with archiving, yet the band AND title could be exactly archiveteams motto [22:20] * Kazzy tries to remember which devinsupertramp video(s) this was used in [22:33] *** zenguy_pc has quit IRC (Read error: Operation timed out) [22:39] damnit, ovh has new BK boxes [22:39] the old one im running is 4T [22:39] new one is 8T [22:41] wait a second [22:41] mine is running raid1 [22:41] thats the issue :p [23:03] nn all [23:03] nn o/ [23:07] The backup of Encyclopedia Dramatica is gone. ("2011-06-22 backup" on the Wiki.) They all redirect to some shady download manager. http://archiveteam.org/index.php?title=Encyclopedia_Dramatica#2011-06-22_backup [23:10] Same goes for the "local mirror" of the "Webecology Project Dumps", but the Wayback Machine has a copy. I'll update the wiki. http://www.archiveteam.org/archives/edramatica/ [23:10] seems multiupload.com redirects to some other place now [23:10] quick google shows it was happening janurary this year [23:11] 2013 even [23:13] it would probably be good to get those links replaced with links to IA if anyone still has the files [23:18] *** dashcloud has quit IRC (Read error: Operation timed out) [23:21] *** dashcloud has joined #archiveteam-bs [23:31] *** robink has quit IRC (Remote host closed the connection) [23:35] *** robink has joined #archiveteam-bs