[00:04] *** JesseW has quit IRC (Leaving.)
[00:05] *** aaaaaaaaa has joined #archiveteam
[00:28] *** mistym has quit IRC (Remote host closed the connection)
[00:35] *** philpem has quit IRC (Ping timeout: 252 seconds)
[00:43] *** mistym has joined #archiveteam
[01:36] *** JesseW has joined #archiveteam
[01:39] *** xk_id has joined #archiveteam
[01:49] *** mistym has quit IRC (Remote host closed the connection)
[02:00] *** JesseW has quit IRC (Leaving.)
[02:27] *** beardicus has joined #archiveteam
[02:30] *** beardicus has quit IRC (Client Quit)
[02:35] *** skiy has quit IRC (Ping timeout: 258 seconds)
[02:40] *** xk_id_ has joined #archiveteam
[02:40] *** xk_id has quit IRC (Read error: Connection reset by peer)
[02:46] *** beardicus has joined #archiveteam
[02:48] *** beardicus has quit IRC (Client Quit)
[02:49] *** beardicus has joined #archiveteam
[02:56] *** primus104 has quit IRC (Leaving.)
[03:05] *** ripvanwin has quit IRC (Read error: Operation timed out)
[03:06] *** mistym has joined #archiveteam
[03:10] *** ripvanwin has joined #archiveteam
[03:41] *** xk_id has joined #archiveteam
[03:41] *** xk_id_ has quit IRC (Read error: Connection reset by peer)
[03:53] *** mistym has quit IRC (Remote host closed the connection)
[03:55] *** xk_id has quit IRC (Remote host closed the connection)
[03:55] *** xk_id has joined #archiveteam
[04:01] *** nwf has quit IRC (Read error: Operation timed out)
[04:03] *** marvinw has quit IRC (Read error: Operation timed out)
[04:03] *** aaaaaaaa_ has joined #archiveteam
[04:03] *** achip has quit IRC (Read error: Operation timed out)
[04:03] *** chfoo has quit IRC (Read error: Operation timed out)
[04:04] *** Kenshin has quit IRC (Read error: Operation timed out)
[04:04] *** Kenshin has joined #archiveteam
[04:05] *** aMunster has quit IRC (Read error: Operation timed out)
[04:06] *** chfoo has joined #archiveteam
[04:06] *** aaaaaaaaa has quit IRC (Read error: Operation timed out)
[04:06] *** Gfy has quit IRC (Closing Link: 195-154-179-104.rev.poneytelecom.eu (Ping timeout: 364 seconds))
[04:07] *** yotta has quit IRC (Read error: Operation timed out)
[04:09] *** yotta has joined #archiveteam
[04:09] *** pfallenop has quit IRC (Read error: Operation timed out)
[04:10] *** GLaDOS has quit IRC (Ping timeout: 600 seconds)
[04:11] *** xk_id has quit IRC (Read error: Operation timed out)
[04:11] *** GLaDOS has joined #archiveteam
[04:11] *** aMunster has joined #archiveteam
[04:13] *** aaaaaaaa_ is now known as aaaaaaaaa
[04:16] *** useretail has quit IRC (Ping timeout: 600 seconds)
[04:17] *** pfallenop has joined #archiveteam
[04:17] *** sep332 has quit IRC (Ping timeout: 600 seconds)
[04:17] *** dashcloud has quit IRC (Read error: Operation timed out)
[04:18] *** aMunster has quit IRC (Read error: Connection reset by peer)
[04:18] *** rduser has quit IRC (Read error: Operation timed out)
[04:19] *** rduser has joined #archiveteam
[04:21] *** dashcloud has joined #archiveteam
[04:21] *** sjm has quit IRC (Read error: Connection reset by peer)
[04:22] *** mistym has joined #archiveteam
[04:23] *** yotta has quit IRC (Ping timeout: 600 seconds)
[04:24] *** ikreymer has joined #archiveteam
[04:24] *** aaaaaaaaa has quit IRC (Quit: Leaving)
[04:25] *** dxrt has quit IRC (Read error: Operation timed out)
[04:25] *** dxrt has joined #archiveteam
[04:25] *** toad1 has quit IRC (Ping timeout: 600 seconds)
[04:26] *** toad1 has joined #archiveteam
[04:34] *** S[h]O[r]T has quit IRC (Ping timeout: 600 seconds)
[04:35] *** marvinw has joined #archiveteam
[04:37] *** toad1 has quit IRC (Ping timeout: 600 seconds)
[04:37] *** sjm has joined #archiveteam
[04:37] *** achip has joined #archiveteam
[04:37] *** sep332 has joined #archiveteam
[04:37] *** S[h]O[r]T has joined #archiveteam
[04:37] *** yotta_ has joined #archiveteam
[04:38] *** toad1 has joined #archiveteam
[04:38] *** yotta_ is now known as yotta
[04:38] *** aMunster has joined #archiveteam
[04:39] *** useretail has joined #archiveteam
[04:41] *** Gfy has joined #archiveteam
[04:44] *** nwf has joined #archiveteam
[04:45] *** ikreymer has quit IRC (Remote host closed the connection)
[04:59] *** xk_id has joined #archiveteam
[05:11] *** ikreymer has joined #archiveteam
[05:21] *** ikreymer has quit IRC (Remote host closed the connection)
[05:26] *** mistym has quit IRC (Remote host closed the connection)
[05:26] *** mistym has joined #archiveteam
[05:40] *** JesseW has joined #archiveteam
[05:46] *** mistym has quit IRC (Remote host closed the connection)
[05:52] *** mistym has joined #archiveteam
[05:54] An idea I just had (that probably has been suggested before) -- digging through the existing Wayback Machine data for linked pages that aren't archived, and adding them. This could be done fully automatically, say, as a warrior job...
[05:58] *** xk_id has quit IRC (Remote host closed the connection)
[05:58] *** xk_id has joined #archiveteam
[06:02] *** ikreymer has joined #archiveteam
[06:09] *** ikreymer has quit IRC ()
[06:11] *** mistym has quit IRC (Remote host closed the connection)
[06:12] *** mistym has joined #archiveteam
[06:18] *** JesseW has quit IRC (Leaving.)
[06:31] *** superkuh has quit IRC (hub.efnet.us irc.Prison.NET)
[06:31] *** tsp_ has quit IRC (hub.efnet.us irc.Prison.NET)
[06:31] *** nico_32 has quit IRC (hub.efnet.us irc.Prison.NET)
[06:31] *** zenguy_pc has quit IRC (hub.efnet.us irc.Prison.NET)
[06:31] *** db48x has quit IRC (hub.efnet.us irc.Prison.NET)
[06:31] *** sunnymilk has quit IRC (hub.efnet.us irc.Prison.NET)
[06:38] *** yuuko_ has joined #archiveteam
[06:38] *** superkuh has joined #archiveteam
[06:38] *** tsp_ has joined #archiveteam
[06:38] *** nico_32 has joined #archiveteam
[06:38] *** zenguy_pc has joined #archiveteam
[06:50] *** mistym has quit IRC (Remote host closed the connection)
[06:59] ===========================================================
[06:59] JASON SCOTT AUGUST JUBILEE HAS BEGUN
[06:59] Time to gear up with all your requests and little this and
[06:59] thats regarding archiveteam business. I'm available to use
[06:59] all my abilities and time to help with them. No reasonable
[06:59] silliness rejected. Goo goo ga joob.
[06:59] ===========================================================
[07:09] *** khaoohs_ has joined #archiveteam
[07:11] *** khaoohs has quit IRC (Ping timeout: 483 seconds)
[07:11] *** khaoohs has joined #archiveteam
[07:14] *** khaoohs_ has quit IRC (Read error: Operation timed out)
[07:43] *** schbirid has joined #archiveteam
[07:49] *** Ungstein has joined #archiveteam
[07:51] *** BlueMaxim has quit IRC (Read error: Connection reset by peer)
[07:51] *** Ungstein has quit IRC (Client Quit)
[08:05] *** Ungstein has joined #archiveteam
[08:07] *** habi has joined #archiveteam
[08:08] *** habi has left
[08:09] *** toad2 has joined #archiveteam
[08:10] *** toad1 has quit IRC (Read error: Operation timed out)
[08:11] *** primus104 has joined #archiveteam
[08:27] *** MMovie2 has joined #archiveteam
[08:30] *** MMovie has quit IRC (Ping timeout: 306 seconds)
[08:34] *** primus104 has quit IRC (Leaving.)
[08:53] *** db48x has joined #archiveteam
[09:15] *** khaoohs_ has joined #archiveteam
[09:17] *** khaoohs has quit IRC (Ping timeout: 240 seconds)
[09:19] *** primus104 has joined #archiveteam
[09:31] *** brayden has joined #archiveteam
[09:38] *** xk_id has quit IRC (Remote host closed the connection)
[10:38] *** khaoohs has joined #archiveteam
[10:39] *** khaoohs_ has quit IRC (Ping timeout: 306 seconds)
[10:41] *** xk_id has joined #archiveteam
[10:49] *** khaoohs_ has joined #archiveteam
[10:51] *** khaoohs__ has joined #archiveteam
[10:54] *** khaoohs_ has quit IRC (Read error: Operation timed out)
[10:54] *** khaoohs has quit IRC (Read error: Operation timed out)
[10:54] *** khaoohs_ has joined #archiveteam
[10:57] *** khaoohs__ has quit IRC (Ping timeout: 240 seconds)
[10:58] *** primus104 has quit IRC (Read error: Connection reset by peer)
[11:02] *** khaoohs__ has joined #archiveteam
[11:03] Much better than the Roman pope
[11:03] *** khaoohs_ has quit IRC (Read error: Operation timed out)
[11:05] *** primus104 has joined #archiveteam
[11:07] *** khaoohs has joined #archiveteam
[11:07] *** khaoohs has quit IRC (Client Quit)
[11:09] *** khaoohs__ has quit IRC (Read error: Operation timed out)
[11:25] *** db48x has quit IRC (Ping timeout: 258 seconds)
[11:36] *** db48x has joined #archiveteam
[11:37] *** xk_id has quit IRC (Read error: Connection reset by peer)
[11:38] *** xk_id has joined #archiveteam
[11:50] *** khaoohs has joined #archiveteam
[11:52] *** primus104 has quit IRC (Leaving.)
[12:10] *** primus104 has joined #archiveteam
[12:22] *** khaoohs_ has joined #archiveteam
[12:29] *** khaoohs has quit IRC (Read error: Operation timed out)
[12:45] *** scyther has joined #archiveteam
[13:12] *** khaoohs has joined #archiveteam
[13:19] *** brayden has quit IRC (Read error: Connection reset by peer)
[13:20] *** Ungstein has quit IRC (Quit: Leaving.)
[13:20] *** khaoohs_ has quit IRC (Read error: Operation timed out)
[14:21] *** mistym has joined #archiveteam
[14:39] *** mistym has quit IRC (Remote host closed the connection)
[14:56] *** mistym has joined #archiveteam
[15:28] *** primus104 has quit IRC (Leaving.)
[15:30] *** RichardG_ is now known as RichardG
[15:30] *** primus104 has joined #archiveteam
[15:30] *** primus104 has quit IRC (Client Quit)
[15:59] *** mistym has quit IRC (Remote host closed the connection)
[16:01] *** JesseW has joined #archiveteam
[16:06] *** Start has quit IRC (Quit: Disconnected.)
[16:08] *** Start has joined #archiveteam
[16:13] *** mistym has joined #archiveteam
[16:24] SketchCow: I know I have asked this before, and since your August jubilee has begun I decided to write to you here and not over mail.
[16:25] The blip grab is running very well at the moment, we'll very likely get it all in time
[16:25] Kenshin has moved 60T of blip data to a server from which we'll upload the data to IA
[16:26] Can you please create the collection for blip so the upload to IA can be started?
[16:28] *** habi has joined #archiveteam
[16:35] Let's think of a good identifier for the blip.tv collection.
[16:35] archiveteam_bliptv ?
[16:36] i'd add the dot, doesn't hurt: archiveteam_blip.tv
[16:36] The official name is blip, so I think we should do archiveteam_blip
[16:37] *** habi has left
[16:37] Formerly it was blip.tv, but now it is blip
[16:37] true, that seems best
[16:39] OK.
[16:40] archiveteam_blip verified as unique and entry created.
[16:41] I am now filling in some rough metadata.
[16:42] *** philpem has joined #archiveteam
[16:42] *** scyther has quit IRC (Read error: Operation timed out)
[16:43] Wait, it's delayed for some reason.
[16:43] (Errors when I try to edit)
[16:43] Not sure what's going on.
[16:45] *** JesseW has quit IRC (Ping timeout: 600 seconds)
[16:48] *** JesseW has joined #archiveteam
[16:50] I've got us looking at it. Keep on me if I don't have it fixed by later today.
[16:50] Also, I fixed a typo in my sorting script, so Google Moderator may actually get a collection soon.
[16:58] SketchCow: there's still stuff on Audit2014 for you to do (I think) -- if any of it could benefit from further clarification, let me know.
[17:07] *** kristian_ has joined #archiveteam
[17:07] hi
[17:08] I just uploaded a 10 MB PDF ... now I get the "(this item is currently being modified/updated with a "book_op" task)" message ... how long will it take, do you think?
[17:12] *** Jonimus has quit IRC (Ping timeout: 483 seconds)
[17:16] kristian_: link to the details page?
[17:17] JesseW: Yes, that'll be another runthrough for me.
[17:18] JesseW: here: https://archive.org/details/murnau_4_devils-danish_programme
[17:19] kristian_: AFAIK, derive tasks vary a lot depending on how busy the servers are doing other stuff. Probably someone else can speak more accurately.
[17:20] *** primus104 has joined #archiveteam
[17:21] okay
[17:21] kristian_: Consider it almost random - there's a LOT of work being done by many thousands of items and hundreds of simultaneous stuff.
[17:21] thanks
[17:21] yeah, I suppose so
[17:24] *** Jonimus has joined #archiveteam
[17:25] *** xk_id has quit IRC (Remote host closed the connection)
[17:27] I guess I'll just do some other stuff for a while :)
[17:27] *** aaaaaaaaa has joined #archiveteam
[17:31] That's my approach.
[17:36] it's up! https://archive.org/stream/murnau_4_devils-danish_programme#page/n3/mode/2up
[17:37] gods damned
[17:37] as you can tell, the formatting is wrong ... my bad
[17:37] *** primus104 has quit IRC (Leaving.)
[17:38] Archive Team Blip Collection is now set up: https://archive.org/details/archiveteam_blip
[17:38] I need to know the archive.org account e-mail addresses of whoever is uploading, so I can give them upload access.
[17:57] *** kristian_ has quit IRC (Remote host closed the connection)
[18:19] *** SilSte has joined #archiveteam
[18:29] *** pwnsrv has quit IRC (Ping timeout: 240 seconds)
[18:32] *** pwnsrv has joined #archiveteam
[18:46] can someone check blip.tv please?
[18:46] SilSte: i checked it, looks like a video portal
[18:47] lol
[18:48] :D
[18:49] schbirid: It's delivering only 3 MB items... before they were MUCH larger...
[18:49] schbirid: Perhaps someone took action on stuff...
[18:49] i am not sure if that was a genuine post or not.......but it did make me chuckle
[18:49] ^^
[18:54] *** mistym has quit IRC (Remote host closed the connection)
[18:59] antomatic: Can you stop the tracker?
[18:59] or @joepie91 ?
[19:01] curl -A ArchiveTeam http://blip.tv/rss/flash/6391774 --> 301 redirect to https://twitter.com/textfiles/status/631508558643904512
[19:02] antomatic: arkiver balrog closure Coderjoe edsu Lord_Nigh SadDM xmc: Can someone stop the blip-tracker?
[19:02] now that is snark
[19:02] uhmmmm what?
[19:03] balrog: Warriors get redirected to that tweet instead of downloading something useful
[19:03] SilSte: Sorry, I don't have any access to the tracker at all
[19:03] :(
[19:03] we got an agent ban ;)
[19:04] booo!
[19:04] And no one with rights is online..
[19:04] ==> Lot of trash being archived....
[19:05] It should be possible to identify any items returned after the ban was put in place, and requeue them though
[19:05] SketchCow: ^^
[19:05] has happened before
[19:06] Huh, redirecting to tweet - seems like assholery
[19:06] antomatic: yes
[19:06] I wonder if it started around 2hrs ago
[19:06] I hope this doesn't end up like, what was it, twitpic
[19:06] that tweet suggests that blip was completely backed up already though
[19:06] is that not the case?
[19:07] [sorry, I've been away for a while, haven't been following it
[19:07] ]
[19:07] I don't know
[19:07] nope
[19:07] Not as far as the tracker knows
[19:07] And I think there were still plenty of items that weren't added to the tracker yet
[19:08] huh
[19:09] ping chfoo arkiver yipdw for tracker pause on blip please
[19:09] *** mistym has joined #archiveteam
[19:13] *** garyrh has quit IRC (Quit: http://bnc4free.com/)
[19:15] *** garyrh has joined #archiveteam
[19:16] I thought we were done, otherwise I'd have not mentioned it.
[19:17] SketchCow: Can you stop the tracker? ^^
[19:17] SketchCow, Will be interesting to see if it occurred shortly after your tweet
[19:19] I'm curious about the timing as well.
[19:19] I can't stop the tracker.
[19:20] If it happened before then it's unrelated
[19:20] you can see the timing in the stats
[19:20] it's pretty obvious when things started to go wrong...
[19:21] well shit happens
[19:23] arkiver u guys got sorted for blip i saw? don't need another box atm?
[19:23] what
[19:24] blip is redirecting, need pause
[19:24] blip have poisoned the well
[19:24] well at least they did it in style
[19:24] 1500 requests/minute, can't say I blame them
[19:24] tracker limited to zero
[19:25] ow
[19:26] *** primus104 has joined #archiveteam
[19:26] (y)
[19:26] thx
[19:34] Lesson learned!
[19:34] if someone wants to do a workaround, I suggest more stealth
[19:35] or slower download
[19:43] *** mistym has quit IRC (Remote host closed the connection)
[19:43] *** JesseW has quit IRC (Leaving.)
[19:58] *** mistym has joined #archiveteam
[20:48] *** tsp_ has quit IRC (Ping timeout: 258 seconds)
[20:48] chfoo: can you please send me the logs for blip? I have to requeue some items due to the ban
[20:49] *** tsp_ has joined #archiveteam
[20:50] *** superkuh_ has joined #archiveteam
[20:50] *** db48x has quit IRC (hub.efnet.us irc.Prison.NET)
[20:50] *** yuuko_ has quit IRC (hub.efnet.us irc.Prison.NET)
[20:50] *** superkuh has quit IRC (hub.efnet.us irc.Prison.NET)
[20:50] *** nico_32 has quit IRC (hub.efnet.us irc.Prison.NET)
[20:50] *** zenguy_pc has quit IRC (hub.efnet.us irc.Prison.NET)
[20:53] *** db48x has joined #archiveteam
[20:53] *** sunnymilk has joined #archiveteam
[20:56] *** JesseW has joined #archiveteam
[20:57] *** kristian_ has joined #archiveteam
[20:57] hi again
[20:57] https://archive.org/details/murnau_4_devils-danish_programme
[20:57] why does the pdf get mangled like that?
[20:58] like what?
[20:58] (this might be better asked in #internetarchive , btw)
[20:59] ah, I did not know there was something like that
[21:00] turns out it's just in the viewer ... if you look at the file itself (rather big, btw) it's fine
[21:02] #internetarchive is an UNOFFICIAL support channel.
[21:03] ah ... so there is no official channel?
[21:04] *** nico_32_ has joined #archiveteam
[21:06] *** zenguy_pc has joined #archiveteam
[21:06] kristian_: not an IRC channel, no. Basically, there aren't anywhere near enough official IA people to man something like that. There is an official email addr (info@archive.org ) and the forums are ... more or less official (I think).
[21:06] OK, thanks
[21:06] The IRC channel was just made by SketchCow as a place for people to ask him (and others) questions without clogging up other channels, I think.
[21:08] *** will has quit IRC (Read error: Connection reset by peer)
[21:09] *** schbirid has quit IRC (Leaving)
[21:09] this is blatant piracy, should I report it somewhere? https://archive.org/details/Book1TheHungerGames
[21:10] ha, the viewer fixed itself
[21:10] kristian_: send a link to info@ , yes.
[21:18] *** SmileyG has quit IRC (Read error: Connection reset by peer)
[21:18] *** Smiley has joined #archiveteam
[21:25] I'll feel like a total snitch for doing it, but something like that should not be what Archive.org is for
[21:27] *** scyther has joined #archiveteam
[21:29] * SketchCow eyes narrow
[21:30] *** scyther has quit IRC (Read error: Connection reset by peer)
[21:33] * ersi points and laughs at kristian_
[21:33] Yeah, Archive.org is part of the Internet. The Internet is for porn. Therefore, Archive.org is for porn.
[21:34] some Victorian era stuff would be cool to have there
[21:34] but not just stuff from Brazzers
[21:35] although I guess it would be interesting in 100 years?
[21:35] * kristian_ shrugs at ersi
[21:36] *** scyther has joined #archiveteam
[21:36] btw does the archive.org accept porn collections?
[21:38] age verification would be needed
[21:39] even wikipedia has some articles: https://en.wikipedia.org/wiki/Deep_Throat_(film)
[21:39] https://en.wikipedia.org/wiki/Brazzers
[21:39] wikipedia *definitely* accepts articles on porn-related topics. There are multiple essays (on wikipedia) about why.
[21:42] interesting fact here: https://torrentfreak.com/can-porn-be-copyrighted-120817/
[21:44] I think this is quickly devolving into -bs territory
[21:47] Agreed. /join #archiveteam-bs
[21:49] *** aschmitz has quit IRC (Read error: Operation timed out)
[22:05] SketchCow: we just got 60T ready for upload to IA, we're not finished yet
[22:05] Ha.
[22:05] Well, guess what.
[22:05] Apologies for the miscommunication.
[22:05] Looks like blip is cooperative though.
[22:07] SketchCow: everyone has some miscommunication every now and then.
[22:07] I got the logs from chfoo, will sort them and add them again
[22:08] If ArchiveTeam doesn't get unblocked we're going to use 'real' useragents
[22:15] *** JesseW has quit IRC (Leaving.)
[22:17] So we just archived https://twitter.com/textfiles/status/631508558643904512 around 160000 times :)
[22:17] SketchCow: your post probably broke some record somewhere ;)
[22:24] *** db48x has quit IRC (Read error: Connection reset by peer)
[22:26] *** nico_32_ is now known as nico_32
[22:28] that makes me wonder what single url is in the wayback machine the most
[22:28] probably no easy way to tell, though
[22:31] I think you people need to respect the power of my words
[22:31] So saving it 160,000 times seems reasonable
[22:33] ---------------------------------------------
[22:34] From BLIP:
[22:34] hey, archiveteam requests to blip.tv should work again now. you guys were using a lot of bandwidth and when i saw your "we're done" tweet, i redirected what i thought were duplicate / unnecessary requests. let me know when you're really done!
[22:34] ---------------------------------------------
[22:34] Proceed with caution
[22:34] That means continue how we were going?
[22:34] Yes
[22:36] oh, they seem like nice people
[22:36] feels good to have cooperation for once..
[22:37] From Jeff:
[22:37] no problem! i found http://tracker.archiveteam.org/blip/ and it seems the downloading is going well otherwise :)
[22:42] SketchCow: looks like you're in contact with the person who set the redirect. Can you ask him if the redirect was set around 17:23 ?
[22:42] Just to be sure
[22:59] *** aschmitz has joined #archiveteam
[23:11] I requeued all items that finished after 17:00:00. This means we're getting some duplicates, but then we're sure we're not missing any videos
[23:14] Good
[23:14] JUBILEE CONTINUES
[23:15] By the way, co-worker Jeff had me make a script that, when run on a pile of items in a certain collection, ensures all the covers are what's shown in the graphics.
[23:15] (There is a routine to determine the "first readable" page that doesn't work for magazines, comics, etc. Also, some of them are skipped entirely.)
[23:15] oh nice
[23:16] sounds good!
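The requeue step described at 23:11 (re-adding every item that finished after 17:00:00 and accepting some duplicates) can be sketched roughly as below. This is only an illustration: the record layout, the item names, and the placeholder date are all assumptions, and the real tracker logs may look quite different.

```python
from datetime import datetime


def items_to_requeue(finished, cutoff):
    """Return sorted, de-duplicated names of items finished strictly after cutoff.

    `finished` is an iterable of (item_name, finished_at) pairs. This record
    layout is hypothetical; the real tracker logs may be shaped differently.
    """
    return sorted({name for name, finished_at in finished if finished_at > cutoff})


# Made-up sample records for illustration only; the date is a placeholder.
sample_log = [
    ("item-a", datetime(2015, 1, 1, 16, 58, 12)),
    ("item-b", datetime(2015, 1, 1, 17, 23, 40)),
    ("item-c", datetime(2015, 1, 1, 18, 5, 3)),
    ("item-b", datetime(2015, 1, 1, 18, 9, 55)),  # same item reported twice
]
cutoff = datetime(2015, 1, 1, 17, 0, 0)

print(items_to_requeue(sample_log, cutoff))
```

Picking a cutoff earlier than the suspected start of the redirect (17:00 rather than 17:23) trades a few duplicate downloads for the assurance that no affected item is missed, which is the choice described in the log.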
[23:17] So blip is running again. I'll add support for accounts and other non-video parts of blip tomorrow.
[23:19] Ingress is transferring ownership
[23:19] probably has to do with google and Alphabet
[23:20] looks like data is safe, though.
[23:27] *** arkiver sets mode: +o SketchCow
[23:27] *** arkiver sets mode: +o yipdw
[23:27] *** arkiver sets mode: +o chfoo
[23:46] *** primus104 has quit IRC (Leaving.)
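The 19:01 curl check showed blip answering with a 301 redirect to an unrelated site, which the warriors then archived as if it were real content. A minimal sketch of a guard against that, assuming the downloader can see the status code and the Location header, might look like this. The function name and the `expected_host` parameter are hypothetical and not part of any ArchiveTeam tool.

```python
from urllib.parse import urlparse


def is_offsite_redirect(status, location, expected_host="blip.tv"):
    """Return True when a 3xx response points away from the expected host.

    Hypothetical helper: redirects without a host (relative URLs) are
    treated as staying on the same site.
    """
    if not (300 <= status < 400) or not location:
        return False
    host = urlparse(location).hostname
    if host is None:
        return False  # relative redirect, same site
    host = host.lower()
    return not (host == expected_host or host.endswith("." + expected_host))


print(is_offsite_redirect(301, "https://twitter.com/textfiles/status/631508558643904512"))
print(is_offsite_redirect(301, "http://blip.tv/rss/flash/6391774"))
```

An item whose fetch trips such a check could be marked failed instead of complete, so that it is retried later rather than stored as a copy of the redirect target.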