[00:18] *** BartoCH has quit IRC (Ping timeout: 260 seconds)
[00:20] *** SmileyG has joined #archiveteam
[00:21] yahooanswers grab is started!!
[00:21] 500000 items queued
[00:24] *** Smiley has quit IRC (Read error: Operation timed out)
[00:25] Oh no haha will fire up home server and remote machines
[00:31] *** bai has quit IRC (Quit: server reboot)
[00:36] * nicolas17 has updated http://archiveteam.org/index.php?title=Mapillary
[00:49] *** OpticalSw has quit IRC (Ping timeout: 268 seconds)
[00:56] *** bauruine has quit IRC (Ping timeout: 260 seconds)
[01:00] not sure why you asked me specifically about Mapillary earlier, but FWIW, I agree with all your changes
[01:09] *** BlueMaxim has joined #archiveteam
[01:21] JesseW: I wanted to ask *somebody* for feedback before making my first edit to the wiki, you were quoted in the page (and I removed the quote because it seemed outdated by now), and we talked about it elsewhere :P
[01:24] ah, that makes sense :-)
[01:25] yes, thank you for removing my name (and the quote) -- I didn't need to be explicitly mentioned
[01:25] also, shame on me, I still didn't ask them about the "wikimedia commons export shouldn't have watermark" thing
[01:27] well, now you are reminded :-)
[01:51] *** tuankiet has joined #archiveteam
[01:56] I've uploaded pov-ray usenet archive to IA
[02:08] tomaspark: nice -- feel free to add a link from somewhere relevant on the archiveteam wiki
[02:08] neat :O
[02:11] I wonder if the usual crowd still hangs out in povray.off-topic, I should peek in
[02:12] * nicolas17 gets nostalgic
[02:20] i've posted the url on the usenet talk page
[02:22] I am currently downloading the mozilla/netscape usenet
[03:07] *** RichardG has quit IRC (Read error: Connection reset by peer)
[03:08] *** RichardG has joined #archiveteam
[03:16] orkut all done :3
[03:16] :O
[03:16] http://tracker.archiveteam.org/orkut/
[03:16] 0 to do
[03:17] (unless that's a partial list?)
[03:17] *** tomwsmf has quit IRC (Read error: Operation timed out)
[03:17] *** dashcloud has quit IRC (Read error: Operation timed out)
[03:22] while true; do curl 'https://a.mapillary.com/v2/stats/im?client_id=MkJKbDA0bnZuZlcxeTJHTmFqN3g1dzo1YTM0NjRkM2EyZGU5MzBh'; echo; sleep 60; done
[03:23] *** dashcloud has joined #archiveteam
[03:27] I've just found a list of usenet servers @ http://www.nyx.net/~bkraft/
[03:28] I have never used usenet
[03:29] but the private NNTP servers I ever used were gmane, povray, and lugnet
[03:29] * joepie91 whoop whoop off-topic siren
[03:29] best move to #archiveteam-bs :P
[03:43] *** nicolas17 has quit IRC (Read error: Operation timed out)
[04:06] nicolas17 -- that client_id is invalid
[04:08] flightcar.com shut down in July -- might as well stick it on the Deathwatch list
[04:15] *** aMunster has joined #archiveteam
[04:17] *** Sk1d has quit IRC (Ping timeout: 194 seconds)
[04:23] *** Sk1d has joined #archiveteam
[04:29] ... but it's dead already
[04:30] we can watch it be dead
[04:30] (also http://archiveteam.org/index.php?title=Deathwatch#Dead_as_a_Doornail )
[04:33] we can be a place for historians in 20 years to ask "what services died in July 2016" and get an answer
[04:37] *** d_rebel_ is now known as d_rebel
[04:59] *** Start_ has joined #archiveteam
[04:59] *** enr1c0 has quit IRC (Read error: Operation timed out)
[05:02] *** Start has quit IRC (Read error: Operation timed out)
[05:03] *** Start has joined #archiveteam
[05:06] *** Start_ has quit IRC (Read error: Operation timed out)
[05:22] *** redlob has quit IRC (Read error: Operation timed out)
[05:28] *** redlob has joined #archiveteam
[05:43] *** tomaspark has quit IRC (Ping timeout: 255 seconds)
[05:45] *** mutoso_ has quit IRC (Read error: Operation timed out)
[06:00] *** mutoso has joined #archiveteam
[06:02] *** arrith has quit IRC (Read error: Operation timed out)
[06:09] *** Honno has joined #archiveteam
[06:12] *** Coderjoe has quit IRC (Read error: Operation timed out)
[06:16] *** patrickod has quit IRC (west.us.hub irc.mzima.net)
[06:16] *** Chorca has quit IRC (west.us.hub irc.mzima.net)
[06:20] *** patricko- has joined #archiveteam
[06:24] *** JesseW has quit IRC (Read error: Operation timed out)
[06:39] *** Chorca has joined #archiveteam
[06:43] *** Coderjoe has joined #archiveteam
[06:54] *** Coderjoe has quit IRC (ircd.choopa.net irc.mzima.net)
[06:58] *** Coderjoe_ has joined #archiveteam
[07:30] *** zenguy has quit IRC (Read error: Operation timed out)
[07:38] *** zenguy has joined #archiveteam
[07:43] *** tomaspark has joined #archiveteam
[07:55] *** BartoCH has joined #archiveteam
[08:05] *** BartoCH has quit IRC (Remote host closed the connection)
[08:05] *** BartoCH has joined #archiveteam
[08:25] *** bzc6p has joined #archiveteam
[08:25] *** swebb sets mode: +o bzc6p
[08:25] *** bzc6p has left
[08:43] *** phuzion has quit IRC (Read error: Operation timed out)
[08:47] *** _vOYtEC has quit IRC (Ping timeout: 250 seconds)
[08:48] *** phuzion has joined #archiveteam
[09:31] *** WinterFox has joined #archiveteam
[09:49] *** GLaDOS has quit IRC (Quit: Oh crap, I died.)
[09:49] *** GLaDOS has joined #archiveteam
[09:59] We also might be a series of embittered old men
[10:39] *** tomaspark has quit IRC (Ping timeout: 255 seconds)
[10:40] *** tomaspark has joined #archiveteam
[10:45] *** W1nterFox has joined #archiveteam
[10:50] *** WinterFox has quit IRC (Read error: Operation timed out)
[10:55] *** W1nterFox has quit IRC (Ping timeout: 492 seconds)
[10:59] *** nicolas17 has joined #archiveteam
[10:59] *** WinterFox has joined #archiveteam
[11:08] *** bzc6p has joined #archiveteam
[11:08] *** swebb sets mode: +o bzc6p
[11:09] *** bzc6p has left
[11:14] *** alembic has quit IRC (Read error: Connection reset by peer)
[11:15] *** alembic has joined #archiveteam
[11:15] *** aMunster has quit IRC (Read error: Operation timed out)
[11:15] *** dxrt has quit IRC (Read error: Operation timed out)
[11:19] *** aMunster has joined #archiveteam
[11:19] *** dxrt has joined #archiveteam
[11:22] *** Madthias has quit IRC (Quit: ▒^٥ ▒^٥)
[11:26] *** aMunster has quit IRC (Read error: Operation timed out)
[11:34] *** aMunster has joined #archiveteam
[12:06] #noanswers for Yahoo! Answers!
[12:07] *** vOYtEC has joined #archiveteam
[12:13] Problem with yahooanswers not running on the warrior is fixed.
[12:16] reposting from -bs on YA:
[12:16] For those looking at yahoo answers, Keep your concurrency low otherwise you get banned and get a 500 (printed in browser as error 999)
[12:17] thanks
[12:33] *** BartoCH has quit IRC (Ping timeout: 260 seconds)
[12:39] *** BartoCH has joined #archiveteam
[12:41] *** nicolas17 has quit IRC (Ping timeout: 244 seconds)
[13:01] *** BlueMaxim has quit IRC (Quit: Leaving)
[13:04] *** dashcloud has quit IRC (Read error: Operation timed out)
[13:07] *** dashcloud has joined #archiveteam
[13:21] Need to find the dependencies for fedora 23... http://archiveteam.org/index.php?title=Talk:Wget_with_Lua_hooks
[13:37] *** WinterFox has quit IRC (Read error: Operation timed out)
[13:39] *** z00nx has quit IRC (Quit: WeeChat 1.5)
[13:44] *** z00nx has joined #archiveteam
[13:44] *** powerKitt has joined #archiveteam
[13:47] *** z00nx has quit IRC (Client Quit)
[13:49] So, Gawker.com is going to be closing its doors on the 25th. According to their shutdown announcement, they don't have a finalized plan for the site's archives. It's likely possible to just grab it with wget. Notably, two of the site's articles are hidden from robots.txt
[13:51] *** powerKitt has quit IRC (Quit: Page closed)
[14:07] We need to have a bot which responds to any mention of gawker with "It's done"
[14:09] we have the title...
[14:15] *** z00nx has joined #archiveteam
[14:16] *** z00nx has quit IRC (Client Quit)
[14:18] *** z00nx has joined #archiveteam
[14:20] *** z00nx has quit IRC (Client Quit)
[14:21] *** z00nx has joined #archiveteam
[14:21] *** z00nx has quit IRC (Client Quit)
[14:22] *** z00nx has joined #archiveteam
[14:25] *** z00nx has quit IRC (Client Quit)
[14:25] *** z00nx has joined #archiveteam
[14:28] *** BartoCH has quit IRC (Ping timeout: 260 seconds)
[14:33] *** BartoCH has joined #archiveteam
[14:49] tfw nobody reads topics
[14:50] *** tomaspark has quit IRC (hub.efnet.us irc.Prison.NET)
[14:50] *** patricko- has quit IRC (hub.efnet.us irc.Prison.NET)
[14:50] *** godane has quit IRC (hub.efnet.us irc.Prison.NET)
[14:50] *** Jogie has quit IRC (hub.efnet.us irc.Prison.NET)
[14:50] *** db48x has quit IRC (hub.efnet.us irc.Prison.NET)
[14:50] *** yipdw has quit IRC (hub.efnet.us irc.Prison.NET)
[14:50] *** Fake-Nam1 has quit IRC (hub.efnet.us irc.Prison.NET)
[14:50] *** Igloo^ has quit IRC (hub.efnet.us irc.Prison.NET)
[14:50] *** midas has quit IRC (hub.efnet.us irc.Prison.NET)
[14:50] *** achip has quit IRC (hub.efnet.us irc.Prison.NET)
[14:50] *** Fake-Name has joined #archiveteam
[14:51] *** midas1 has joined #archiveteam
[14:51] *** swebb sets mode: +o midas1
[14:51] *** yipdw_ has joined #archiveteam
[14:51] *** Igloo^_ has joined #archiveteam
[14:52] *** patrickod has joined #archiveteam
[14:56] *** nicolas17 has joined #archiveteam
[15:13] *** godane has joined #archiveteam
[15:20] *** achip has joined #archiveteam
[15:22] *** bzc6p has joined #archiveteam
[15:22] *** swebb sets mode: +o bzc6p
[15:22] *** bzc6p has left
[15:25] *** Igloo^_ is now known as Igloo^
[15:27] TWO of the site's articles are hidden you guys
[15:27] you guys you guys
[15:44] *** JesseW has joined #archiveteam
[15:49] Yahoo Answers still doesn't work with the warrior, at least for me.
[16:14] *** RichardG has quit IRC (Ping timeout: 370 seconds)
[16:17] *** JesseW has quit IRC (Read error: Operation timed out)
[16:33] *** Morbus has quit IRC (Read error: Operation timed out)
[16:49] *** BartoCH has quit IRC (Ping timeout: 260 seconds)
[17:00] *** AlexLehm has joined #archiveteam
[17:13] *** BartoCH has joined #archiveteam
[17:15] *** sep332 has joined #archiveteam
[17:16] If you google the string of text princeton-alums-state-dept-staffer-compete-in-revolting-sex-contest like dozens of websites have that in their robots.txt for some reason
[17:20] most/all gawker sites
[17:35] heh somebody from the article must have lawyered up
[17:36] it's still on Jezebel
[17:36] http://jezebel.com/5723470/princeton-alums-state-dept-staffer-compete-in-revolting-sex-contest
[17:36] can someone throw that into ArchiveBot?
[17:38] why don't you go to #archivebot and !ao it yourself
[17:38] because I can't
[17:38] why not?
[17:38] you have to have ops
[17:39] not for !ao
[17:39] ohhhhh no crap, thanks
[17:40] ahhh ok o is no recursion, does that mean it basically saves the page itself and all assets of that page?
[17:40] yes
[17:40] gracias
[17:47] *** kristian_ has joined #archiveteam
[18:14] *** arrith has joined #archiveteam
[18:20] *** RichardG has joined #archiveteam
[18:43] *** Famicoman has quit IRC (Ping timeout: 260 seconds)
[18:56] *** tomwsmf has joined #archiveteam
[19:09] *** VerifiedJ has joined #archiveteam
[19:36] *** kristian_ has quit IRC (Leaving)
[19:38] *** arrith has quit IRC (Leaving)
[19:47] *** RichardG has quit IRC (Ping timeout: 370 seconds)
[19:48] *** schbirid has joined #archiveteam
[20:13] https://archive.org/details/gawkeryoutube
[20:22] *** kristian_ has joined #archiveteam
[20:45] *** Jogie has joined #archiveteam
[20:51] *** Famicoman has joined #archiveteam
[20:55] *** tomaspark has joined #archiveteam
[20:58] *** Martini-- has joined #archiveteam
[20:58] Hi
[20:59] ...by any chance is somebody here?
[20:59] nope
[20:59] *** schbirid has quit IRC (Quit: Leaving)
[21:00] Since you have more experience than me on the Internet Archive, I was wondering if there are some tools to automate the uploading of files to the Archive.
[21:01] ia cli toolk
[21:01] tool
[21:04] Martini--: https://pypi.python.org/pypi/internetarchive
[21:04] https://github.com/jjjake/internetarchive
[21:04] That one?
[21:04] yep
[21:11] Thanks for the pointers.
[21:12] Do you know if someone has made some kind of frontend script to run a file sharing website where, in the background, the files are hosted on the Internet Archive?
[21:12] I think that would lead to IA misuse
[21:13] IA is not "free storage for anything"
[21:14] The IA would probably be able to figure it out pretty quickly due to the types of files those sites tend to attract
[21:14] (Encrypted multi-part RARs and obviously pirated content)
[21:16] People already do this.
[21:19] My idea is not to pirate material. It is to make a file sharing service for OS/2 Warp.
[21:20] I want to make something like hobbes - http://hobbes.nmsu.edu/h-browse.php?dir=/pub/multimedia/pointer
[21:20] any "file sharing frontend website" will attract piracy
[21:20] ...and at the same time store all files at Internet Archive.
[21:21] On the one hand OS/2 software would probably be a good idea to collect and archive since it's a dead platform
[21:21] On the other the IA can terminate accounts for any reason, including getting DMCA requests for content uploaded by them
[21:22] ...the idea is not to have illegal files there. Only the files that the community shares for the platform.
[21:25] Even if nobody uploads anything illegal they could still close your account and hide the items for essentially using them as a backend for a file sharing service
[21:26] Martini--: and the idea of email is not to have spam, so? :P
[21:26] Mirroring your content onto the IA would probably be a good idea, but it should only be used as a mirror/backup of your primary storage
[21:28] The terms of use say "The Archive may immediately terminate this Agreement at its sole discretion at any time upon written notice..." but I cannot find any wording yet that forbids that practice.
[21:30] MrRadar, have you seen any project/tools to make that mirroring easy?
[21:31] No, though it should be easy enough to whip something up with the IA CLI tool
[21:31] If you really want to use the IA as a backend for a file sharing site you should contact them directly about it and get permission from them
[21:34] I would contact them if I found a way to do it first :)
[21:34] info@archive.org
[21:35] Is it against the rules to make a site that has a "direct link" to an IA file? I don't see direct linking to a file as an issue to them. (if the file is legal)
[21:35] nicolas17: Are you happy without the K's in Argentina?
[21:36] OT
[21:36] Yes, we should take this to #archiveteam-bs
[21:36] good.
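The mirroring idea raised in the exchange above (keep primary storage elsewhere and push backup copies to IA with the `internetarchive` package) could be sketched roughly as follows. This is only an illustration, not a tool anyone in the channel wrote: the function names, the one-item-per-file layout, the `os2-mirror` prefix, and the `mediatype` value are all assumptions. It defaults to a dry run, and a real run would need credentials set up beforehand with `ia configure`. As advised in the channel, ask info@archive.org before using IA as anything more than a backup mirror.

```python
import os
import re


def make_identifier(prefix, relative_path):
    """Build an item identifier from a prefix and a file's relative path.

    Assumption: one item per file, lowercased, with every run of characters
    outside [a-z0-9._-] replaced by '-'. This naming scheme is hypothetical,
    and two files differing only by extension would collide.
    """
    stem = os.path.splitext(relative_path)[0].lower()
    slug = re.sub(r"[^a-z0-9._-]+", "-", stem).strip("-")
    return f"{prefix}-{slug}"


def plan_uploads(root, prefix):
    """Return sorted (identifier, path) pairs for every file under root."""
    plan = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            relative = os.path.relpath(path, root)
            plan.append((make_identifier(prefix, relative), path))
    return sorted(plan)


def mirror(root, prefix, dry_run=True):
    """Upload each planned file; with dry_run=True only print the plan."""
    plan = plan_uploads(root, prefix)
    if dry_run:
        for identifier, path in plan:
            print(f"would upload {path} -> {identifier}")
        return plan
    # Imported lazily so the dry run works without the package installed.
    from internetarchive import upload
    for identifier, path in plan:
        # checksum=True skips files already present in the item with the
        # same MD5, so repeated runs do not re-upload unchanged files.
        upload(identifier, files=[path],
               metadata={"mediatype": "software"},  # assumed value
               checksum=True, retries=3)
    return plan
```

Calling `mirror("/srv/os2-files", "os2-mirror")` (a hypothetical path) prints the planned uploads without touching the network; passing `dry_run=False` performs them.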
[21:42] *** mls_ has joined #archiveteam
[22:02] ...wow...I'm reading on Twitter that SketchCow shaved some days ago.
[22:03] *** mls_ is now known as Kksmkrn
[22:07] *** Honno has quit IRC (Read error: Operation timed out)
[22:10] *** dashcloud has quit IRC (Read error: Operation timed out)
[22:14] *** dashcloud has joined #archiveteam
[22:24] *** Kksmkrn has quit IRC (leaving)
[22:29] Bye, thanks for the pointers.
[22:29] *** Martini-- has quit IRC (Quit: Page closed)
[22:54] *** AlexLehm has quit IRC (Ping timeout: 260 seconds)
[23:00] https://t.co/BcBVXmznGm
[23:00] gah
[23:00] http://urbanmilwaukee.com/2016/08/19/journal-sentinel-archive-disappears/