[00:07] *** zenguy_pc has joined #archiveteam [00:14] *** JesseW has joined #archiveteam [00:15] *** Start has joined #archiveteam [00:32] *** VADemon has quit IRC (Read error: Connection reset by peer) [00:35] *** xk_id_ has quit IRC (Remote host closed the connection) [00:35] *** xk_id has joined #archiveteam [00:53] *** xk_id has quit IRC (Read error: Operation timed out) [01:00] *** JesseW has quit IRC (Read error: Operation timed out) [01:09] *** JesseW has joined #archiveteam [01:17] *** JesseW has quit IRC (Read error: Operation timed out) [01:18] *** Atom__ has joined #archiveteam [01:25] *** Atom-- has quit IRC (Ping timeout: 506 seconds) [01:34] *** JesseW has joined #archiveteam [01:36] *** xk_id has joined #archiveteam [01:38] *** xk_id_ has joined #archiveteam [01:38] *** xk_id has quit IRC (Read error: Connection reset by peer) [01:44] *** primus104 has quit IRC (Leaving.) [01:59] *** aaaaaaaaa has joined #archiveteam [01:59] *** swebb sets mode: +o aaaaaaaaa [01:59] *** zenguy_pc has quit IRC (Read error: Operation timed out) [02:00] *** Sanqui has quit IRC (Quit: .) [02:04] *** JesseW has quit IRC (Ping timeout: 616 seconds) [02:05] *** Froggypwn has quit IRC (Ping timeout: 268 seconds) [02:07] *** zenguy_pc has joined #archiveteam [02:23] *** Sanqui has joined #archiveteam [02:27] *** nick_name has joined #archiveteam [02:27] Hello [02:28] Someone should archive 000webhost.com [02:29] and also maybe some windows 98 and windows 2000 hot-fix, update, and patches collections [02:29] Those websites are quite dangly [02:33] mdgx.com [02:33] ^that's quite a volatile one [02:33] *** nertzy has quit IRC (Quit: This computer has gone to sleep) [02:33] It's down today... [02:34] *** nick_name has quit IRC (Quit: Page closed) [02:41] *** JesseW has joined #archiveteam [02:44] *** rxchivert has joined #archiveteam [02:44] hi there, I got a VPS to play around for a couple of days and I was trying to run an AT script on it [02:44] but can’t get wget-lua to work [02:45] *** SiBurning has joined #archiveteam [02:45] http://imgur.com/xZyAg7x [02:46] That is a known issue [02:46] (running on Ubuntu) [02:46] see under "wget-lua was not successfully built" in readme.md [02:47] thanks! [02:49] Wanted to stop in and say hi. Member of CHFB on Yuku, and have a programming background. [02:51] *** JesseW has quit IRC (Read error: Operation timed out) [02:52] hi there! [02:53] hi [02:53] Trying to read up on the wiki. Very cool project. Thanks for being there. [03:06] is there a way to get access to the web interface other than via VNC to the VPS? [03:13] try --address to bind to the vps's public address when you run the pipeline but this may be better on -bs or #warrior [03:23] *** RichardG has quit IRC (Ping timeout: 606 seconds) [03:26] *** JesseW has joined #archiveteam [03:47] *** JesseW has quit IRC (Read error: Connection reset by peer) [03:47] *** JesseW has joined #archiveteam [03:54] *** zenguy_pc has quit IRC (Read error: Operation timed out) [03:56] *** nertzy has joined #archiveteam [04:00] *** RichardG has joined #archiveteam [04:06] *** superkuh has joined #archiveteam [04:08] *** zenguy_pc has joined #archiveteam [04:10] *** briggs has joined #archiveteam [04:10] *** briggs has quit IRC (Client Quit) [04:13] *** bzc6p_ is now known as bzc6p [04:22] *** nertzy has quit IRC (Quit: This computer has gone to sleep) [04:36] *** superkuh has quit IRC (Remote host closed the connection) [04:36] *** JesseW has quit IRC (Read error: Operation timed out) [04:36] *** aaaaaaaaa has quit IRC (Leaving) [04:45] *** Froggypwn has joined #archiveteam [04:57] *** SiBurning has quit IRC () [05:03] *** Ungstein has quit IRC (Quit: Leaving.) [05:03] *** Ungstein has joined #archiveteam [05:03] *** rxchivert has quit IRC (rxchivert) [05:28] *** JesseW has joined #archiveteam [05:51] the tracker appears to be freaking out: 507 Server Error: The tracker is out of resources. [05:53] *** WinterFox has joined #archiveteam [05:54] *** zenguy_pc has quit IRC (Read error: Operation timed out) [06:08] *** Dark_Star has quit IRC (Ping timeout: 360 seconds) [06:08] *** zenguy_pc has joined #archiveteam [06:25] *** JesseW has quit IRC (Read error: Operation timed out) [07:04] *** primus104 has joined #archiveteam [07:22] *** aliz has quit IRC (Remote host closed the connection) [07:50] *** schbirid has joined #archiveteam [07:56] *** zenguy_pc has quit IRC (Read error: Operation timed out) [08:00] *** atomotic has joined #archiveteam [08:09] *** primus104 has quit IRC (Leaving.) [08:11] *** zenguy_pc has joined #archiveteam [08:31] *** MMovie has joined #archiveteam [08:34] *** MMovie1 has quit IRC (Ping timeout: 310 seconds) [08:40] *** pokeball9 has quit IRC (Quit: Connection closed for inactivity) [08:46] *** BlueMaxim has quit IRC (Quit: Leaving) [09:00] *** Infreq_ has quit IRC (Quit: 始めましょう!) [09:25] *** VADemon has joined #archiveteam [09:31] *** WinterFox has quit IRC (Read error: Operation timed out) [09:41] *** WinterFox has joined #archiveteam [09:57] *** zenguy_pc has quit IRC (Read error: Operation timed out) [10:00] Uploaded three days ago and already 100000+ downloads: https://archive.org/details/archiveteam_archivebot_go_20151018060001 [10:10] *** zenguy_pc has joined #archiveteam [10:25] *** primus104 has joined #archiveteam [10:34] *** anomie has quit IRC (Read error: Connection reset by peer) [10:42] *** anomie has joined #archiveteam [10:43] *** bzc6p_ has joined #archiveteam [10:43] *** swebb sets mode: +o bzc6p_ [10:44] *** bzc6p has quit IRC (Read error: Operation timed out) [10:44] *** vegbrasil has quit IRC (Remote host closed the connection) [10:46] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [10:49] comcast is taken off of the tracker [10:50] it's done? [10:51] two more items are probably still running [10:51] those are very big online stores [10:51] so I'm not sure if they'll ever finish [10:51] all other sites have been done. [10:51] sweet [10:52] midas: to be more specific, all sites we could are done, if any other sites are found, please let me know and they'll be added [10:53] Exception: Unfortunately your IP or country is banned from GameFront. [10:53] damnit [10:53] which country is your IP from? [10:53] this was from OVH [10:53] france [10:54] ill try a dutch system to see if that works [10:54] midas: yeah, France is in the banlist: http://www.twcenter.net/forums/showthread.php?573967-Gamefront-has-banned-half-the-world [10:55] yeah i know [10:55] .nl probably too according to the list i dropped in #grillfront [10:56] afaik the netherlands is not banned [11:28] arkiver: re: the Comcast grab, were either of those two items claimed by one of my crates? I noticed one had been stuck in a loop fetching the same two pages over and over [11:29] the .warc.gzs are still on the machine if I can upload them manually somehow [11:37] *** pokeball9 has joined #archiveteam [11:57] *** zenguy_pc has quit IRC (Read error: Operation timed out) [12:05] hmmm. do we have anything about App.net? http://blog.app.net/2014/05/06/app-net-state-of-the-union/ [12:30] *** primus104 has quit IRC (Leaving.) [12:37] *** WinterFox has quit IRC (Remote host closed the connection) [12:44] *** zenguy_pc has joined #archiveteam [12:48] *** garyrh has quit IRC (Remote host closed the connection) [13:25] *** garyrh has joined #archiveteam [13:28] *** vegbrasil has joined #archiveteam [14:05] *** Boppen has joined #archiveteam [14:19] *** Start has quit IRC (Quit: Disconnected.) [14:32] *** xk_id_ has quit IRC (Remote host closed the connection) [14:33] *** xk_id has joined #archiveteam [14:35] *** xk_id has quit IRC (Remote host closed the connection) [14:42] *** PurpleSym has joined #archiveteam [14:42] *** Start has joined #archiveteam [14:48] *** xk_id has joined #archiveteam [14:52] *** xk_id has quit IRC (Remote host closed the connection) [14:52] *** xk_id has joined #archiveteam [14:59] *** xk_id has quit IRC (Read error: Operation timed out) [15:13] *** primus104 has joined #archiveteam [15:30] *** slang has joined #archiveteam [15:30] WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD [15:32] "hunter2" [15:34] *** bzc6p_ is now known as bzc6p [15:35] slang: what is your quest? [15:38] To seek the Holy Grail! [15:38] (well, and add some shorteners to the urlteam wiki...) [15:38] Then welcome here! The secret world is "yahoosucks". [15:38] *word [15:38] thanks! [15:42] *** primus104 has quit IRC (Leaving.) [15:44] *** JesseW has joined #archiveteam [15:50] bzc6p: one could call it a secret worLd also... [15:54] *** JesseW has quit IRC (Read error: Operation timed out) [15:54] *** nwf has quit IRC (Read error: Operation timed out) [15:54] *** nwf has joined #archiveteam [16:02] *** Start has quit IRC (Quit: Disconnected.) [16:10] *** xk_id has joined #archiveteam [16:14] *** xk_id_ has joined #archiveteam [16:14] *** xk_id has quit IRC (Read error: Connection reset by peer) [16:15] *** WubTheCap has joined #archiveteam [16:18] *** xk_id has joined #archiveteam [16:18] *** xk_id_ has quit IRC (Read error: Connection reset by peer) [16:19] Well, about matu.red that I posted here three days ago, it's now gone [16:19] Files on x.matu.red domain also don't exist anymore [16:19] https://git.pantsu.cat/WubTheCaptain/deathwatch-pomf#matured [16:19] Didn't get enough time to scrape places for links to archive [16:26] *** Ghost_of_ has joined #archiveteam [16:27] *** logan has quit IRC () [16:27] *** rxhivert has joined #archiveteam [16:28] hi there, can somebody add voir.ca to the archiving bot queue ? [16:29] it’s a small magazine in Quebec but the owners are planning a “rebranding” which includes getting rid of a number of articles/columns [16:31] Okay, matu.red might still be okay but it really feels like it starts needing attention soon [16:31] > Got suspended by my host for bandwidth reasons. Working on a solution. [16:32] Querying if the files are still there [16:45] *** xk_id_ has joined #archiveteam [16:46] *** xk_id has quit IRC (Read error: Connection reset by peer) [16:47] *** logan has joined #archiveteam [17:02] *** rxhivert has quit IRC (rxhivert) [17:02] *** philpem has joined #archiveteam [17:06] *** Dark_Star has joined #archiveteam [17:12] Ghost_of_: arkiver is a member who has organized a lot of ArchiveTeam projects recently. He is "just one user", but one of those whom we must thank the most, at least nowadays. [17:13] Thank you arkiver ! [17:14] hi bzc6p ... yes, I get the feeling that he's pretty active. [17:14] What ArchiveTeam needs the most these days is more arkivers. [17:15] by the way, there's currently complaints about ugly "your PC is infected" pop-ups at the CHFB ... I guess it won't make things easier? [17:16] hurm. http://www.yuku.com/home/goldstory/ [17:17] this is a bit hairy, it seems [17:18] BUT I guess the CHFB crowd would not mind paying (for example) arkiver towards a subscription, if it makes for a better result [17:23] most sites have ads anyway, i doubt this would be a problem [17:23] OK [17:26] we don't sanitize ads [17:26] as a general guideline anyway, of course there are exceptions [17:30] *** vitzli has joined #archiveteam [17:37] Also for that project, should we not be looking at the larger site rather than just one forum on it? or is it only this one forum at risk? [17:39] *** vitzli has quit IRC (Quit: Leaving) [17:41] larger site, ideally [17:41] arkiver knows more about that I think [17:41] :p [17:47] if this forum is at risk, they probably all are. I don't really know the others, so I asked for the CHFB initially [17:47] * Ghost_of_ is out ... take care! [17:47] *** Ghost_of_ has quit IRC (Leaving) [17:52] *** RedType_ has joined #archiveteam [17:52] *** primus104 has joined #archiveteam [17:54] *** RedType has quit IRC (Read error: Operation timed out) [17:55] *** rxhivert has joined #archiveteam [18:17] *** rxhivert has quit IRC (Quit: rxhivert) [18:22] *** aaaaaaaaa has joined #archiveteam [18:22] *** swebb sets mode: +o aaaaaaaaa [18:40] *** vOYtEC has quit IRC (Read error: Connection reset by peer) [18:40] *** DFJustin has quit IRC (Read error: Connection reset by peer) [18:40] *** DFJustin has joined #archiveteam [18:40] *** swebb sets mode: +o DFJustin [18:54] *** insane_al has quit IRC (Leaving) [18:57] *** vOYtEC has joined #archiveteam [19:43] *** db48x has quit IRC (Remote host closed the connection) [19:51] *** BlueMaxim has joined #archiveteam [19:54] *** WinterFox has joined #archiveteam [20:11] *** rxhivert has joined #archiveteam [20:15] *** rxhivert_ has joined #archiveteam [20:15] *** rxhivert_ has quit IRC (Connection closed) [20:18] *** rxhivert has quit IRC (Read error: Operation timed out) [20:27] *** WinterFox has quit IRC (Remote host closed the connection) [20:31] *** Froggypwn has quit IRC (Read error: Operation timed out) [20:32] *** Froggypwn has joined #archiveteam [20:41] *** PurpleSym has quit IRC (Remote host closed the connection) [20:44] *** Start has joined #archiveteam [20:50] *** SilSte has joined #archiveteam [20:55] bzc6p: WubTheCap: thanks! :) [20:56] So I talked with Nemo_bis and we're going to grab every single wiki as WARC file! [20:56] That means all wikis will also be available through the wayback machine [20:56] Currently only mediawiki wikis are supported [20:56] First WARC is here: https://archive.org/details/wikis-mediawiki_kucharka.wiki_api.php_kucharka.wiki_-20151021-223700.warc [20:58] SketchCow: please see above. We're going to work on saving all wikis into the wayback machine [20:58] What about (and feel free to tell me this is a stupid idea) Wikipedia [20:59] *** Start has quit IRC (Quit: Disconnected.) [21:04] HCross: Well I think at some point we might also do wikipedia with the wiki saving project, but it doesn't have a high priority [21:04] Wikipedia is very well covered by the wayback machine. [21:04] ah [21:04] The smaller not-so-popular wikis are not and should get priority over wikipedia now I think [21:05] *** xk_id_ has quit IRC (Remote host closed the connection) [21:07] *** xk_id has joined #archiveteam [21:07] *** xk_id_ has joined #archiveteam [21:07] *** xk_id has quit IRC (Read error: Connection reset by peer) [21:15] yeah, when wikispaces shut down all their free wikis, that was a giant sucking sound [21:35] *** rxhivert has joined #archiveteam [21:47] >We're not archive.org [21:47] This message is pretty prominent, but I still haven't been able to figure out what's the relationship between Archive Team and archive.org. It looks to me like archive.org is mostly driven by Archive Team contributions. [21:47] It's a bit confusing. [21:48] it is not [21:48] we are just doing some hobby projects [21:49] and archive.org lets us shove them into it [21:49] nighty [21:49] *** schbirid has quit IRC (Quit: Leaving) [21:50] fyi Digital Ocean is having some sales for the opening of their Canada data center [21:50] acchan: archive.org does a ton of things besides archive team [21:50] you can get spin a VPS for a month for free [21:51] Is there an archive.org IRC channel? [21:51] there is no official one [21:51] there is #internetarchive on efnet here but it's basically the same people as ehere [21:52] we don't do any of the book stuff for example: https://blog.archive.org/2015/05/08/thank-you-robert-miller-for-2-5-million-books-for-free-public-access/ [21:53] or the music stuff: https://blog.archive.org/2014/10/28/building-music-libraries/ [21:53] or the tv stuff: https://blog.archive.org/2012/09/17/launch-of-tv-news-search-borrow-with-350000-broadcasts/ [21:53] etc etc [21:54] I see. The website could do with some of that info. [21:54] they also do the majority of the web crawls for the wayback machine [21:57] basically they are a real organization with a building and paid employees whereas we are some goofballs on irc [21:58] So, if I have questions regarding archive.org, is this the appropriate place to ask? [21:58] depends, it is possible someone in here will know but for an "official" answer you should e-mail info@archive.org [21:59] Ok. I will keep that in mind. [22:00] SketchCow is the only person in the channel who actually works there [22:01] For now I would like to ask if a website that was on the Wayback Machine, but it's not available anymore due to robots.txt is lost forever. [22:01] I would like to be able to recover https://2dteleidoscope.wordpress.com [22:02] archive.org does retain the data in that case but I don't know if they would release it to the public without permission [22:03] Just knowing that it's retained is enough to give me some hope :) [22:06] There are several blogs like it that have died on me. Now I want to start archiving them all. Still figuring things out though. [22:07] By the way, what prompted me to come here was this thread: https://lainchan.org/cyb/res/16692.html [22:07] people from this channel might want to chime in there [22:08] also answer my questions maybe: https://lainchan.org/cyb/res/16692.html#17965 [22:12] *** xk_id_ has quit IRC (Remote host closed the connection) [22:19] *** Start has joined #archiveteam [22:20] *** nertzy has joined #archiveteam [22:25] *** rxhivert has quit IRC (Quit: rxhivert) [22:44] *** Ungstein has quit IRC (Ping timeout: 252 seconds) [22:45] *** nertzy has quit IRC (Quit: Leaving) [22:53] *** wednesday has quit IRC (Quit: Be the change that you wish to see in the world.) [22:57] *** rxhivert has joined #archiveteam [23:03] *** bzc6p_ has joined #archiveteam [23:03] *** swebb sets mode: +o bzc6p_ [23:03] *** nertzy has joined #archiveteam [23:10] *** bzc6p has quit IRC (Ping timeout: 615 seconds) [23:15] *** wednesday has joined #archiveteam [23:19] *** xk_id has joined #archiveteam [23:19] *** rxhivert has quit IRC (Quit: rxhivert) [23:48] *** rxhivert has joined #archiveteam [23:48] *** rxhivert has quit IRC (Connection closed) [23:54] *** philpem has quit IRC (Ping timeout: 252 seconds)