[00:27] *** jrwr has quit IRC (Remote host closed the connection) [00:28] *** jrwr has joined #archiveteam-bs [00:32] *** etudier has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [02:01] *** Honno has joined #archiveteam-bs [02:06] i'm starting to upload this: https://archive.org/details/acolumbinesite.com-cd-collection [03:33] *** Honno has quit IRC (Read error: Operation timed out) [04:01] *** ndizzle has joined #archiveteam-bs [04:07] *** ndizzle has quit IRC (Quit: Leaving) [04:08] *** ndiddy has quit IRC (Read error: Operation timed out) [05:17] *** Aranje has quit IRC (Quit: Three sheets to the wind) [05:22] *** Sk1d has quit IRC (Ping timeout: 250 seconds) [05:29] *** Sk1d has joined #archiveteam-bs [05:37] *** Whopper has quit IRC (Remote host closed the connection) [05:39] *** Whopper has joined #archiveteam-bs [05:40] *** VADemon_ has quit IRC (Read error: Connection reset by peer) [05:42] i'm at 960k items now [05:43] where do you get all this stuff godane o.o [05:51] half of it is gov docs [06:39] *** BlueMaxim has quit IRC (Quit: Leaving) [06:43] *** kristian_ has joined #archiveteam-bs [07:29] *** Start_ has quit IRC (Read error: Connection reset by peer) [07:29] *** Start has joined #archiveteam-bs [08:16] *** GE has joined #archiveteam-bs [08:20] *** BlueMaxim has joined #archiveteam-bs [08:23] *** Honno has joined #archiveteam-bs [08:58] So did Tripod and Angelfire go down? I have been pre-occupied with other stuff the last few months, and I return to some projects and all the Angelfire/Tripod links I had in spreadsheets are gone :/ [08:58] http://quake3tweaks.tripod.com/commands.html [08:58] I checked logs and it was mentioned back in june of there being some issues [09:11] *** ravetcofx has quit IRC (Read error: Operation timed out) [09:12] it looks like they went back and torched all old accounts, so massive data loss [09:12] i0npulse: I happened to stumble upon a angelfire link yesterday, not sure what the message was but wouldn't load [09:13] 404 forbidden [09:14] https://webcache.googleusercontent.com/search?q=cache:K-QA_xA_3ngJ:http://www.angelfire.com/games4/stealthonline/mgs2artwork.html%2BAngelfire+stealth+online&hl=en&ct=clnk [09:14] 2 weeks ago [09:14] http://www.angelfire.lycos.com/ [09:15] You don't have permission to access / on this server. [09:15] whats funny is this page has been grab 40 times: https://web.archive.org/web/*/http://quake3tweaks.tripod.com/commands.html [09:15] the home page was only grab 3 times back in 2004 [09:16] i had been cought up with other stuff, life stuff and other archival projects and had set to return to site archiving for the next 6 months+ [09:16] really frustrating that this stuff dropped off the map [09:17] it happens [09:17] i was AOL files project for about 1 month [09:17] then it was gone [09:20] ah man, google cached this stuff on October 18th [09:20] i sure hope its just a network issue at lycos [09:20] https://webcache.googleusercontent.com/search?q=cache:EyLtWCxzG7UJ:http://quake3tweaks.tripod.com/configs.html%2BQuake+3+tripod&hl=en&ct=clnk [09:20] this page on the 26th [09:20] https://webcache.googleusercontent.com/search?q=cache:wGMH0XtIlt0J:http://kpush.tripod.com/tweaking/id4.html%2BQuake+3+tripod&hl=en&ct=clnk [09:20] man this stuff has to come back up [09:23] http://www.angelfire.lycos.com/ was snapshot cached by google on October 28th [09:23] friday so.. hoping its a network migration on their part [10:00] *** Yoshimura has joined #archiveteam-bs [10:19] *** kristian_ has quit IRC (Quit: Leaving) [10:40] *** BartoCH has joined #archiveteam-bs [10:49] *** kyounko has quit IRC (Ping timeout: 260 seconds) [12:00] *** BlueMaxim has quit IRC (Quit: Leaving) [13:11] *** Honno has quit IRC (Read error: Operation timed out) [13:48] *** GE has quit IRC (Remote host closed the connection) [14:12] *** pikhq_ has joined #archiveteam-bs [14:12] *** pikhq has quit IRC (Ping timeout: 260 seconds) [14:53] *** Jordan_ has joined #archiveteam-bs [14:54] *** Jordan has quit IRC (Ping timeout: 250 seconds) [14:54] *** Jordan_ is now known as Jordan [14:55] *** Kksmkrn has quit IRC (Ping timeout: 250 seconds) [14:55] *** HCross has quit IRC (Ping timeout: 250 seconds) [14:55] *** hawc145 has joined #archiveteam-bs [14:56] *** luckcolor has quit IRC (Ping timeout: 250 seconds) [14:56] *** luckcolor has joined #archiveteam-bs [14:58] *** hawc145 is now known as HCross [15:01] *** Madthias- has joined #archiveteam-bs [15:05] *** tephra_ has joined #archiveteam-bs [15:06] *** Madthias has quit IRC (Ping timeout: 250 seconds) [15:06] *** Medowar0 has quit IRC (Ping timeout: 250 seconds) [15:06] *** tephra has quit IRC (Ping timeout: 250 seconds) [15:21] *** Kksmkrn has joined #archiveteam-bs [15:25] *** signius has quit IRC (Read error: Operation timed out) [15:28] *** signius has joined #archiveteam-bs [15:41] *** GE has joined #archiveteam-bs [16:14] re angelfire/lycos, i got this email on 27-oct: [16:14] We will be upgrading our network on Friday, October 28 - Monday, October 31 so that we will be able to serve you better. The upgrade will help us to provide greater site stability, allow us to make major enhancements soon to our Tripod and Angelfire products, and generally to serve you better. [16:14] In order to allow you to understand what to expect, we have created a page at http://move.lycosstatus.com which will give you more information. We will frequently update it with what we're currently working on. [16:14] As with all things in life, this plan is subject to change due to unforseen circumstances. [16:14] We appreciate your patience while we make this important upgrade to your service. [16:16] got a similar one on the 25th [16:17] it's pretty much definitely planned, imo [17:06] *** kristian_ has joined #archiveteam-bs [17:32] *** ravetcofx has joined #archiveteam-bs [17:41] *** Simpbrain has joined #archiveteam-bs [18:09] sooooooo#] [18:09] http://www.itv.com/hub/meet-the-parents/2a4346a0004 [18:10] father in law is on this and would like a copy [18:10] If anyone can help get the vcideo that'd be great [18:10] my best suggestion is youtube-dl [18:11] my next best suggestion is to use this https://github.com/odie5533/WarcProxy and then see if you can extract it from the warc [18:16] My suggestion is to use of almighty rightclick [18:17] I dig the moderator [18:17] SmileyG: Well cannot help you much, it requires login, but can help you by direct assistance if you want. [18:21] rightclick? [18:21] it seems to be flash [18:28] Seems or is? [18:29] I would like to watch that too. We had the exact same thing in TV, except many many years ago. Seems like they copied that idea or sold. [18:31] ITV Hub is only available to viewers in the UK. If you want to watch in Europe, you may be able to use our ITV Essentials service, which offers our top pick of shows in selected countries. [19:00] *** RichardG has joined #archiveteam-bs [19:21] *** Simpbrain has quit IRC (Remote host closed the connection) [19:22] Yoshimura: that's exactly why I'm trying to get this vid :D [19:23] Yoshimura: well I right click, this is in chrome, and it says about adobe flash player ... [19:24] *** RichardG has quit IRC (Ping timeout: 370 seconds) [19:31] Then it's flash. I offered direct assistance. Now I am busy plus do not have time explaining stuff in detail. The WARCProxy would be good try. [19:33] *** kristian_ has quit IRC (Quit: Leaving) [19:42] k [20:52] *** ndiddy has joined #archiveteam-bs [21:06] *** kniffy has left [21:48] *** GE has quit IRC (Remote host closed the connection) [21:56] *** kristian_ has joined #archiveteam-bs [21:57] *** powerKitt has joined #archiveteam-bs [21:58] xmc: thanks for the lycos link [21:58] much appreciated [21:59] sure thing [22:12] how does one extract links from google? [22:13] Without hitting captcha (which is broken and does not let me continue even when typed) [22:14] Yoshimura: short answer: you don't [22:14] Yoshimura: longer answer: using a link extraction plugin for your browser of choice, and a lot of patience [22:15] (or paying somebody a lot of money to do it for you) [22:16] FireFox + Snaplinks extension is what I use [22:16] it lasso's up links on a page [22:16] copies them to the clipboard in the format you define in options, and just past into file/spreadsheet [22:17] Thanks. Well Yeah I was in ff. [22:19] www.startpage.com if you want to be anonymous [22:20] "anonymous" [22:20] they still see your searches, you're trusting them on their word that they're throwing that data away ;) [22:21] ok i will rephrase lol [22:21] obfuscated to certain parties [22:23] *** BlueMaxim has joined #archiveteam-bs [22:24] joepie91: I'd rather trust them than google :p [22:25] I don't know. it concerns me that startpage tries to bank on the privacy angle [22:26] *** powerKitt has quit IRC (Quit: Page closed) [22:26] even if they were lying, they don't have all my account info and other shit to link it to and profit from [22:27] they could be some kind of front for [adversary], but so too could Google. [22:30] if anything startpage is another portal into the google data, and could help skirt around some data formatting [22:30] For instance a lot of links pre-rollover look like this: [22:31] https://www.google.com/url?q=http://www.polygon.com/2016/9/9/12863124/dreamcast-new-games-homebrew-vmu&sa=U&ved=0ahUKEwj_sPH6yYPQAhWC2YMKHc0GCfAQFgg9MAs&usg=AFQjCNEupwKzM49y2iWAKIzM7Eyhc69UhA [22:31] in the google.com search results [22:31] startpage skirts around that URL format showing up on manual link scrapes [22:32] obviously some regex can solve for that as well ;) [22:32] as an aside I do find it quite annoying when I absent-mindedly copy a link from Google search results and I end up pasting a giant clusterfuck of a URL into a channel [22:33] That tracks [22:33] lol [22:46] *** Medowar67 has joined #archiveteam-bs [22:46] *** Medowar67 is now known as Medowar0 [23:50] arkiver: Can we get NXP by archivebot then fill in holes?