[00:09] *** Stilett0 has joined #archiveteam
[00:21] *** ola_norsk has joined #archiveteam
[00:21] WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD
[00:34] *** Valentine has quit IRC (Read error: Connection reset by peer)
[00:34] STATE YE PURPOSE FOR REQUIRING THINE SECRET WORD (and someone that has it will help you in a little bit)
[00:34] *** TheLovina has joined #archiveteam
[00:39] *** Valentine has joined #archiveteam
[00:39] *** K4k has joined #archiveteam
[00:43] *** Stiletto has joined #archiveteam
[00:43] *** Stilett0 has quit IRC (Ping timeout: 250 seconds)
[00:51] wp494: it's fine and dandy, i forgot the magic of irc logs :d
[00:51] *** Valentine has quit IRC (Read error: Operation timed out)
[00:51] PSA #1.5: roblox forum grabs are now available in warrior too, so forget that bit I said about it not being there
[00:54] *** Stilett0 has joined #archiveteam
[00:55] *** Valentine has joined #archiveteam
[00:58] *** Stiletto has quit IRC (Ping timeout: 245 seconds)
[01:20] SketchCow: I would like to access the wiki in order to add more URL shorteners to the list of URL shorteners and add information on search queries that one can use to easily discover URL shorteners based on open source scripts
[01:25] *** Stiletto has joined #archiveteam
[01:26] *** Stilett0 has quit IRC (Ping timeout: 250 seconds)
[01:28] *** Dimtree has quit IRC (Peace)
[01:30] *** Stiletto has quit IRC (Read error: Operation timed out)
[01:32] *** Stilett0 has joined #archiveteam
[01:35] *** Dimtree has joined #archiveteam
[01:58] *** heauxart has joined #archiveteam
[02:03] *** Stilett0 has quit IRC (Ping timeout: 264 seconds)
[02:05] *** ola_norsk has quit IRC (Ping timeout: 480 seconds)
[02:08] *** mefiga has quit IRC ()
[02:10] *** heauxart has quit IRC (Ping timeout: 260 seconds)
[02:10] *** kristian_ has joined #archiveteam
[02:53] *** schbirid has quit IRC (Ping timeout: 255 seconds)
[02:56] *** K4k has quit IRC (Read error: Operation timed out)
[03:04] *** schbirid has joined #archiveteam
[03:37] *** ld1 has quit IRC (Quit: ~)
[03:44] *** ld1 has joined #archiveteam
[03:55] *** kristian_ has quit IRC (Quit: Leaving)
[03:57] *** Stilett0 has joined #archiveteam
[04:08] *** qw3rty113 has joined #archiveteam
[04:14] *** du_ has quit IRC (Ping timeout: 260 seconds)
[04:15] *** qw3rty112 has quit IRC (Read error: Operation timed out)
[04:18] *** Aerochrom has joined #archiveteam
[04:31] *** Vito` has joined #archiveteam
[04:31] is there a reason IA would say that a URL comes from the archive team collection, but there isn't an archive team collection item with that date?
[05:12] *** Pixi` has joined #archiveteam
[05:16] *** Pixi has quit IRC (Ping timeout: 255 seconds)
[05:20] *** Pixi` has quit IRC (Quit: Pixi`)
[05:21] *** Pixi has joined #archiveteam
[06:09] *** wp494 has quit IRC (Read error: Operation timed out)
[06:09] *** wp494 has joined #archiveteam
[06:22] *** K4k has joined #archiveteam
[07:40] treora: good to see some actually interesting projects in the prototype fund :D
[09:05] *** icedice has joined #archiveteam
[09:06] *** wp494 has quit IRC (Quit: LOUD UNNECESSARY QUIT MESSAGES)
[09:18] *** CoolCanuk has quit IRC (Quit: Connection closed for inactivity)
[09:46] *** ZexaronS has joined #archiveteam
[10:10] *** icedice has quit IRC (Read error: Connection reset by peer)
[10:49] *** BlueMaxim has quit IRC (Read error: Connection reset by peer)
[10:52] *** bwn has quit IRC (Read error: Operation timed out)
[10:53] *** bwn has joined #archiveteam
[11:03] *** ivan has quit IRC (Read error: Operation timed out)
[11:03] *** marvinw has joined #archiveteam
[11:03] *** liam has quit IRC (Read error: Operation timed out)
[11:04] *** liam has joined #archiveteam
[11:38] *** phaedra has joined #archiveteam
[11:42] *** phaedra has quit IRC (Client Quit)
[11:59] Vito`: The item isn't necessarily from the date of the grab. It can be at any later point in time too. To narrow it down, you'd have to search for items whose "firstfiledate" metadata field is lower than (or equal to) the date of the URL you're accessing and whose "lastfiledate" field is larger than (or equal to) that date. It doesn't look like IA's search supports those fields though.
[12:02] (Firstfiledate and lastfiledate give the dates of the first (earliest) and last (latest) WARC record in an item.)
[12:02] *** Morbus has quit IRC (Quit: http://www.disobey.com/)
[12:23] *** zino_ has quit IRC (Read error: Operation timed out)
[12:59] *** Specular has joined #archiveteam
[13:01] *** zino_ has joined #archiveteam
[13:01] *** kristian_ has joined #archiveteam
[13:08] *** Mateon1 has quit IRC (Remote host closed the connection)
[13:09] *** Mateon1 has joined #archiveteam
[13:58] *** SketchCow has quit IRC (Read error: Connection reset by peer)
[14:15] *** schbirid has quit IRC (Ping timeout: 255 seconds)
[14:27] *** schbirid has joined #archiveteam
[14:29] *** Morbus has joined #archiveteam
[14:37] *** SketchCow has joined #archiveteam
[14:38] *** paul2520 has joined #archiveteam
[14:44] *** SketchCow has quit IRC (Read error: Connection reset by peer)
[14:53] hi there - longtime archive.org user, first time here. I contribute to a wiki that is being decommissioned. Announcement officially went up today @ http://www.sascommunity.org/wiki/Main_Page I know many of the pages have been archived, but definitely plenty have not. What can I do to ensure it gets crawled?
[14:54] Hi paul2520
[14:54] We can chuck it through archivebot
[14:54] How long's the time to death?
[14:54] So the official statement is "Our first step will be to put the site into ‘Read Only Mode’ on January 1, 2018."
[14:54] Ok, I've just read that, So that's fine
[14:54] seems like admin work will go on still, and possible move of pages elsewhere
[14:55] Ok, I've added it to the crawler
[14:56] thanks Igloo! you're the best :-)
[15:12] *** Mateon1 has quit IRC (Ping timeout: 260 seconds)
[15:13] *** Mateon1 has joined #archiveteam
[15:40] *** kristian_ has quit IRC (Ping timeout: 360 seconds)
[16:14] *** Aerochrom has quit IRC (Ping timeout: 248 seconds)
[16:15] *** Aerochrom has joined #archiveteam
[16:18] *** kristian_ has joined #archiveteam
[16:34] paul2520: can you get them to make a full export?
[16:34] If you want to keep the wiki alive elsewhere, you can use https://www.mediawiki.org/wiki/Manual:Grabbers to also migrate users (opt-in)
[16:36] Nemo_bis: thanks for the tips - I've asked about a full export. Will follow-up, as they've asked me to be on the admin team to do some cleanup and think about what we can do as far as exporting.
[16:44] *** du_ has joined #archiveteam
[16:46] *** CoolCanuk has joined #archiveteam
[16:53] *** fireglow has joined #archiveteam
[17:08] *** kristian_ has quit IRC (Quit: Leaving)
[17:35] *** icedice has joined #archiveteam
[17:53] *** rbraun has joined #archiveteam
[18:09] *** ranavalon has quit IRC (Read error: Connection reset by peer)
[18:09] *** ranavalon has joined #archiveteam
[18:12] *** Netham45 has joined #archiveteam
[18:15] Hey, so I just found a siterip of xbox-scene.com from 2014 (site went down in mid 2016), was an active Xbox hacking forum with ~4 million posts, any chance you guys can archive it? Found the download here: https://www.reddit.com/r/originalxbox/comments/78qmop/xboxscenecom_forums/ . Got it up on my server right now, https://forums.xbox-scene.com.xbox-scene.tk/.
[18:23] Netham45: I put it in archivebot
[18:27] JAA: ah, okay, thanks. I'll see if I can find it in later drops. Is there any external search elsewhere?
[18:31] *** icedice has quit IRC (Read error: Connection reset by peer)
[18:49] *** icedice has joined #archiveteam
[19:25] Vito`: I don't know any. Except for ArchiveBot, but not with the level of detail you need for your query.
[19:28] thanks!
[19:39] *** jschwart has joined #archiveteam
[19:54] *** RichardG has quit IRC (Read error: Connection reset by peer)
[19:56] *** RichardG has joined #archiveteam
[20:01] *** wp494 has joined #archiveteam
[20:03] *** icedice has quit IRC (Ping timeout: 260 seconds)
[20:26] *** mefiga has joined #archiveteam
[20:36] *** Stilett0 has quit IRC (Ping timeout: 260 seconds)
[20:38] *** marvinw is now known as ivan
[20:39] *** Stilett0 has joined #archiveteam
[21:11] *** Morbus has quit IRC (Read error: Operation timed out)
[21:11] *** Morbus has joined #archiveteam
[21:24] *** SketchCow has joined #archiveteam
[21:24] --------------------------
[21:24] hi,
[21:24] just want to inform you that the main Polish blog hosting
[21:24] (http://www.blog.pl/katalog) will be down at the end of January 2018
[21:24] (and it is online since 2005). I wonder if is it ok to ask you for
[21:24] help with preserving its content (I have no server resources to
[21:24] collect that blogs and turn them into WARC but maybe can help by
[21:24] scraping the list of subdomains to collect - I work in R).
[21:24] the official info (in Polish)
[21:24] http://www.wirtualnemedia.pl/artykul/blog-pl-koniec-dzialalnosci-grupa-onet-rasp-to-efekt-rosnacej-roli-mediow-spolecznosciowych
[21:24] regards!
[21:24] --------------------------
[21:47] *** BlueMaxim has joined #archiveteam
[21:53] SketchCow: your porn tapes are getting digitized right now
[21:53] i'm on the 2nd one right now
[21:54] i have like 6 tapes left from you
[21:54] one of the tapes i just have to check if there is anything to capture on it
[21:55] it as showtime airing of much ado about nothing
[21:57] SketchCow: i think we can do faster tape ripping next time if we go after the t-60 and under tapes
[21:57] Hurrah
[21:57] Great
[21:57] you can send me a ton of those and things like 5 to 8 tapes done a day
[21:58] again this once i mail on of the boxes
[21:58] *one of
[22:22] *** ranavalon has quit IRC (Read error: Connection reset by peer)
[22:22] *** ranavalon has joined #archiveteam
[22:28] *** ranavalon has quit IRC (Read error: Connection reset by peer)
[22:34] *** ranavalon has joined #archiveteam
[22:39] *** ranavalon has quit IRC (Read error: Connection reset by peer)
[22:40] *** ranavalon has joined #archiveteam
[23:00] *** qw3rty114 has joined #archiveteam
[23:03] *** qw3rty113 has quit IRC (Read error: Operation timed out)
[23:09] *** qw3rty114 has quit IRC (Read error: Connection reset by peer)
[23:12] *** ld1 has quit IRC (Quit: ~)
[23:12] *** ld1 has joined #archiveteam
[23:14] *** ld1 has quit IRC (Client Quit)
[23:16] *** qw3rty114 has joined #archiveteam
[23:17] *** ld1 has joined #archiveteam
[23:32] *** Specular has quit IRC (Leaving)
[23:43] *** dashcloud has quit IRC (Quit: No Ping reply in 180 seconds.)
[23:44] *** ld1 has quit IRC (Quit: ~)
[23:44] *** ld1 has joined #archiveteam
[23:45] *** dashcloud has joined #archiveteam
[23:49] *** Odd0002 has quit IRC (Quit: ZNC - http://znc.in)
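
Note on the [11:59] answer: the date-range check described there (an item may contain a capture if its "firstfiledate" is on or before the capture date and its "lastfiledate" is on or after it) could be sketched roughly as below. This is only an illustration under assumptions: the item records, the identifiers, and the `items_covering` helper are hypothetical, and the two fields are assumed to hold plain `YYYY-MM-DD` strings, which the log does not confirm. Since the log says IA's search does not appear to support these fields, the metadata would have to be gathered some other way first.

```python
from datetime import date


def items_covering(items, capture_date):
    """Return identifiers of items whose WARC record dates span capture_date.

    `items` is a list of dicts with hypothetical keys "identifier",
    "firstfiledate" and "lastfiledate"; dates are assumed to be
    ISO strings of the form YYYY-MM-DD.
    """
    target = date.fromisoformat(capture_date)
    matches = []
    for item in items:
        first = date.fromisoformat(item["firstfiledate"])
        last = date.fromisoformat(item["lastfiledate"])
        # Keep the item only if first <= capture date <= last (both inclusive).
        if first <= target <= last:
            matches.append(item["identifier"])
    return matches


# Made-up metadata records, purely for illustration.
items = [
    {"identifier": "example_item_a",
     "firstfiledate": "2017-11-01", "lastfiledate": "2017-11-20"},
    {"identifier": "example_item_b",
     "firstfiledate": "2017-12-01", "lastfiledate": "2017-12-15"},
]

print(items_covering(items, "2017-12-10"))
```

A match only narrows down the candidates; it does not prove that the capture is actually stored in that item.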