[00:02] *** FireFly has joined #archiveteam-bs [00:20] Closedverse (closed.pizza) requires a login now, possibly due to my ArchiveBot job. [00:20] It got pretty far though, was still running fine last time I checked 1-2 hours ago, so it should've grabbed something like 500k post IDs (of 586k total). [00:42] *** indrora has joined #archiveteam-bs [01:09] wget keeps pulling down a file over and over. [01:20] indrora: What do you mean? [01:20] godane: How do you go about digitizing tapes? I've got someone that has a few really old ones. [01:21] its going good [01:21] i'm at tape 13 i think [01:21] *tape 12 [01:24] What do you use to digitize them though [01:24] a ezcap usb stick [01:25] need red,white, and yellow cables for it [01:26] k [01:42] hook54321: I'm trying to get a good archive of a wikispace to make sure it's done right, and it's pulling a single file, /s/blank.html over and over again. [01:44] hook54321: I'm using --page-prequisites and passing in a series of urls via -i and --reject 'blank.html' doesn't work; [01:44] Wartower posted an update regarding their shutdown: http://www.wartower.de/forum/showthread.php?t=1189076&page=3&p=8422871&viewfull=1#post8422871 [01:47] TL;DR/translation: They'll sort-of merge with GW2Community but continue running Wartower as a separate entity with reduced content. They already started deleting content (about 1/8th of the posts is gone). [01:47] Ewww' [01:49] Can someone validate that these WARCs look good? http://tsunami.zaibatsutel.net/wiki-vacancies/ [01:59] I know that there's a problem where if there's urlencoded data in the url list it gets broken [02:06] so one tape has a bit of plastic in it [02:07] i will see about fix it later [02:07] anyways digitize tape 13 right now [02:17] Fantastic, I found a perl oneliner that does what I need [02:18] *** superkuh has quit IRC (Quit: the neuronal action potential is an electrical manipulation of reversible abrupt phase changes in the lipid bilaye) [02:31] *** atrocity has joined #archiveteam-bs [02:38] *** ItsYoda has quit IRC (Ping timeout: 260 seconds) [02:40] indrora: I don't know that much about wget, someone else might though. [02:49] When passing urls via -i it expects them to be de-urlencoded [02:49] It's working now [02:55] Weird, the behavior for wget isn't totally consistent. There's files on this test wiki I'm mirroring that have percent-encoded filenames [02:56] http://corylibrary.wikispaces.com/file/view/Cory%20Cougar%20Logo.jpg <-- The wiki software is expecting this to be urlencoded (making it doubly-urlencoded) [03:12] *** ItsYoda has joined #archiveteam-bs [03:27] I just found out about this, but looks like blizzard forums are going into read-only, and soon to be deleted on Jan 20 [03:27] https://us.battle.net/forums/en/overwatch/topic/20761366034 [03:28] This is more of an Archive bot project im assuming [03:28] So if someone could que that it would be much apperciated [03:28] *** robink has quit IRC (Ping timeout: 246 seconds) [03:42] (Feb 20, not Jan) [03:42] mundus: Is it just the overwatch section of the forums, or? [03:44] Sorry, Feb [03:44] ill try and see [03:45] k [03:45] Must be just the overwatch section [03:45] >This is why we’re excited to announce that we will be launching new Overwatch forums next month. [03:53] hmm. It's also on the Europe version of their forums as well. https://eu.battle.net/forums/en/overwatch/topic/17617891711 [03:58] It's probably on all languages/regions [04:03] Is there a list of all the regions and languages with links to their forums? [04:05] !d x39guix37xhpk9pqjzpu8qaq 12500 12500 [04:05] oops [04:21] *** ndiddy_ has quit IRC () [04:28] *** ItsYoda has quit IRC (Ping timeout: 260 seconds) [04:34] *** Stilett0 has joined #archiveteam-bs [04:34] *** Stiletto has quit IRC (Read error: Operation timed out) [04:43] *** qw3rty116 has joined #archiveteam-bs [04:43] *** BlueMax has joined #archiveteam-bs [04:47] *** qw3rty115 has quit IRC (Read error: Operation timed out) [05:34] *** ItsYoda has joined #archiveteam-bs [06:39] *** odemg has quit IRC (Read error: Operation timed out) [06:41] *** odemg has joined #archiveteam-bs [08:02] *** zhongfu has quit IRC (Remote host closed the connection) [08:03] *** zhongfu has joined #archiveteam-bs [08:21] i'm on to tape 14 [08:21] i'm uploading tape 13 as is to FOS [08:21] based on what i can tell i came from jan 1995 [08:21] this took up most of it: http://cartoonnetwork.wikia.com/wiki/Night_of_the_Vampire_Robots [08:22] its fussy though [08:58] so one box is empty [08:58] i'm giving the vcr a rest for now [09:03] also tape 14 is done [09:04] it was just south part s02e02 episode airing [10:14] *** BlueMax has quit IRC (Leaving) [12:22] *** Jusque_ has joined #archiveteam-bs [12:28] *** Jusque has quit IRC (Read error: Operation timed out) [12:28] *** Jusque_ is now known as Jusque [12:57] *** Sanqui has quit IRC (Ping timeout: 260 seconds) [12:59] *** Sanqui has joined #archiveteam-bs [14:28] *** superkuh has joined #archiveteam-bs [16:31] *** Mateon1 has quit IRC (Ping timeout: 252 seconds) [16:31] *** Mateon1 has joined #archiveteam-bs [17:44] *** jschwart has joined #archiveteam-bs [18:06] *** K4k has joined #archiveteam-bs [19:53] *** ranavalon has joined #archiveteam-bs [19:53] *** ranavalon has quit IRC (Remote host closed the connection) [19:53] *** ranavalon has joined #archiveteam-bs [19:54] *** ola_norsk has joined #archiveteam-bs [19:55] forgot to add the -bs when i joined [20:01] someone get the noose [20:04] schbirid: hehe, it would be quite disruptite if i continued that habit [20:04] * ola_norsk is the rambling king [20:05] kind* [20:06] rambling king [20:06] lol [20:10] sometimes the fingers fraudian slip i guess [20:13] i'm at 1,200,370 items now [20:17] *** icedice has joined #archiveteam-bs [20:41] i'm at 5 :) [20:45] 2. [21:48] anyway to tweak this into _actually_ appending subjects? http://paste.ubuntu.com/p/4nffshqgqD/ (the input textfile has one "subject" per line) [21:50] as it is, it simply contatenates $p to the string .. [21:52] does each subject need a single "--metadata=subject:" ? [21:59] *** icedice has quit IRC (Read error: Operation timed out) [22:02] *** BlueMax has joined #archiveteam-bs [22:26] *** Ravenloft has joined #archiveteam-bs [22:30] *** schbirid has quit IRC (Leaving) [22:39] this guys seems adament to get his name removed from internet .. https://archive.org/post/1087754/delete-request [22:39] guy* [22:42] i figured since it was 'faqs' section of forum, i can give personal opinion? [22:51] i would say from experience it's easer to simply change name. ~3 months of form processing. ~3-8+ months updating online/electronic services to the fact .. ~restofyourlife trying to get your grandparent to respect name change [22:52] lol [22:53] it's still easier thoug... [23:17] does IA delete things off of wayback based on 'my name is mentioned' though? If so, how do they verify that the request is from the actual person mentioned in/on item/page? [23:19] i mean, if that's the case, someone could just get entire blogs wiped by claiming to be the owner.. [23:20] author* [23:26] that specific twitlong seems incredibly "waaaah, i feel scammed eventhough i didn't ever call police", anyway...so i can't even fathom why anyone would care [23:30] *** ola_norsk has quit IRC (I just recieved 1 trillon -1 stars review on Yelp! I quit internet! https://youtu.be/4JL6vKB5qMU) [23:35] *** jschwart has quit IRC (Quit: Konversation terminated!) [23:36] *** Sanqui has quit IRC (Ping timeout: 260 seconds) [23:37] This needs seeding long term, it's only 462MB [23:37] https://www.reddit.com/r/JustArchivistThings/comments/7yr9jw/data_release_proponent_of_free_energy_doesnt