[09:04] Are there examples on how to do a proper mirror of a website to a warc?
[09:39] On the wget page of archiveteam.org.
[10:52] So, to convert those ugly cbr files I ended up using KRename and PeaZip.
[13:44] Why did you rename them?
[13:44] archive can take .cbr and .cbz now
[13:51] SketchCow: because I had no idea.
[13:51] SketchCow: do _images.cbr work too? The content is not tidy at all.
[13:57] Ah, the _images is not even needed. http://blog.archive.org/2012/05/24/uploading-images-for-text-items/
[13:57] Sigh, I had even read that post I think. :/
[14:04] Yes
[14:04] I had them put in cbr and cbz support to save time
[14:04] The system unpacks them and makes it work
[14:07] Also, you're likely doing some magazines double.
[14:07] It's easier in the future just to shove these files at me.
[14:09] You got the attention of one of the developers from your uploading efforts, so congrats on that.
[14:10] http://archive.org/details/your-computer-magazine
[14:10] For example.
[14:11] I did check them before downloading.
[14:11] Let's see what happened in this case.
[14:14] Hm no idea.
[14:15] Most of the work is metadata, not downloading; so if you just want me to send you files you'd better directly download them. :)
[14:15] There's a list at http://archiveteam.org/index.php?title=Magazines_and_journals in case you lack ideas. :p
[14:16] I'm very sorry for YourComputer. :/
[14:22] SketchCow: I stopped the current upload of YourComputer but I can't stop derives. There shouldn't be many more duplicates, except among darkened stuff which I can't search.
[14:38] A couple collections I'm really eager to upload are "Meccano magazine" (1916-1981) and "Zzap!" (Italian version). :)
[14:42] Yes, those are much more relevant.
[14:43] by the way: the amount I like that we have piratebay links up on archiveteam.org: zero
[14:45] Better than negative?
[14:45] Linking is not illegal
[14:46] SketchCow: how many issues do you have in https://archive.org/details/microhobby-magazine ?
[14:47] * Nemo_bis trying to check better for duplicates now, not only search.
[14:48] Yes, thank you. "Linking is not illegal"
[14:49] I cruise the pirate bay and a bunch of other scanning efforts that are public and I put them into archive.org constantly.
[14:49] So that action is pretty redundant.
[14:50] Which action?
[14:50] download ALL the torrents.
[14:50] I'd like to see us come up with a system for submitting metadata improvements to collections on archive.org. The culture of the place (and logic) dictate a wikipedia-like maintenance of the metadata will never come forward, but coming up with a way to submit metadata so I can manually shove it in would be very helpful.
[14:52] That would surely be wonderful.
[14:53] SketchCow: thanks for the attack of the show collection
[14:54] How the hell are you staying on top of me doing that?
[14:54] It doesn't notify you, does it?
[14:54] https://archive.org/search.php?query=jscott&sort=-publicdate ?
[14:56] https://archive.org/details/railwaymodeller
[14:56] Yes, I realize it's not hard to FIND OUT what I do - it's more a question of how fast one would track what I'm doing.
[14:56] i have digit magazine
[14:57] the items would have to be called something like digit-india-magazine-v#i#
[14:57] also not all have covers
[14:58] SketchCow: it depends on how often one presses F5 I guess. :D
[15:00] i have a lot of waiting tasks just for editing meta data
[15:03] Anyway SketchCow, most of what I uploaded is from a private Italian tracker, in general I agree that it's nothing special but it doesn't harm uploading some stuff I bump into while looking for other things.
[15:03] (It doesn't harm unless I upload duplicates of course. :"( )
[15:03] yeah, and that's fine. I'm just saying that riding pirate bay is not the best way to go - I'm already doing that as part of my paid-for job
[15:04] SketchCow: that's why I put the list on the wiki, of course if you manage to do that stuff directly (or decide that it shouldn't be done) it will be much better (faster and better done).
[15:05] Yes, but I'm saying the entire "pull items from TPB" action isn't necessary to track.
[15:05] I was mostly looking for Italian stuff to avoid spending hundreds euros on buying magazines which someone already put on torrents. :p
[15:05] Statistically, I will go through all of the magazine and honestly even document and large-size torrents
[15:05] What do you mean, aren't todos useful?
[15:06] todos are useful in a roundabout sense
[15:06] uh?
[15:06] To-dos are useful when you are working on a set of items and want to close it down, or have a set of items multiple people are handling.
[15:07] And under the "pull items from The Pirate Bay to potentially put on archive.org", I have that one handled.
[15:07] "Pull items from private italian trackers", obviously you have that handled.
[15:08] Some (admittely few) of those items weren't so trivial to find, so you might have missed them.
[15:08] *admittedly
[15:08] Weren't so trivial to find.... on the pirate bay?
[15:08] Jesus, someone uploaded 1,441 newspapers. http://archive.org/details/narberthcivicassociation
[15:09] Meaning with very obscure name, no description, foreign language and so on
[15:09] https://archive.org/details/RedfishMagazine << on the right, does it always just list the file type, or can I have the names appear (i.e. I'm doing something wrong?)
[15:10] Let's bring this whole thing to #internetarchive
[15:10] But yes, for TPB most I can do is saving you some boring clicks/browsing, which is not bad though if I find something interesting?
[15:11] I am not indicating how strongly I do not like The Pirate Bay links on archiveteam.org
[15:11] Should those be replaced with titles without links?
[15:12] Sent to you as suggestion by email so that you can do when you have time/if they're worth it?
[15:12] I am always up for working with someone on suggested projects.
[15:12] if you find what you think are hidden gems, I always appreciate a mail. I get those a lot.
[15:12] I can absorb a torrent faster than nearly anyone, and have scripts to inject those items into archive.org very, very fast.
[15:13] metadata's a separate issue - I have a goal to make it a collaborative software issue.
[15:13] This is, again, us not in #internetarchive
[15:14] SmileyG: Please get over there too.
[15:27] http://www.flickr.com/photos/textfiles/sets/72157632295912594/with/8290464381/
[15:27] By the way,
[15:29] SketchCow: all the way by truck?
[15:30] No, halfway we switched to a food cart
[15:30] Ah, good, they're much faster than trains.
[15:36] 788MPH?!
[15:36] * SmileyG shuts up now