[00:35] anyone have thoughts on removing watermarks from pdfs [00:35] oops, forgot my question mark :( https://groups.google.com/group/science-liberation-front/t/c68964cf55d8f6fa [00:42] i hate ruby, it's probably not ruby's fault but its the most nitpicky install of a language ever!!! [00:43] im just trying to run one script, and it wants top copmpile itself all over again!?!? [01:04] hiya. i'm fixin' to update the reject-regex for yahooblogs grab, but the current regex seems off to me: [\\\\"\'] [01:04] am i wrong in thinking there's an extra backslash there? [01:05] if, indeed, the character set is \ and " and ' [01:32] Four backslashes sounds like two are to get passed the language's escaping, which gives you anotehr two which end up in the expression [01:42] aye. but there are only three unaccounted for. one is escaping the double quote. [01:42] well. if it's not a mistake, alard can verbally abuse me :) pull request is pending, btw. [02:20] kanzure: yeah, download it twice, keep the content that is the same [09:11] ARGH [09:11] alard: if I upload a txt file to s3 with a .7z extension, will s3 delete an existing 7z file? [09:12] I just did something as stupid as that and now I fear I'm going to lose my 49 GiB 7zip :( https://www.us.archive.org/log_show.php?task_id=140173799 [09:15] Nemo_bis: I think it overwrites files, yes. But since the task has apparently failed, you have a little time to download the original file. (Or ask someone with a fast connection to do that.) [09:15] Ooh, relief. I found the "interrupt" button. [09:15] alard: I killed it. :) [09:15] Hopefully this shouldn't break anything. [09:16] Is there an interrupt button? [09:17] Perhaps you should rename the file on archive.org, just to be sure. (Perhaps the task is automatically restarted.) [09:17] It's waiting for admin. [09:17] There's an interrupt button on the history page, for admins. [09:17] As far as I know one should never use it. :p [09:19] And download is awfully slow even from a USA server, doesn't go above 2.4 MiB/s and averages at 1, meh. [09:20] hmm [09:20] SKIPPING UPDATE to ftp-ftp.rta.nato.int_archive.torrent IN /35/items/ftp-ftp.rta.nato.int... item (50155 MB) exceeds maximum size (25600 MB) [09:21] The rename worked, but waited for the other task anyway, I had to also pass that one. [09:43] does anyone have a copy of the jstor charter/constitution? [14:23] The wiki says: Warning: file_get_contents(/home/archivet/public_html/extensions/SpamBlacklist/wikimedia_blacklist) [function.file-get-contents]: failed to open stream: No such file or directory in /home/archivet/public_html/extensions/SpamBlacklist/SpamBlacklist_body.php on line 123 [14:30] Maybe it was configured for a local file BL? [14:30] It should only be configured for fetch Meta-Wiki 's blacklist and [[MediaWiki:Spamblacklist]] [14:31] Perhaps I should add that everything still works, it's just a warning. It's not a very urgent problem. [14:32] alard: I took this screenshot and forgot to share it: http://imgur.com/Fnst3 [14:32] IPs are not behaving http://archiveteam.org/index.php?title=GeoCities&curid=78&diff=9132&oldid=9131 [14:35] And rollbacking is a pain, with captchas :) [14:36] What's the word that rhymes with hiccups? [14:36] backups [14:36] Ah, I always have to skip that too [14:36] Ah. [14:36] oh right :( [14:36] I'll replace that [14:36] I'm going to make a captcha-bookmarklet. [14:37] SketchCow: you should perhaps disable anonymous editing and at the same time try removing the captcha for new links. [14:38] Nemo_bis: That's a nice screenshot, but what browser is it? It doesn't look familiar. [14:38] Give me the LocalSettings.php [14:38] entries [14:39] I thought it was no anonymous editing [14:52] alard: firefox [14:53] alard: IIRC; but there are only the scrollbars, what are you looking at? ^^' [14:54] SketchCow: $wgGroupPermissions['*']['edit'] = false; [14:55] SketchCow: and after that, for captcha, if you wish $wgCaptchaTriggersOnNamespace[NS_MAIN]['addurl'] = false; [15:06] good news [15:06] i got the exterinal images form thebox.bz [15:08] https://gist.github.com/44e4e20da5777688dbe3 [15:30] SketchCow: this item is a typo: http://archive.org/details/cnetbuzz_120106_ [15:31] i kill that upload and reupload as cnetbuzz_120106 [18:17] i'm uploading all my thebox.bz forums warc.gz into one item [18:17] most are very small so i decide to to do it this way [21:37] aww, still anonymous edits http://archiveteam.org/index.php?title=Special:ListGroupRights