[00:10] HCross: awesome! [01:10] *** Honno has quit IRC (Read error: Operation timed out) [01:11] *** asfd has joined #archiveteam-bs [01:11] *** Honno has joined #archiveteam-bs [01:12] *** PotcFdk has left Leaving [01:13] *** PotcFdk has joined #archiveteam-bs [01:14] *** BlueMaxim has quit IRC (Read error: Operation timed out) [01:15] *** BlueMaxim has joined #archiveteam-bs [01:20] *** Honno has quit IRC (Read error: Operation timed out) [01:23] *** Honno has joined #archiveteam-bs [01:31] *** tomwsmf-a has joined #archiveteam-bs [02:05] *** tomwsmf-a has quit IRC (Read error: Operation timed out) [02:27] *** asfd has quit IRC (Quit: Leaving) [02:34] *** BlueMaxim has quit IRC (Quit: Leaving) [02:56] i'm at 666k items [02:59] beastin' [03:08] *** bwn has quit IRC (Read error: Operation timed out) [03:21] *** tomwsmf-a has joined #archiveteam-bs [03:55] so all of 1990 tagesschau 2000 is uploaded [03:57] all of the newer ones so far: https://archive.org/details/godaneinbox?sort=-publicdate&and[]=subject%3A%22tagesschau%22 [04:04] *** BlueMaxim has joined #archiveteam-bs [04:18] *** wyatt8750 has quit IRC (Read error: Operation timed out) [04:21] *** Sk1d has quit IRC (Ping timeout: 250 seconds) [04:28] *** Sk1d has joined #archiveteam-bs [04:50] *** wp494 has quit IRC (Read error: Connection reset by peer) [04:57] *** tomwsmf-a has quit IRC (Ping timeout: 258 seconds) [05:07] *** hook54321 has joined #archiveteam-bs [05:07] anyone know if their is a limit to how much someone can store on archive.org? [05:10] *** wp494 has joined #archiveteam-bs [05:10] *** wp494 has quit IRC (Excess Flood) [05:11] *** wp494 has joined #archiveteam-bs [05:27] *** JesseW has joined #archiveteam-bs [05:36] *** wyatt8740 has joined #archiveteam-bs [05:43] *** vitzli has joined #archiveteam-bs [05:46] hook54321: As long as what you are storing is seen (by IA staff) as contributing to the mission, there's no practical limit. If what you store creates legal or other trouble for IA, or is noticed for some other reason, and having been noticed, is seen as not contributing to the mission, then it may (likely will) be made inaccessible and the account you used to upload it may be prevented from uploading anything further. The best way to think about IA i [05:46] JesseW: you got cut off at "about IA i" [05:47] Frogging: thanks, will repost in smaller chunks. [05:47] hook54321: As long as what you are storing is seen (by IA staff) as contributing to the mission, there's no practical limit. [05:47] If what you store creates legal or other trouble for IA, or is noticed for some other reason, and having been noticed, is seen as not contributing to the mission, then it may (likely will) be made inaccessible and the account you used to upload it may be prevented from uploading anything further. [05:47] The best way to think about IA is not as a place for *you* to store stuff, but a place where stuff of general social value can be stored, and you can volunteer to be a custodian for some of it (by uploading it through your account). [05:47] If you want to volunteer to be a custodian for a whole LOT of stuff, as long as it's of value, that's welcomed. [05:47] (I am not a staffer at IA, or otherwise have much of any insider knowledge; this is just my observations from the outside.) [06:03] *** hook54321 has quit IRC (Ping timeout: 268 seconds) [06:46] *** bwn has joined #archiveteam-bs [07:02] *** metalcamp has joined #archiveteam-bs [07:14] *** JesseW has quit IRC (Quit: Leaving.) [07:18] looks like gawker.com has everything up to 2015 now [07:26] i'm uploading tagesschau 20 clock evening news for 1991-01 [07:26] and eric archive pdfs [07:43] looks like 1947-12 and 1994-05 sky and telescope was not uploaded [07:44] they are uploaded now i think [07:51] *** bwn has quit IRC (Read error: Operation timed out) [08:43] i'm official at 667k items now [08:59] *** bwn has joined #archiveteam-bs [09:06] *** bwn_ has joined #archiveteam-bs [09:11] *** bwn__ has joined #archiveteam-bs [09:16] morning all :) busy moving house, found my box of floppies and tapes and such, will see if there's anything interesting on them soon! [09:19] *** bwn has quit IRC (Read error: Operation timed out) [09:25] *** bwn_ has quit IRC (Read error: Operation timed out) [09:36] *** zino has joined #archiveteam-bs [09:38] *** dashcloud has quit IRC (Ping timeout: 260 seconds) [09:39] *** dashcloud has joined #archiveteam-bs [09:58] *** chazchaz has quit IRC (Read error: Operation timed out) [09:59] *** espes__ has quit IRC (Read error: Operation timed out) [10:01] *** dxrt- has quit IRC (Read error: Operation timed out) [10:01] *** chazchaz has joined #archiveteam-bs [10:04] *** dan- has quit IRC (Ping timeout: 260 seconds) [10:05] *** BlueMaxim has quit IRC (Read error: Operation timed out) [10:05] *** zino has quit IRC (Ping timeout: 1208 seconds) [10:06] *** BlueMaxim has joined #archiveteam-bs [10:08] *** BlueMaxim has quit IRC (Client Quit) [10:14] *** dan- has joined #archiveteam-bs [10:18] *** espes__ has joined #archiveteam-bs [11:00] *** alfie has quit IRC (Quit: Seeeya! - ZNC 1.6.3+deb1+jessie0) [11:00] *** alfie has joined #archiveteam-bs [11:11] *** dxrt-50 has joined #archiveteam-bs [11:11] *** dan- has quit IRC (Ping timeout: 260 seconds) [11:12] *** dxrt-50 is now known as dxrt- [11:39] *** dan- has joined #archiveteam-bs [12:59] i'm grabbing mugenarchive.com download links [12:59] just cause its there [13:23] now that i'm a member of mugenarchive.com i'm getting all of the download files [13:24] good news is one actionhash works for all download ids [13:46] *** Fusl has quit IRC (Quit: Contact: http://hallowe.lt/) [14:23] *** alfie has quit IRC (Quit: Seeeya! - ZNC 1.6.3+deb1+jessie0) [14:23] *** alfie has joined #archiveteam-bs [14:24] *** vitzli has quit IRC (Leaving) [14:28] *** alfie has quit IRC (Client Quit) [14:28] *** alfie has joined #archiveteam-bs [15:03] *** Fusl has joined #archiveteam-bs [15:06] *** wyatt8740 has quit IRC (Read error: Operation timed out) [15:35] alfie: here's some info on dumping DOS/Windows floppies: http://digitize.archiveteam.org/index.php/Floppy_Disks [15:42] 'An IRC pal gave me a "fun" tip recently, and I've been trying it: when you find a site that's awful, change to a mobile user agent and be amazed at how your desktop experience is suddenly usable again! This works depressingly well depressingly often. On a related note, it also works well to prepend "Mobile " to your elinks UA.' [15:47] except for the fact that then you get bombarded with "Download our app!" "Open this in our app!" [15:47] *** VADemon has quit IRC (Read error: Operation timed out) [15:50] lol [15:57] sigh: https://eev.ee/blog/2016/03/06/maybe-we-could-tone-down-the-javascript/#comment-2555914780 [15:57] "I hate webdev so I'm going to write shitty apps" [15:57] then maybe don't do webdev...? [16:22] irony using that site [16:22] comments: Apologies, but part of running a static blog is that the comments are served by Disqus's JavaScript slurry. [16:22] =) [16:33] yep [16:35] *** wyatt8740 has joined #archiveteam-bs [17:50] *** fpoee has quit IRC (Ping timeout: 633 seconds) [17:55] *** zino has joined #archiveteam-bs [17:59] hey is bitsnoop.com working for people or have they been fbi'ed? [18:02] SimpBrain, works from OVH GRA [18:02] hmm, i get a fbi cyber crime page [18:03] DNS might be changing [18:03] What does it resolve to for you? [18:03] https://www.fbi.gov/about-us/investigate/cyber [18:04] this is from my online.net vpn [18:04] strange, fine from M247 [18:05] my webpage proxy gives actual bitsnoop website [18:05] really strange indeed [18:06] Might I suggest we download the heck out of it? [18:08] I get it from Online too [18:09] data dumps seems to be available from the online.net vpn, but not the homepage [18:10] traceroute to www.bitsnoop.com (31.7.59.14), 30 hops max, 60 byte packets [18:11] I tried it from my home line, and was met with http://assets.virginmedia.com/site-blocked.html [18:13] i wont drop to my sky ip but it'll prob be blocked [18:14] Its not blocked from whatever archivebot pipeline is downloading it [18:15] really strange [18:15] whatever dns i have must be redirecting to fbi [18:15] Trying to get it fast, if its is being taken down [18:15] Resolves to 31.7.59.14 for me too and works fine. [18:17] *** JesseW has joined #archiveteam-bs [18:17] We wont get a lot, but getting what I can in the time that we can [18:18] try and grab any api data dumps too [18:18] http://ext.bitsnoop.com/export/b3_all.txt.gz [18:18] http://ext.bitsnoop.com/export/b3_verified.txt.gz [18:18] Not sure how to do that [18:18] http://ext.bitsnoop.com/export/b3_e003_trackers.txt.gz [18:18] http://ext.bitsnoop.com/export/b3_e003_torrents.txt.gz [18:19] Is that htem all? [18:19] there's some more urls but they rely on the site url being correct [18:20] http://www.bitsnoop.com/api/latest_tz.php?t=all [18:20] http://www.bitsnoop.com/api/latest_tz.php?t=verified [18:21] 24.0 million torrents [18:21] 29.7 PB of files [18:21] at least 15 pb will be 0 seed :p [18:22] Yeah [18:22] dead torrents make me sad [18:22] I'm the final seed on a bunch, and I can't let them go [18:23] someone out there probably has them archived on discs somewhere [18:23] little use to anyone if they're not online [18:46] *** metalcamp has quit IRC (Quit: Bye) [19:02] Frogging: anything interesting? [19:02] nah, not really :p [19:03] rare shows? i mostly collect semi rare and one off shows and movies and things of that nature. [19:04] High quality encodes of Fringe. High-quality anything is hard to come by on public P2P [19:04] *** metalcamp has joined #archiveteam-bs [19:05] got that on blu ray [19:05] amazing show [19:08] i found all of kablam, including the one off pilot henry and june show. [19:09] JesseW: how goes the repacking, just the big three to go? [19:27] yep [19:27] I've been away from my computer for the last couple of days, so thanks for the reminder to start on those [19:28] also I need to combine all the tiny (and not so tiny) csv files into one big (either csv or sqlite) database. [19:28] bsmith093: could you confirm that the other zip files made it up to FOS correctly (by say downloading one of the smaller ones and checking it)? [19:37] * JesseW started the all-the-H's-except-Harry-Potter zip job [19:45] *** bwn__ has quit IRC (Read error: Operation timed out) [19:53] heh [20:00] any Go devs here? [20:02] I do a little bit of go stuff [20:03] dan-: would you happen to have some spare time to help out an open-source project (Gogs) security-wise by changing string-concatenated queries to parameterized queries? [20:04] o.O [20:06] joepie91: ah, I use gogs! unfortunately I don't really right now, busy looking for new job [20:06] I'll take a look and see what I can do today [20:07] dan-: would be much appreciated - thread is at https://github.com/gogits/gogs/issues/2892 [20:07] maintainers indicated they're welcoming PRs to fix it [20:07] so even changing over a part of it would be a good thing [20:08] ah thanks for the link, once I start looking, if I'm able to switch some over I'll add a comment in there [20:09] is it adding "'s to the db? [20:10] * SimpBrain quickly looks at it [20:13] *** bwn__ has joined #archiveteam-bs [20:14] I'll see if I can look at it [20:15] grep command for (I think) locating all the instances of string concat [20:15] grep -rE "\+.*ToStr\(" . [20:15] cc dan- JesseW [20:15] it seems to be limited [20:16] bsmith093: looks like my effort to ignore the Harry Potter stuff didn't work -- trying again after *moving* it aside [20:16] *** tomwsmf-a has joined #archiveteam-bs [20:23] joepie91: so gogs is a github-alternative, ish? [20:25] basically yeah, self-hosted alternative, kinda like gitlab [20:28] what's the difference between it an gitlab? [20:29] JesseW: gogs is an almost literal github clone, and is easy to deploy [20:29] unlike gitlab which is Rails :) [20:29] seems like gogs is focused somewhat more on single-person/project deployments, vs gitlab for larger ones. [20:29] (and near unusable, imo) [20:29] Ah, ok [20:40] *** JesseW has quit IRC (Quit: Leaving.) [20:45] IA is uploading book images to Flickr? https://www.flickr.com/photos/internetarchivebookimages/ [20:50] archiveception [21:19] *** RichardG has quit IRC (Read error: Operation timed out) [21:20] *** RichardG has joined #archiveteam-bs [21:25] Probably the access drives preservation mindset. grab all the things and then provide access to as many as possible [21:42] *** metalcamp has quit IRC (Ping timeout: 244 seconds) [21:48] joepie91: i know i'm late to the party, but i tried to deploy gitlab once. because i was (god forbid) running other shit on that webserver too, i couldn't use their one-click deployment whatever and the docs were SORELY LACKING when it came to "how2 existing nginx" which was a pain because no matter what i tried (sane setup wise) it just wouldn't play nice [21:48] so i gave up and just used plain ol' git and ssh [21:54] of course [21:54] gitlab was never an option for me [21:54] it's Ruby [21:55] that automatically disqualifies it from voluntary deployment [21:55] :p [21:58] joepie91: So Ruby is your PHP? First time I've met that particular aversion. :) [22:04] *** hook54321 has joined #archiveteam-bs [22:12] *** JesseW has joined #archiveteam-bs [22:12] zino: nah, PHP has been barred entry also, but for different reasons [22:13] SketchCow: 2012-12 of kpfa is being uploaded [22:13] zino: Ruby translates roughly to "deployment and dependency nightmare" [22:13] every time i attempt to install anything Ruby, desktop app or server daemon, doesn't matter... it inevitably ends up with me chasing dependencies, having to install 20 different package and version managers, crawling through obscure errors, and then finding out that one Ruby thing conflicts with another [22:13] I'm sick of it [22:13] so everything Ruby is now just automatically denied entry [22:14] until they can get their dep ecosystem in order [22:20] bsmith093: did H (minus Harry Potter), now doing N (minutes Naruto) [22:30] *** Muad-Dib has quit IRC (Quit: ZNC - http://znc.in) [22:31] joepie91: I'll be somewhat delayed in helping out with gogs, because I'm currently back on debian wheezy, and this is enough of a prompt to get me to (finally) upgrade to jessie, for which I want to make a full backup first, so... delays. [22:32] JesseW: sure :P there's some progress in the thread right now anyway [22:34] yeah, reading now [22:45] *** RichardG has quit IRC (Ping timeout: 499 seconds) [22:52] joepie91: What's your opinion on python, then? [22:54] ersi: slightly less disastrous. still a mess. I prefer avoiding it, but it's not outright banned [22:54] mostly because Python software tends to either vendor in deps or limit the amount of them [22:54] and/or deps have endless backwards compat [22:55] (the reason PHP is banned, is security, btw) [22:56] bsmith093: N (minus Naruto) done, now working on T (minus Twilight) [22:57] JesseW: way ahead of you, i dl-d them as soon as you were done uploading. do you happen to have an md5 file i could check, but they seem to open file [22:57] H_rest came out to be 6G, and N_rest at 1.8G [22:58] *fine, and ectract as well [22:58] I haven't made md5sums for them yet; I'll do that. [22:59] actually, I think I'll wait until I finish all of them, then md5sum the lot. [23:01] http://i.imgur.com/QyRxSmh.png [23:01] joepie91: what is that from? [23:01] and hey, at least it hasn't degrenerated to random keyboard smash yet... [23:08] I'm md5-ing what i have, and you can compare when you're done zipping [23:10] cool, thanks [23:14] JesseW: private repo for dumping misc stuff in [23:14] :p [23:14] I have Gogs running now [23:15] https://git.cryto.net/ [23:15] just need to import my repos now [23:17] hm, might be worth hacking up a shell alias to insert random words as commit messages. :-P [23:17] lol [23:21] bsmith093: got your md5s, thanks [23:21] np [23:38] *** BlueMaxim has joined #archiveteam-bs [23:49] *** VADemon has joined #archiveteam-bs