[00:02] i seen the DITC site before about 2 weeks ago [00:03] but didn't know how to do it until now [00:03] there are 2 number systems it looks like [00:03] ADAxxxxxx numbering [00:04] then there is a AD0xxxxxx numbering system [00:06] for example anything with www.dtic.mil/dtic/tr/fulltext/u2/a003342.pdf is a ADA number type [00:07] haha,am i the only one that sees these typos on jason's site,or does SketchCow just not correct them? [00:07] without the "a" in the file it will be AD0 type [00:08] "We labelled as best we can", on the September 2nd post [00:09] Is pokeball doing anything useful? Because I'm sick of him. [00:09] D: [00:09] Someone justify him, like a kitten or a tattoo [00:09] im with bibanon [00:10] You don't get a say [00:10] he help me figure out DITC when searching about the ebay book he link here [00:11] That's something. [00:11] i also have Vintage Airplane collection [00:11] Can't wait [00:11] i been wanting to ask you about releasing the ~10 million threads/post of 4chan [00:12] Not answering. [00:12] In fact, going idle. It's 1am here in Brussels and I have a full full day tomorow. [00:13] alright SketchCow,im going to pm a little something about releasing them [00:13] OK, that's it. [00:13] *** SketchCow sets mode: +b *!*uid118096@*.tooting.irccloud.com [00:13] *** pokeball9 was kicked by SketchCow (pokeball9) [00:14] Anyway, gamefront uploading continues apace [00:15] I'll be winking in and out here, butwill do my best to make sure FOS doesn't fill [00:16] SketchCow: Did you see the thing about making a FOS rsync target for the #nohome project? [00:24] *** aaaaaaaa_ has joined #archiveteam-bs [00:24] *** aaaaaaaaa has quit IRC (Read error: Connection reset by peer) [00:28] *** Jordan_ has joined #archiveteam-bs [00:30] yes. go ahead. [00:33] *** aaaaaaaa_ is now known as aaaaaaaaa [00:48] *** primus104 has quit IRC (Leaving.) [01:05] *** dashcloud has quit IRC (Read error: Operation timed out) [01:09] *** dashcloud has joined #archiveteam-bs [02:30] *** zenguy_pc has quit IRC (Read error: Connection reset by peer) [02:30] *** zenguy_pc has joined #archiveteam-bs [03:08] *** dan-| has joined #archiveteam-bs [03:10] *** dan- has quit IRC (Ping timeout: 606 seconds) [03:16] *** zenguy_pc has quit IRC (Read error: Connection reset by peer) [03:33] *** zenguy_pc has joined #archiveteam-bs [04:06] *** vitzli has joined #archiveteam-bs [04:29] *** aaaaaaaaa has quit IRC (Leaving) [06:12] *** Spring has joined #archiveteam-bs [06:12] is there a script that can scape a person's Facebook public feed for photos, without being a member? [06:13] Dude, she's not coming back [06:13] lol. An illustrator died recently and there's a feed with his work that would be nice to save [06:15] It's not easy [06:15] You think it would be [06:15] Google had the Google Liberation Front [06:15] We should write a facebook scraper [06:15] Just open-source tht shiznat [06:18] hmm, there's a userscript that appears to do this but requires an account. Wonder if the account is simply needed for some key. https://greasyfork.org/en/scripts/2180-download-fb-album-mod [06:18] screenshots, https://docs.google.com/document/d/11ICWMx6PtEd6tXdJ0SSnZkdV7Y340bDINuC8ydxL-Lc/edit?pli=1 [06:40] *** JesseW has joined #archiveteam-bs [06:40] grumble... there are still 13 urlteam dumps whose torrents are broken by a IA bug that I reported a month ago, but hasn't been resolved. [06:41] there's a workaround -- adding a review fixes it -- but it's still irritating. [06:45] DTIC ADA005002: Integer Programming by Group Theory: Some Computational Results : https://archive.org/details/DTIC_ADA005002 [06:47] godane: neat! [06:48] thanks [06:49] i'm going to bed for now [06:49] bbl [06:57] g'night! [07:27] *** JesseW has quit IRC (Leaving.) [07:27] *** JesseW has joined #archiveteam-bs [07:30] *** JesseW has quit IRC (Read error: Operation timed out) [07:46] *** primus104 has joined #archiveteam-bs [07:49] I don't suppose anyone here has a Facebook account they could try this Perl script on, http://www.sat.dundee.ac.uk/~arb/facebook/facebook-album-downloader.pl [07:52] *** JesseW has joined #archiveteam-bs [08:06] *** JesseW has quit IRC (Read error: Operation timed out) [08:12] *** schbirid has joined #archiveteam-bs [08:43] *** primus104 has quit IRC (Leaving.) [09:22] *** vitzli has quit IRC (Quit: Leaving) [09:22] *** vitzli has joined #archiveteam-bs [09:42] *** mksplg has quit IRC (WeeChat 0.4.2) [09:46] *** BlueMaxim has quit IRC (Quit: Leaving) [10:10] *** arkiver2 has joined #archiveteam-bs [10:13] Spring: Is there a reason you don't just make a throwaway account? [10:14] *** arkiver2 has quit IRC (Ping timeout: 252 seconds) [10:15] chazchaz, never looked into it as I thought it required real names now [10:16] *** brayden has joined #archiveteam-bs [10:17] *** anomie has quit IRC (Read error: Connection reset by peer) [10:24] *** anomie has joined #archiveteam-bs [10:40] Ugh, need an alternate email to verify the disposable one [10:45] http://10minutemail.com/ ? [10:47] Microsoft seems to blacklist these temp mail sites [10:48] also need one I can come back to [10:48] why? [10:49] I've been blocked from my other live.com email at times due to IP anomalies (I sometimes switch between IPs) and they need your alternate email [10:49] it's a disposable account.. [10:50] thats like registrating your burnerphone so you can trace it [10:50] I'm not going through the time and hassle of creating two independent accounts for single use if I'm doing this [10:51] may as well do it right so next time I don't have to repeat the process [10:54] looks like Microsoft disables the 'create account' button if you use a temp email account... [10:55] gah, there needs to be some online bot [10:56] *** arkiver2 has joined #archiveteam-bs [11:00] *** arkiver2 has quit IRC (Ping timeout: 252 seconds) [11:01] Spring: can you create gmail account -> create facebook one? [11:01] or something dodgy like that [11:01] still requires a temp email account for the Gmail process [11:01] *** dan-| is now known as dan- [11:02] or should I say /an/ email account [11:03] if done correctly a disposable email shouldn't need a legitimate email for registration, but they appear to be blacklisted [11:05] use russian mail providers like mail.ru, they are crap, but ok for one-time use [11:06] Gmail uses SMS verification now, too [11:06] they may ask for a phone number, but I think there is an option to skip this step [11:06] nope. I'm on the screen currently [11:07] ah, sorry, you're referring to the .ru mail [11:10] maybe yandex.ru/.net is less crap, but I'm not sure, they all have questionable business practices [11:10] Google seems to have downgraded their safe browsing diagnostics tool, too, which is a pity. [11:11] used to give a fairly detailed summary of any malware found on a given site, including dates and which sites had the malware. Now it just states 'Recently dangerous' if anything is found. [11:18] russian mail: go to yandex.com, register one and then always refer to it as mail@yandex.ru: multiple domains, one email address [11:18] *** VADemon has quit IRC (left4dead) [11:33] FINALLY [11:33] SUCCESS [11:43] Lesson 1 learned. The userscript fails. [12:02] Lesson 2. Perl hangs on the other script. fml :p [12:07] brb [12:07] *** Spring has quit IRC (Leaving) [12:11] *** Spring has joined #archiveteam-bs [12:13] I give up. Installed the correct Perl modules only to discover they're already installed in the correct location but the Perl isn't seeing them due to an extra sub directory. And with the custom -I library path option it hangs. [12:13] So I have no idea how I'm meant to archive these images. [12:14] *** diacope has quit IRC (Ping timeout: 252 seconds) [12:15] *** chfoo- has quit IRC (Ping timeout: 252 seconds) [12:15] *** Rickster has quit IRC (Ping timeout: 252 seconds) [12:16] *** szalwia has quit IRC (Ping timeout: 252 seconds) [12:17] *** Famicoman has quit IRC (Ping timeout: 252 seconds) [12:17] *** Fletcher has quit IRC (Ping timeout: 252 seconds) [12:18] *** szalwia has joined #archiveteam-bs [12:18] *** primus104 has joined #archiveteam-bs [12:19] *** Rickster has joined #archiveteam-bs [12:19] *** chfoo- has joined #archiveteam-bs [12:23] *** diacope has joined #archiveteam-bs [12:50] DTIC AD0006423: ANTIGENIC STUDIES ON INFLUENZA VIRUS : https://archive.org/details/DTIC_AD0006423 [12:52] *** Fletcher has joined #archiveteam-bs [12:52] SketchCow: Do you get to read info@archive.org? I sent an email a week ago asking for a collection for Yahoo Groups, but haven’t received a response yet. [12:53] *** Famicoman has joined #archiveteam-bs [13:50] *** arkiver2 has joined #archiveteam-bs [13:55] *** arkiver2 has quit IRC (Ping timeout: 252 seconds) [13:59] *** Start has quit IRC (Quit: Disconnected.) [14:12] *** useretail has joined #archiveteam-bs [14:25] *** anomie has quit IRC (Read error: Connection reset by peer) [14:31] *** anomie has joined #archiveteam-bs [14:42] *** pwnsrv has joined #archiveteam-bs [14:50] *** joepie91 has joined #archiveteam-bs [14:50] *** midas sets mode: +o joepie91 [14:50] hm [14:50] I accidentally parted, it seems. [14:50] anyhow. [14:50] https://www.reddit.com/r/PHP/comments/3qi52e/how_to_securely_allow_users_to_upload_files/cwfqj5l?context=1 [14:50] 20 YEARS OF ENTERPRISE EXPERIENCE [14:51] use FTP? [14:51] vitzli: clicky [14:51] vitzli: you'll see why [14:51] the bi-monthly "just outsource to a third party so you don't have to understand it!" fool [14:51] *** dashcloud has quit IRC (Read error: Operation timed out) [14:53] also, "S3 is the starting point for modern webdev" - no, no it is not [14:53] Spring: If you need a disposable address, you can always register a free subdomain at https://freedns.afraid.org/ and set your mx record to mailinator.com [14:53] then you can make up addresses on a domain that isn't blocked whenever you need one [14:54] zero security though [14:54] thanks for the tip, chazchaz [14:54] *** dashcloud has joined #archiveteam-bs [14:56] afraid does make you log in every 6 months or something, so keep that in mind, especially if you don't give them a real address [14:57] starting to upload DTIC stuff again [14:57] i had a 115 items waiting to be deriving [14:59] *** arkiver2 has joined #archiveteam-bs [15:02] it's ok article, but I would do mime.magic on file instead of trusting mimetype blindly [15:03] *** Start has joined #archiveteam-bs [15:04] *** brayden has quit IRC (Read error: Operation timed out) [15:04] Comments are interesting [15:06] *** arkiver2 has quit IRC (Ping timeout: 252 seconds) [15:15] joepie91: s3 is the solution to all his problems. ALL of them. [15:16] 20 years experience, S3. [15:16] developer? S3. [15:23] *** arkiver2 has joined #archiveteam-bs [15:33] *** primus104 has quit IRC (Leaving.) [15:37] *** JesseW has joined #archiveteam-bs [15:40] *** arkiver2 has quit IRC (Quit: Nettalk6 - www.ntalk.de) [15:41] so i found something interesing [15:42] i was looking at Doc Thompson show on CBS Detroit from 2012 [15:42] now i may not be able to get full episodes of that [15:42] i did find a patten [15:42] example: http://ak.podcast.play.it:80/media/d0/d0/d1/d0/dA/dM/dA/10AMA_3.MP3 [15:43] 10AMA is repeated in the folder structure [15:43] exactly [15:43] but also this [15:43] example: http://ak.podcast.play.it:80/media/d0/d0/d1/d0/dA/dM/dC/10AMC_3.MP3 [15:44] we can mirror this [15:44] 3PM.3_CMA01\Cd\Md\Ad\0d\1d\0d\0d\media\08:ti.yalp.tsacdop.ka\\:ptth [15:45] okay but what does that accomplish? [15:45] * myself ducks [15:45] *** JesseW has quit IRC (Read error: Operation timed out) [15:55] 13.5 million plaintext passwords leaked from 000webhost (since March!): www.troyhunt.com/2015/10/breaches-traders-plain-text-passwords.html [15:57] Is that why the 000webhost site is being archived by ArchiveBot right now? [16:02] *** Start has quit IRC (Quit: Disconnected.) [16:06] *** JesseW has joined #archiveteam-bs [16:08] *** Specular has joined #archiveteam-bs [16:08] zhongfu, one mistake after another... [16:08] That article was like being the only sober person at karaoke. [16:09] It's just a succession of awful things, each worse than the last, and you can't imagine it'll keep going but you know it must because you haven't reached the end yet. [16:09] *** Spring has quit IRC (Read error: Operation timed out) [16:21] *** JesseW has quit IRC (Read error: Operation timed out) [16:35] zhongfu: MrRadar: Specular: myself: http://www.lowendtalk.com/discussion/55653/aurimas-rapalis-the-owner-of-hostinger-was-murdered-on-30th-of-last-month [16:35] relevant [16:36] oh snap [16:37] can you say bizarre? [16:37] "Have you heard of 000WebHost or youhosting(It gives Free reseller webhosting ) or vps.me or Hosting24 ? All of them are owned by Hostinger" [16:37] jeepers [16:37] how on earth did you find this? [16:37] *** VADemon has joined #archiveteam-bs [16:37] Specular: I hang around LET [16:38] Specular: there's often little bits of interesting info floating around there [16:38] that otherwise haven't escaped local media [16:42] *** Start has joined #archiveteam-bs [16:56] a example set of nyc.podcast.play.it grab: https://archive.org/details/nyc.podcast.play.it-mp3-UQMx-ids [17:00] *** brayden has joined #archiveteam-bs [17:20] *** brayden_ has joined #archiveteam-bs [17:25] *** brayden has quit IRC (Read error: Operation timed out) [17:25] *** brayden has joined #archiveteam-bs [17:26] *** primus104 has joined #archiveteam-bs [17:29] *** brayden_ has quit IRC (Read error: Operation timed out) [17:38] *** Start has quit IRC (Quit: Disconnected.) [17:44] SketchCow: here is one author in my DTIC archive collection: https://archive.org/search.php?query=subject%3A%22Hill%2C+Christopher+T%22 [17:44] 6 papers are authored by him [17:44] that we know of so far [17:56] *** vitzli has quit IRC (Quit: Leaving) [18:12] *** brayden_ has joined #archiveteam-bs [18:12] *** Gfy has quit IRC (ircd.choopa.net irc2.choopa.net) [18:12] *** primus104 has quit IRC (Leaving.) [18:17] *** brayden has quit IRC (Read error: Operation timed out) [18:33] useful page, https://weakdh.org/sysadmin.html [18:36] See also Mozilla's guides for configuring SSL/TLS and OpenSSH: https://wiki.mozilla.org/Security/Server_Side_TLS https://wiki.mozilla.org/Security/Guidelines/OpenSSH [18:36] *** aaaaaaaaa has joined #archiveteam-bs [18:48] *** insane_al has joined #archiveteam-bs [18:49] *** diacope has quit IRC (Ping timeout: 252 seconds) [18:49] *** Fletcher has quit IRC (Ping timeout: 252 seconds) [18:51] *** brayden has joined #archiveteam-bs [18:54] *** brayden_ has quit IRC (Read error: Operation timed out) [18:57] *** brayden_ has joined #archiveteam-bs [18:59] *** brayden has quit IRC (Read error: Operation timed out) [19:01] *** brayden has joined #archiveteam-bs [19:04] *** primus104 has joined #archiveteam-bs [19:05] *** brayden_ has quit IRC (Read error: Operation timed out) [19:12] *** diacope has joined #archiveteam-bs [19:27] *** Fletcher has joined #archiveteam-bs [19:33] *** brayden_ has joined #archiveteam-bs [19:36] my god [19:36] how does jamendo manage to make every version of the site worse and worse [19:36] now I can't select text because it goes into some dumb mobile drag mode [19:36] ffs [19:37] it's full of (scroll)bars? [19:39] *** brayden has quit IRC (Read error: Operation timed out) [19:39] * joepie91 rants into feedback page [19:39] myself: https://www.jamendo.com/album/151736/aboutime-adagio-for-7-cellos-and-1-piano [19:39] try selecting the album title [19:40] joepie91, works here [19:40] Firefox [19:41] for me it just goes into drag mode [19:41] in Chrome [19:45] hey midas ^ ;) [19:46] *** RichardG has quit IRC (Ping timeout: 252 seconds) [19:47] this shit is so broken it's not even funny [19:48] *** Start has joined #archiveteam-bs [19:48] welcome to jamendo [19:49] that reminds me, did anyone go for the flac archive of jamendo? [19:50] I have not gotten around to it yet, but there are multiple avenues [19:50] well [19:50] were [19:50] seems they broke one of them [19:50] because the artist info API seems to be completely down as well [19:50] returning empty 200 responses [19:50] urgh [19:50] so, GOOD JOB JAMENDO [19:50] fucking hell. [19:50] just throw version -2 back up and call it a day, because that one actually, y'know, worked [19:51] oh and OGG seems perma-gone now [19:51] i want the blue site back [19:51] schbirid: that was probably version -2? [19:51] schbirid: the one before they did the fancy on-page player stuff [19:51] dunno, iirc it still had ed2k links [19:51] and where you could pick a filetype when downloading [19:51] oh, that's an earlier one I think [19:52] Wow, ed2k. I haven't heard/thought about that in *years* [19:53] *** Stiletto has quit IRC (Ping timeout: 255 seconds) [19:53] *** Stiletto has joined #archiveteam-bs [19:53] *** RichardG has joined #archiveteam-bs [19:56] Yup, goes into scrolly shit for me too. That's lovely. [19:56] Just like some tumblrs and stuff that try to prevent you from right-clicking. Press F12, find what you want in the page source. Wipe hands on pants. [20:00] *** brayden has joined #archiveteam-bs [20:00] *** Gfy has joined #archiveteam-bs [20:01] joepie91: flac and ogg download works fine for me. it's for tracks only, not albums [20:02] https://developer.jamendo.com/v3.0/tracks/file [20:02] schbirid: different API [20:02] schbirid: also, does it have documented FLAC download now? [20:03] ah [20:03] schbirid: right, this one requires auth. [20:03] I DONT WANT TO CARE ABOUT JAMENDO ANYMORE [20:03] schbirid: PM [20:06] *** brayden_ has quit IRC (Read error: Operation timed out) [20:07] the old Opera had a per-site setting that could disable scripts hijacking right-clicks [20:08] one of the many things I miss [20:08] I DONT WANT TO CARE ABOUT GOOD OLD OPERA ANYMORE [20:09] *** Fletcher has quit IRC (Ping timeout: 252 seconds) [20:09] *** diacope has quit IRC (Ping timeout: 252 seconds) [20:11] *** Stiletto has quit IRC (Read error: Operation timed out) [20:11] dom.event.contextmenu.enabled in forefox [20:12] not per site though [20:13] the phantom pain [20:18] I have good news and I have bad news [20:18] *** brayden_ has joined #archiveteam-bs [20:18] the good news is that the old API isn't broken [20:18] the bad news is that it simply doesn't contain newer entries and seems to run off an entirely separate database [20:18] :||| [20:19] I guess that is one way to handle migration [20:21] increasingly tempted to punch a Jamendo dev [20:23] *** brayden has quit IRC (Read error: Operation timed out) [20:26] *** diacope has joined #archiveteam-bs [20:37] *** chfoo has quit IRC (Quit: chfoo) [20:52] oh, excellent [20:52] v3 api is giving me empty responses [20:52] for tracks that exist [20:52] :|||||| [20:59] *** brayden has joined #archiveteam-bs [21:05] erm, youtube "smack my bit" gives me "smack my bits up the prodigy" in the auto suggestions. "smack my bitc" gives nothing. [21:05] *** brayden_ has quit IRC (Read error: Operation timed out) [21:05] lol [21:05] in today's episode of dumb idea theatre [21:05] https://github.com/felixge/node-mysql/issues/1120#issuecomment-151867072 [21:05] let's reintroduce SQLi! [21:06] * joepie91 accumulates reasons to want to punch somebody tonight [21:06] is it javascript? [21:06] schbirid: yes, but not specific to JS. [21:07] i hope you realise it has more than 5000 stars on github [21:07] *** Fletcher has joined #archiveteam-bs [21:07] he is a rockstar developer node-js lean bro [21:07] schbirid: not node-mysql [21:07] the suggestion I linked to [21:07] SQL template strings [21:07] yesyes [21:07] fucking awful idea [21:07] you just dont get it, he is smarter [21:07] * joepie91 glares at schbirid [21:08] wait, felixge != felixfbecker [21:08] sorry [21:08] move along [21:08] * schbirid goes zzz [21:08] *** schbirid has quit IRC (Quit: Leaving) [21:08] yes, different person [21:08] lol [21:12] *** brayden_ has joined #archiveteam-bs [21:16] *** Start has quit IRC (Quit: Disconnected.) [21:16] *** brayden has quit IRC (Read error: Operation timed out) [21:16] ......... [21:17] "Select only tracks of a certain type. By default we return only albumtracks to avoid the high risk of bugging applications (especially those built before 2015, that is before the existence of singles). Using 'type=single albumtrack' you will select both types" [21:17] *** SadDM has quit IRC (Read error: Operation timed out) [21:17] *** lexicon has quit IRC (Read error: Operation timed out) [21:21] *** Kksmkrn has joined #archiveteam-bs [21:31] *** logchfoo starts logging #archiveteam-bs at Wed Oct 28 21:31:32 2015 [21:31] *** logchfoo has joined #archiveteam-bs [21:33] *** brayden has joined #archiveteam-bs [21:35] *** brayden_ has quit IRC (Read error: Operation timed out) [21:38] *** brayden_ has joined #archiveteam-bs [21:44] *** brayden has quit IRC (Read error: Operation timed out) [21:45] *** dashcloud has joined #archiveteam-bs [21:49] *** aaaaaaaaa has quit IRC (Read error: Connection reset by peer) [21:49] *** aaaaaaaa_ has joined #archiveteam-bs [21:54] *** phiren has joined #archiveteam-bs [21:56] *** aaaaaaaa_ is now known as aaaaaaaaa [22:05] *** insane_al has quit IRC (Leaving) [22:27] *** Stiletto has joined #archiveteam-bs [22:41] *** Stilett0 has joined #archiveteam-bs [22:47] *** Stiletto has quit IRC (Read error: Operation timed out) [22:55] *** SadDM has joined #archiveteam-bs [22:59] *** lexicon has joined #archiveteam-bs [23:12] *** Ghost_of_ has joined #archiveteam-bs [23:12] out of interest, I tried feeding a porn tube link to the WM [23:13] would you know what kind of policy IA has here? Do they consider porn culturally important? [23:13] Order some more petaboxes! [23:14] IA is fine with anything as long as it's not illegal to store [23:15] (I think) [23:15] I know that, arkiver, but I suppose that some things is given more importance than others? [23:16] for instance, I'd rate cnn.com over bronyrotica.com (if that exists) [23:17] I think if IA is short on storage/money then cn..com would be chosen over that other site [23:17] But I think while IA is not short on storage all bits are equal [23:19] makes sense ... and no-one knows what is interesting to people in 500 or 1000 years. After all, we freak out when we find garbage piles from certain eras. [23:20] whoops, time to go to bed ... see ya [23:21] have a good night [23:22] thanks [23:22] *** Ghost_of_ has quit IRC (Quit: Leaving) [23:44] *** aaaaaaaa_ has joined #archiveteam-bs [23:44] *** aaaaaaaaa has quit IRC (Read error: Connection reset by peer) [23:48] *** aaaaaaaa_ is now known as aaaaaaaaa