[00:03] *** sims has quit IRC (Ping timeout: 268 seconds)
[00:05] joepie91, this user was banned from github when they had their Open Code of Conduct bullshit going on (the one that specifically stated they would not take action against reverse-racism etc). https://github.com/nodejs/TSC/issues/8 Banned from repo, then banned from github for posting an eggplant emoji.
[00:07] http://todogroup.org/opencodeofconduct/
[00:10] Anyway, not going to go down that rabbit hole in this channel. There were a lot of stupid bans during that time period.
[00:23] oh, that one
[00:23] I never knew that resulted in a github-wide ban
[00:26] "Nest warned customers that its internet-connected security cameras and smartphone apps were not functioning properly – as in, weren't recording video footage for several hours – as a result of the AWS blunder."
[00:28] Let's hook up our entire house to the cloud!
[00:36] *** odemg has quit IRC (Remote host closed the connection)
[00:41] *** bsmith093 has quit IRC (Quit: Leaving.)
[00:50] As if the NSA hadn't enough already
[00:55] *** odemg has joined #archiveteam-bs
[00:57] *** BlueMaxim has quit IRC (Quit: Leaving)
[01:18] *** icedice has quit IRC (Ping timeout: 250 seconds)
[01:24] is there a way to search part of the title in IA
[01:24] like something like this: title:EJ69
[01:25] i'm trying to search for every title that has EJ69 in it
[01:35] *** odemg has quit IRC (Remote host closed the connection)
[01:36] nevermind i just grab the item pages to make sure they all have pdfs
[01:41] due to dropbox nuking "public folders" on march 15, should we try to archive all links of the form https://dl.dropboxusercontent.com/u// that we can find on the net?
[01:41] that stuff will all disappear after march 15 for free dropbox users and will disappear on september 1? for paid users
[01:42] the "share" type links which look like https://dl.dropboxusercontent.com/s// will still work
[01:42] number, in the /u/ links, is associated with a specific user, while the is completely random and seems to be generated as needed for each shared file
[01:53] *** Ravenloft has joined #archiveteam-bs
[02:05] *** alfie has quit IRC (Ping timeout: 260 seconds)
[02:06] *** BlueMaxim has joined #archiveteam-bs
[02:27] *** alfie has joined #archiveteam-bs
[02:29] *** ndiddy has joined #archiveteam-bs
[02:34] *** alfie has quit IRC (Ping timeout: 244 seconds)
[02:45] *** alfie has joined #archiveteam-bs
[02:50] *** VADemon has quit IRC (Quit: left4dead)
[02:52] *** kyounko has joined #archiveteam-bs
[02:56] is there a Dropbox folder download script?
[02:56] *** alfie has quit IRC (Ping timeout: 244 seconds)
[02:57] *** schbirid has quit IRC (Ping timeout: 255 seconds)
[02:58] *** Ravenloft has quit IRC (Ping timeout: 260 seconds)
[03:09] *** schbirid has joined #archiveteam-bs
[03:09] *** alfie has joined #archiveteam-bs
[03:27] *** alfie has quit IRC (Ping timeout: 244 seconds)
[03:36] *** alfie has joined #archiveteam-bs
[04:08] *** Ravenloft has joined #archiveteam-bs
[04:11] *** alfie has quit IRC (Ping timeout: 260 seconds)
[04:13] *** alfie has joined #archiveteam-bs
[04:22] *** alfie has quit IRC (Ping timeout: 244 seconds)
[04:25] *** alfie has joined #archiveteam-bs
[04:34] *** alfie has quit IRC (Ping timeout: 260 seconds)
[04:35] *** alfie has joined #archiveteam-bs
[04:39] SketchCow: http://www.ebay.com/itm/Smart-Computing-PC-Novice-Back-issues-1995-2011-Complete-Lot-180-issues-/272545898197
[04:40] you may need to grab that
[04:41] that guy may have every issue up to 2011
[04:45] *** alfie has quit IRC (Ping timeout: 244 seconds)
[04:48] *** alfie has joined #archiveteam-bs
[05:04] *** ndiddy has quit IRC (Read error: Connection reset by peer)
[05:07] *** Sk1d has joined #archiveteam-bs
[05:11] *** ravetcofx has joined #archiveteam-bs
[05:21] *** alfie has quit IRC (Ping timeout: 260 seconds)
[05:26] *** alfie has joined #archiveteam-bs
[05:32] *** ravetcofx has quit IRC (Read error: Operation timed out)
[05:35] *** alfie has quit IRC (Ping timeout: 244 seconds)
[05:35] *** ravetcofx has joined #archiveteam-bs
[05:42] *** alfie has joined #archiveteam-bs
[06:22] *** alfie has quit IRC (Ping timeout: 260 seconds)
[06:24] *** alfie has joined #archiveteam-bs
[06:34] *** alfie has quit IRC (Ping timeout: 244 seconds)
[06:47] *** alfie has joined #archiveteam-bs
[07:04] *** ZexaronS- has quit IRC (Read error: Connection reset by peer)
[07:05] *** nrp3c has quit IRC (Read error: Operation timed out)
[07:07] *** ZexaronS has joined #archiveteam-bs
[07:11] *** alfie has quit IRC (Ping timeout: 260 seconds)
[07:13] *** vitzli has joined #archiveteam-bs
[07:14] *** alfie has joined #archiveteam-bs
[07:19] *** nrp3c has joined #archiveteam-bs
[07:36] *** alfie has quit IRC (Ping timeout: 244 seconds)
[07:54] anyone planning to archive the berkeley videos they have to take down due to the ADA complaint?
[07:56] I have someone who seems to be doing it pretty intently.
[08:21] *** alfie has joined #archiveteam-bs
[08:32] *** alfie has quit IRC (Ping timeout: 244 seconds)
[08:38] *** alfie has joined #archiveteam-bs
[08:49] *** alfie has quit IRC (Ping timeout: 260 seconds)
[08:49] *** alfie has joined #archiveteam-bs
[08:54] *** alfie has quit IRC (Ping timeout: 244 seconds)
[09:01] *** GE has joined #archiveteam-bs
[09:01] *** alfie has joined #archiveteam-bs
[09:24] *** vitzli has quit IRC (Leaving)
[09:26] *** ravetcofx has quit IRC (Read error: Operation timed out)
[09:28] *** Asparagir has quit IRC (Read error: Operation timed out)
[09:35] *** Asparagir has joined #archiveteam-bs
[10:08] *** HCross2 has quit IRC (Quit: Connection closed for inactivity)
[11:13] *** pizzaiolo has joined #archiveteam-bs
[11:22] *** HCross2 has joined #archiveteam-bs
[11:23] *** GE has quit IRC (Remote host closed the connection)
[11:28] *** BlueMaxim has quit IRC (Quit: Leaving)
[11:35] does anyone have good solutions for backing up MediaWiki based wikis via their api.php? everything I've tried sucks.
[11:49] wikiteam's dumpgenerator?!!!!!! :O
[12:07] Best project to throw my warriors at?
[12:09] *** Silvan has joined #archiveteam-bs
[12:12] *** dashcloud has quit IRC (Quit: No Ping reply in 180 seconds.)
[12:12] *** SilSte has quit IRC (Read error: Operation timed out)
[12:13] *** dashcloud has joined #archiveteam-bs
[12:47] *** GE has joined #archiveteam-bs
[12:48] *** t2t2 has quit IRC (Read error: Operation timed out)
[12:54] *** VADemon has joined #archiveteam-bs
[13:12] *** JSharp___ has quit IRC (Read error: Connection reset by peer)
[13:12] *** alembic has quit IRC (Read error: Connection reset by peer)
[13:13] *** JSharp___ has joined #archiveteam-bs
[13:13] *** alembic has joined #archiveteam-bs
[13:23] *** passerby has quit IRC ()
[13:44] *** passerby has joined #archiveteam-bs
[14:28] *** mls has quit IRC (Read error: Connection reset by peer)
[14:29] *** mls has joined #archiveteam-bs
[14:52] Best project to throw my warriors at? <-- Probably Yahoo Answers right now.
[14:52] Concurrency 4 or less.
[14:53] https://arstechnica.co.uk/security/2017/03/marissa-mayer-forgoes-bonus-after-yahoo-botches-hack-investigation/
[14:53] More Yahoo circling the drain.
[14:53] we'll have app.net running soon
[14:53] that will require some resources probably
[14:54] I have not heard of that.
[14:54] #crapp.net
[14:55] Excellent name.
[15:36] *** DopefishJ has joined #archiveteam-bs
[15:36] *** swebb sets mode: +o DopefishJ
[15:37] *** DFJustin has quit IRC (Ping timeout: 260 seconds)
[15:39] *** Stilett0 has quit IRC (Read error: Connection reset by peer)
[15:46] *** Stilett0 has joined #archiveteam-bs
[16:18] *** mls has quit IRC (Quit: leaving)
[16:26] I have a really nice grab of a site before it was taken down: https://web.archive.org/web/20170105011245/http://www.fictiongrill.com/ but the IA chrome extension keeps sending me to an earlier, really shitty grab: https://web.archive.org/web/20160701035124/http://www.fictiongrill.com/ :(
[16:26] *** DopefishJ is now known as DFJustin
[16:28] :(
[16:28] that is a very good grab, though. nice work :)
[16:30] schbirid: I should look at that. using some shit wrapped around git-remote-mediawiki atm
[16:48] hm, i should survey all our warc items and sort them by hits per megabyte-day
[16:48] see what's popular
[16:51] I was really confused why my items were getting views until I realized it counted the people who viewed it on the wayback machine.
[16:52] yeah!
[16:58] *** Asparagir has quit IRC (Read error: Connection reset by peer)
[16:59] *** Asparagir has joined #archiveteam-bs
[17:07] *** BlueMaxim has joined #archiveteam-bs
[17:32] *** icedice has joined #archiveteam-bs
[17:36] *** ravetcofx has joined #archiveteam-bs
[17:57] *** kyounko has quit IRC (Read error: Connection reset by peer)
[18:03] *** kyounko has joined #archiveteam-bs
[18:14] Jon: worked very well for me in the past
[19:37] *** GE has quit IRC (Remote host closed the connection)
[19:43] *** Ravenloft has quit IRC (Ping timeout: 633 seconds)
[19:53] *** j08nY has joined #archiveteam-bs
[20:04] Is the archiving of the Berkeley Course Captures likely to be an issue? Also, is there someone who can decide on an irc channel name, and someone who could make a small piece of software that uses the youtube-dl api and automatically outputs the correct metadata.xml files, in correctly named folders? I would if I did not believe I do more good by working for an initial Valhalla Proof-Of-Concept funding (which
[20:04] is slightly dual use in not only archiving for Valhalla).
[20:05] ThisAsYou: for easiest operation give all your uploaded videos the same unique tag
[20:06] ThisAsYou: then contact IA or SketchCow directly to ask for a collection with the stuff containing that tag
[20:06] ThisAsYou: this is generic advice however
[20:06] in the specific case of UC Berkeley it seems like there might be a coordinated effort instead
[20:06] By standard I guess just using the normal youtube-id in the name too would be good, I guess
[20:07] Also, if we actually do make tar files, we should maybe include the format id from youtube-dl in it.
[20:08] The Berkeley videos are:
[20:08] - Well publicized
[20:08] - Easy to get
[20:09] - Likely to be overrun with "an Hero"s who are going to do it all sorts of ways
[20:09] I'd rather not wake up to 5 different Speed Racer k-razy kars of klownkiving all pouring gigs of videos into the archive with heavily variant metadata transfer
[20:11] Me neither. SketchCow: you got a channel name?
[20:11] Bruce
[20:11] I propose #berkeney
[20:12] That doesn't even make sense
[20:12] #berklost
[20:12] I also have another minor announcement
[20:13] I might be a bit wrong on how to combine humor. Anyway, i mistyped. It should have been spelled berkenay. Which is still bad.
[20:13] Post heart attack, I've been rather focused on cleaning up to-dos and making sure none of my machines have much data in the way of "should go on the archive".
[20:13] One of the machines (out of 4) just hit that state today.
[20:13] It still does things, and I should document them, but it no longer has any "this should go on the archive"
[20:14] Minor announcement. But it makes me happy. Next is FOS and I expect that to be more annoying
[20:15] Hope you're doing ok, heart attacks sound scary.
[20:16] They are, and illuminating
[20:18] Anyway, I can help with UC Berkeley. I've never helped with IA stuff other than the warrior. I'm throwing the channel on GDrive as we speak and gathering all the metadata returned by youtube-dl, as well as thumbnails. I'll hold off on putting anything on IA.
[20:19] Anyone able to lay out the specifics for naming the IA items? Please come and join #berklost ThisAsYou
[20:22] SketchCow: yay for getting things off the obligations list!
[20:35] *** tobbez has joined #archiveteam-bs
[20:53] *** bsmith093 has joined #archiveteam-bs
[20:54] *** Marcelo has joined #archiveteam-bs
[21:01] *** GE has joined #archiveteam-bs
[21:10] *** Stilett0 has quit IRC (Ping timeout: 246 seconds)
[21:12] *** namespace has quit IRC (Read error: Operation timed out)
[21:16] *** Marcelo has left
[21:20] Can an admin give green light for using tubeup.py to upload the berkeley course captures #berklost into the IA?
[21:22] ThisAsYou: would need permission to upload, I could schedule the rest of the load tomorrow, so then I would also modify the tubeup script to upload into the correct collection. I hope to get green light until then, if not, it would be helpful to know enough as necessary to determine a suitable alternative, as I do not know why we should not use official script in the recommended configuration.
[21:23] i'm uploading more koreanet-2 gwangju power shows
[21:23] i'm also uploading more of the mike slater show
[21:29] and so it begins: https://twitter.com/joepie91/status/837413786843820034
[21:30] (re: the neveragain.tech pledge)
[21:33] nice
[21:44] SketchCow: looks like i can upload/fix the items missing the pdfs
[21:44] in the eric archive
[21:44] :-D
[21:45] makes my life a lot easier since i have 58 pdfs in the ERIC_EJ72xxxx area that i need to upload
[21:46] and i think some more pdfs in the ERIC_EJ74xxxx and ERIC_EJ75xxxx
[21:49] SketchCow: i may not be able to do UC Berkeley videos
[21:49] i'm drowning in my own data that i need to upload
[21:50] if i can get another hard drive this weekend then maybe i can help
[21:51] *** tpw_rules has quit IRC (Read error: Operation timed out)
[21:53] *** odemg has joined #archiveteam-bs
[21:53] *** odemg has quit IRC (Connection closed)
[21:54] *** odemg has joined #archiveteam-bs
[22:01] *** Lord_Nigh has quit IRC (Read error: Operation timed out)
[22:09] i will see about saving space by re-downloading the PC Magazine
[22:09] cause the newer copy will be like Infoworld and save +60% space
[22:10] which will be about 60gb on my end
[22:10] *** tpw_rules has joined #archiveteam-bs
[22:11] *** Lord_Nigh has joined #archiveteam-bs
[22:14] *** tpw_rules has quit IRC (Read error: Operation timed out)
[22:26] *** schbirid has quit IRC (Quit: Leaving)
[22:33] *** tpw_rules has joined #archiveteam-bs
[22:38] *** icedice has quit IRC (Ping timeout: 244 seconds)
[22:43] *** icedice has joined #archiveteam-bs
[23:15] godane: Do not do them
[23:17] ok
[23:18] anyways uploading 20 more eric pdfs that are missing
[23:20] Thanks, godane
[23:27] *** odemg has quit IRC (Remote host closed the connection)
[23:39] *** LastNinja has quit IRC (Ping timeout: 245 seconds)
[23:50] *** Aranje has quit IRC (Quit: Three sheets to the wind)
[23:56] *** Aranje has joined #archiveteam-bs