#archiveteam-bs 2017-03-15,Wed

↑back Search

Time Nickname Message
00:00 🔗 nyany what's good everyone
00:00 🔗 joepie91 https://twitter.com/CityPolicePIPCU/status/841695126665736194
00:00 🔗 joepie91 :|
00:00 🔗 SketchCow Fried Tofu
00:00 🔗 SketchCow Surprised me, too
00:01 🔗 nyany https://twitter.com/CityPolicePIPCU/status/841699007973928960
00:01 🔗 nyany Since WHEN were torrents illegal
00:01 🔗 nyany e.e
00:02 🔗 joepie91 surprisingly not covered on torrentfreak yet
00:03 🔗 SketchCow I have a 2gb file sitting here that's been here for months, because I kind of forgot what it is.
00:07 🔗 SketchCow It's..... 2gb of World of Warcraft Armory XML
00:07 🔗 SketchCow Why do I have this.
00:09 🔗 pizzaiolo because why not
00:09 🔗 SketchCow Yeah but I don't even know how I got to getting it now.
00:17 🔗 SketchCow Anyway, it's flung on the Archive and I can keep going with my cleanups
00:17 🔗 SketchCow I learned a lot importing the Frostbyte, mostly how I need to be more careful in the future
00:33 🔗 joepie91 https://twitter.com/maddow/status/841795163664089089
00:33 🔗 joepie91 "BREAKING: We've got Trump tax returns. Tonight, 9pm ET. MSNBC. (Seriously)."
00:37 🔗 SketchCow Yeah
00:37 🔗 SketchCow It's blowing up
00:37 🔗 SketchCow This is a good time for everyone not in the US to know about Bartnicki v. Vopper
00:38 🔗 SketchCow Which says, basically, fuck it, we got it, we're the press, suck a dick
00:46 🔗 passerby_ has joined #archiveteam-bs
00:48 🔗 pizzaiolo I love the (Seriously)
00:48 🔗 pizzaiolo inb4 his tax returns are empty spreadsheets
00:49 🔗 GE has quit IRC (Remote host closed the connection)
00:52 🔗 passerby has quit IRC (Read error: Operation timed out)
00:57 🔗 odemg has joined #archiveteam-bs
01:09 🔗 Pudsey has joined #archiveteam-bs
01:21 🔗 Pudsey has quit IRC (Remote host closed the connection)
01:21 🔗 passerby has joined #archiveteam-bs
01:24 🔗 passerby_ has quit IRC (Read error: Operation timed out)
01:24 🔗 RichardG has quit IRC (Read error: Operation timed out)
01:28 🔗 RichardG has joined #archiveteam-bs
01:29 🔗 passerby_ has joined #archiveteam-bs
01:32 🔗 passerby_ has quit IRC (Client Quit)
01:32 🔗 passerby has quit IRC (Ping timeout: 492 seconds)
01:32 🔗 passerby has joined #archiveteam-bs
01:53 🔗 pnJay has quit IRC (Leaving)
01:53 🔗 j08nY has quit IRC (Quit: Leaving)
02:16 🔗 Pudsey has joined #archiveteam-bs
02:18 🔗 Pudsey has quit IRC (Remote host closed the connection)
02:20 🔗 JensRex Trump tax returns... oh lord, Reddit is going to be an even more insuffereable place than usual.
02:34 🔗 BlueMaxim has joined #archiveteam-bs
02:39 🔗 winr4r has quit IRC (Read error: Operation timed out)
02:45 🔗 ndiddy has quit IRC ()
03:00 🔗 winr4r has joined #archiveteam-bs
03:04 🔗 Coderjo has quit IRC (Ping timeout: 260 seconds)
03:24 🔗 Coderjo has joined #archiveteam-bs
03:38 🔗 Coderjo has quit IRC (Remote host closed the connection)
03:45 🔗 RichardG has quit IRC (Read error: Operation timed out)
03:45 🔗 RichardG has joined #archiveteam-bs
03:46 🔗 Somebody2 Sorry for the flaky connection a few days ago. And thanks Lord_Nigh for speaking up for me. (now sent to right channel)
03:57 🔗 pizzaiolo has left
05:05 🔗 RichardG has quit IRC (Read error: Operation timed out)
05:05 🔗 RichardG has joined #archiveteam-bs
05:25 🔗 HCross2 Is it about time to do another census?
05:28 🔗 BlueMaxim has quit IRC (Read error: Operation timed out)
05:30 🔗 Muad-Dib has quit IRC (Ping timeout: 260 seconds)
05:36 🔗 Somebody2 HCross2: yes please!
05:36 🔗 HCross2 Ill take a look over the weekend
05:36 🔗 Somebody2 If you're willing to step up, I'd be delighted to troubleshoot any problems you run into.
05:37 🔗 HCross2 There are scripts on the wiki
05:37 🔗 HCross2 Somebody2: what timezone are you in?
05:37 🔗 Somebody2 Pacific Time (same as IA).
05:38 🔗 Somebody2 But I'm up at weird hours.
05:38 🔗 Somebody2 and unavailable during usual work hours.
05:38 🔗 Somebody2 And yes, the scripts on the wiki are where I'd suggest you start.
05:39 🔗 HCross2 I'll do some reading over it all. I'm in GMT so we may get the odd timezone issue. I've already briefly read the scripts, it looks simple ish
05:41 🔗 Somebody2 Yep, mostly what had me stuck was uncertanity about what kinds/parts of the freely-downloadable data IA would prefer not to be aggregated and published.
05:51 🔗 icedice has joined #archiveteam-bs
05:54 🔗 Sk1d has quit IRC (Ping timeout: 250 seconds)
05:57 🔗 Muad-Dib has joined #archiveteam-bs
06:01 🔗 Sk1d has joined #archiveteam-bs
06:06 🔗 bwn HCross2: i can help as much as i can as well
06:11 🔗 icedice has quit IRC (Quit: Leaving)
06:32 🔗 RichardG has quit IRC (Read error: Operation timed out)
06:32 🔗 RichardG has joined #archiveteam-bs
06:37 🔗 Yurume has quit IRC (Read error: Operation timed out)
06:39 🔗 Yurume has joined #archiveteam-bs
07:00 🔗 BlueMaxim has joined #archiveteam-bs
07:00 🔗 GE has joined #archiveteam-bs
07:13 🔗 Honno has joined #archiveteam-bs
07:29 🔗 masterX24 has joined #archiveteam-bs
07:29 🔗 masterX24 back now after sleeping, still needing guidance on the upload of the crawl
07:43 🔗 masterX24 paging xmc and SketchCow
07:47 🔗 godane looks like the buck sexton show ended on theblaze
07:52 🔗 Coderjo has joined #archiveteam-bs
07:54 🔗 odemg has quit IRC (Remote host closed the connection)
07:55 🔗 godane i'm grabbing it by hourly mp3s cause its easier to grab
07:56 🔗 godane also cause jan 27 and 31 was only the first 2 hours
07:57 🔗 kristian_ has joined #archiveteam-bs
08:25 🔗 RichardG has quit IRC (Read error: Operation timed out)
08:25 🔗 RichardG has joined #archiveteam-bs
08:28 🔗 antomatic has joined #archiveteam-bs
08:28 🔗 swebb sets mode: +o antomatic
08:31 🔗 antomati_ has quit IRC (Ping timeout: 244 seconds)
08:37 🔗 schbirid has joined #archiveteam-bs
08:40 🔗 Riviera has joined #archiveteam-bs
08:53 🔗 SketchCow Whut
08:53 🔗 SketchCow I'd say hold off for a bit, please
08:54 🔗 SketchCow (Upload of crawl)
08:56 🔗 Honno has quit IRC (Ping timeout: 370 seconds)
08:56 🔗 Jonison has joined #archiveteam-bs
09:12 🔗 masterX24 crawl is currently taking 10% of my webserver's disk capacity, (downloading to home computer and then uploading later isnt a viable option due to very slow uplink at home)
09:16 🔗 masterX24 and holding off how long? @ sketchcow ?
09:55 🔗 bwn has quit IRC (Read error: Operation timed out)
10:00 🔗 JAA has quit IRC (Quit: Page closed)
10:01 🔗 JAA has joined #archiveteam-bs
10:02 🔗 Coderjo I wish I could mirror the entire Canopus/GVG software download space. I hate that it is all hidden away behind a registration system. Even for ancient products like my Canopus ADVC-300
10:10 🔗 j08nY has joined #archiveteam-bs
10:13 🔗 JensRex Urgent: Need a tracker admin to requeue app.net jobs.
10:17 🔗 bwn has joined #archiveteam-bs
10:32 🔗 pnJay has joined #archiveteam-bs
10:52 🔗 j08nY has quit IRC (Read error: Operation timed out)
10:55 🔗 GE has quit IRC (Remote host closed the connection)
11:13 🔗 pizzaiolo has joined #archiveteam-bs
11:15 🔗 BartoCH has quit IRC (Ping timeout: 260 seconds)
11:17 🔗 j08nY has joined #archiveteam-bs
11:27 🔗 BartoCH has joined #archiveteam-bs
11:49 🔗 kristian_ has quit IRC (Quit: Leaving)
11:54 🔗 RichardG has quit IRC (Read error: Operation timed out)
11:54 🔗 RichardG has joined #archiveteam-bs
11:58 🔗 j08nY has quit IRC (Read error: Operation timed out)
12:07 🔗 Honno has joined #archiveteam-bs
12:33 🔗 j08nY has joined #archiveteam-bs
12:57 🔗 RichardG has quit IRC (Read error: Operation timed out)
12:58 🔗 RichardG has joined #archiveteam-bs
13:02 🔗 BlueMaxim has quit IRC (Read error: Operation timed out)
13:20 🔗 RichardG has quit IRC (Read error: Operation timed out)
13:20 🔗 RichardG has joined #archiveteam-bs
13:28 🔗 Honno_ has joined #archiveteam-bs
13:32 🔗 Honno has quit IRC (Ping timeout: 370 seconds)
13:53 🔗 GE has joined #archiveteam-bs
13:55 🔗 RichardG has quit IRC (Read error: Operation timed out)
13:55 🔗 RichardG has joined #archiveteam-bs
14:22 🔗 RichardG has quit IRC (Read error: Operation timed out)
14:22 🔗 RichardG has joined #archiveteam-bs
14:31 🔗 kyounko|2 has quit IRC (Read error: Connection reset by peer)
14:59 🔗 RichardG has quit IRC (Read error: Operation timed out)
14:59 🔗 RichardG has joined #archiveteam-bs
15:25 🔗 RichardG has quit IRC (Read error: Operation timed out)
15:25 🔗 RichardG has joined #archiveteam-bs
15:29 🔗 sep332 has joined #archiveteam-bs
15:32 🔗 sep332_ has quit IRC (Ping timeout: 260 seconds)
15:32 🔗 antomatic has quit IRC (Read error: Connection reset by peer)
15:33 🔗 antomatic has joined #archiveteam-bs
15:33 🔗 swebb sets mode: +o antomatic
15:53 🔗 masterX24 has quit IRC (Ping timeout: 268 seconds)
16:42 🔗 j08nY has quit IRC (Remote host closed the connection)
16:47 🔗 Stiletto has quit IRC ()
17:18 🔗 odemg has joined #archiveteam-bs
17:22 🔗 icedice has joined #archiveteam-bs
17:32 🔗 Stilett0 has joined #archiveteam-bs
17:46 🔗 godane so i'm uploading koreanet-1 changwon world
17:46 🔗 godane its kids singing basicly
17:48 🔗 godane first one i could find: https://archive.org/details/koreanet-1_changwon_world-20010801
17:54 🔗 me is now known as yipdw
18:10 🔗 Roelandus has joined #archiveteam-bs
18:10 🔗 SmileyG has joined #archiveteam-bs
18:11 🔗 SmileyG has quit IRC (Client Quit)
18:25 🔗 ndiddy has joined #archiveteam-bs
18:28 🔗 Coderjo I have a (potentially incomplete) mobileme item that does not appear to be in the IA collection. (itemname is "inclusive.solutions")
18:29 🔗 Coderjo I found it while trying to clean up disk space on a system I am nearing quota on
18:29 🔗 Coderjo what should I do with it?
18:41 🔗 Roelandus Could you explain "mobileme item"?
18:41 🔗 Coderjo http://archiveteam.org/index.php?title=MobileMe
18:41 🔗 Coderjo from that grab effort
18:42 🔗 xmc aaaages ago
18:44 🔗 Coderjo indeed. I've been gone for awhile, mainly due to personal stuff.
18:48 🔗 schbirid can i make wpull strip things from URLs when recursing?
18:48 🔗 schbirid eg instead of grabbing a gazillion copies of http://media-cdn.sueddeutsche.de/image/sz.1.440537/135x101?v=1357596425000 with varying timestamps
18:48 🔗 schbirid grab http://media-cdn.sueddeutsche.de/image/sz.1.440537/135x101 once?
18:48 🔗 schbirid 's#\?v=##' :}
18:54 🔗 j08nY has joined #archiveteam-bs
19:03 🔗 Coderjo gone from AT stuff, that is.
19:09 🔗 pizzaiolo has quit IRC (Ping timeout: 260 seconds)
19:23 🔗 MrRadar schbirid: You could probably do that with a plugin script. I know Archivebot/grab-site does something similar to strip session IDs from URLs
19:23 🔗 MrRadar Exactly *how*, I couldn't tell you
19:32 🔗 schbirid 8)
19:48 🔗 GE has quit IRC (Remote host closed the connection)
19:57 🔗 RichardG has quit IRC (Read error: Operation timed out)
19:57 🔗 RichardG has joined #archiveteam-bs
20:22 🔗 RichardG has quit IRC (Read error: Operation timed out)
20:22 🔗 RichardG has joined #archiveteam-bs
20:27 🔗 odemg has quit IRC (Remote host closed the connection)
20:36 🔗 rocode nightpool, it is common for open source business acquisitions. I generally grab a snapshot of the webpage anyways just in case.
20:37 🔗 rocode Be careful of gitter however, it is a walking tarpit because of how their pages update.
20:42 🔗 nightpool rocode: I think the /archives pages are all basically static html?
20:45 🔗 rocode Yeah, but you will need to whitelist the URL.
20:47 🔗 nightpool Yeah, just straight crawls aren't going to work
20:48 🔗 odemg has joined #archiveteam-bs
20:53 🔗 Aranje has quit IRC (Quit: Three sheets to the wind)
20:57 🔗 Aranje has joined #archiveteam-bs
21:14 🔗 GE has joined #archiveteam-bs
21:23 🔗 MrRadar Ars has new article praising emulation for the purpose of game preservation: https://arstechnica.com/gaming/2017/03/how-emulation-helped-save-two-video-game-rarities/
21:24 🔗 RichardG has quit IRC (Read error: Operation timed out)
21:24 🔗 RichardG has joined #archiveteam-bs
21:40 🔗 kristian_ has joined #archiveteam-bs
21:57 🔗 BlueMaxim has joined #archiveteam-bs
22:15 🔗 kristian_ has quit IRC (Quit: Leaving)
22:24 🔗 schbirid has quit IRC (Quit: Leaving)
22:27 🔗 ranma is there any reason to back up something on archive bot if it's already on IA?
22:27 🔗 ranma I can't remember
22:30 🔗 xmc not really
22:30 🔗 xmc if the IA grab is incomplete, that's a reason
22:49 🔗 rocode If it isn't actually an item on IA, it is probably a partial grab.
23:02 🔗 MrRadar One reason to grab something through Archivebot is you can download the archivebot WARC whereas IA keeps their raw scrape data private
23:02 🔗 xmc yes, also that
23:11 🔗 Aranje has quit IRC (Quit: Three sheets to the wind)
23:15 🔗 ranma kk thanks
23:16 🔗 bottymcbo has joined #archiveteam-bs
23:16 🔗 bottymcbo KeyError: Identifier('#archiveteam-bs') (file "/usr/local/lib/python3.5/dist-packages/sopel/coretasks.py", line 363, in track_join)
23:16 🔗 JensRex oh ffs you idiotic bot.
23:16 🔗 bottymcbo has quit IRC (Client Quit)
23:19 🔗 JAA Interesting. I never noticed that those "liveweb" items on IA aren't downloadable.
23:19 🔗 xmc JensRex: no bots that talk in #archiveteam or #archiveteam-bs, please
23:20 🔗 JAA Not that it would make much sense to do so (unless you're backing up the Wayback Machine), given that those would contain all sorts of unrelated things
23:20 🔗 JensRex xmc: I know. It's not supposed to spew errors in chat... testing in seperate channel now.
23:48 🔗 RichardG has quit IRC (Read error: Operation timed out)
23:48 🔗 RichardG has joined #archiveteam-bs

irclogger-viewer