#archiveteam-bs 2016-06-23,Thu

↑back Search

Time Nickname Message
00:08 πŸ”— RichardG_ has joined #archiveteam-bs
00:15 πŸ”— RichardG has quit IRC (Ping timeout: 499 seconds)
00:16 πŸ”— RichardG_ is now known as RichardG
00:20 πŸ”— ris has quit IRC ()
00:21 πŸ”— JesseW has joined #archiveteam-bs
00:25 πŸ”— VADemon has quit IRC (left4dead)
00:27 πŸ”— DoomTay has quit IRC (Ping timeout: 268 seconds)
00:34 πŸ”— DoomTay has joined #archiveteam-bs
00:55 πŸ”— Jeroen52 has quit IRC (Ping timeout: 260 seconds)
00:58 πŸ”— coretx has quit IRC (Ping timeout: 506 seconds)
01:02 πŸ”— JesseW has quit IRC (Ping timeout: 370 seconds)
01:03 πŸ”— Jeroen52 has joined #archiveteam-bs
01:11 πŸ”— coretx has joined #archiveteam-bs
01:37 πŸ”— mutoso has joined #archiveteam-bs
01:37 πŸ”— mutoso_ has quit IRC (Read error: Connection reset by peer)
01:41 πŸ”— davidar_ has quit IRC (Quit: Connection closed for inactivity)
01:55 πŸ”— arkiver has quit IRC (Read error: Operation timed out)
01:57 πŸ”— arkiver has joined #archiveteam-bs
02:00 πŸ”— zenguy has quit IRC (Ping timeout: 370 seconds)
02:01 πŸ”— zenguy has joined #archiveteam-bs
02:02 πŸ”— dcmorton has quit IRC (Ping timeout: 370 seconds)
02:03 πŸ”— winr5r has joined #archiveteam-bs
02:03 πŸ”— winr4r has quit IRC (Read error: Operation timed out)
02:07 πŸ”— dcmorton has joined #archiveteam-bs
02:09 πŸ”— BlueMaxim has quit IRC (Read error: Operation timed out)
02:09 πŸ”— dcmorton has quit IRC (Excess Flood)
02:10 πŸ”— dcmorton has joined #archiveteam-bs
02:10 πŸ”— dcmorton has quit IRC (Excess Flood)
02:11 πŸ”— dcmorton has joined #archiveteam-bs
02:12 πŸ”— BlueMaxim has joined #archiveteam-bs
02:35 πŸ”— dcmorton has quit IRC (Ping timeout: 370 seconds)
02:41 πŸ”— dcmorton has joined #archiveteam-bs
02:56 πŸ”— Coderjoe has quit IRC (Read error: Operation timed out)
03:06 πŸ”— dcmorton has quit IRC (Ping timeout: 370 seconds)
03:12 πŸ”— dcmorton has joined #archiveteam-bs
03:21 πŸ”— Coderjoe has joined #archiveteam-bs
03:21 πŸ”— nickname_ has joined #archiveteam-bs
03:44 πŸ”— dcmorton has quit IRC (Excess Flood)
03:47 πŸ”— dcmorton has joined #archiveteam-bs
04:04 πŸ”— JesseW has joined #archiveteam-bs
04:16 πŸ”— dcmorton has quit IRC (Max SendQ exceeded)
04:19 πŸ”— dcmorton has joined #archiveteam-bs
04:40 πŸ”— DFJustin has quit IRC (Remote host closed the connection)
04:42 πŸ”— DFJustin has joined #archiveteam-bs
04:42 πŸ”— swebb sets mode: +o DFJustin
04:43 πŸ”— nickname_ has quit IRC (Read error: Operation timed out)
05:01 πŸ”— jut has joined #archiveteam-bs
05:01 πŸ”— Sk1d has quit IRC (Ping timeout: 250 seconds)
05:10 πŸ”— Sk1d has joined #archiveteam-bs
05:11 πŸ”— hook54321 has quit IRC (Quit: Connection closed for inactivity)
05:30 πŸ”— antomati_ has quit IRC (Ping timeout: 258 seconds)
05:50 πŸ”— godane SketchCow: i'm up to 2015 with deadspin.com grab
05:51 πŸ”— godane i'm uploading 2013 to 2015 right now of it
05:51 πŸ”— godane i'm also grab 2016-01- to 2016-05 of deadspin.com
05:52 πŸ”— JesseW godane: btw, we're working on grabbing GSOC web pages right now in #archivebot -- your help could probably be useful
05:58 πŸ”— DoomTay has quit IRC (Quit: Page closed)
06:11 πŸ”— Cameron_D has quit IRC (Ping timeout: 370 seconds)
06:17 πŸ”— Cameron_D has joined #archiveteam-bs
06:22 πŸ”— BlueMaxim has quit IRC (Read error: Operation timed out)
06:24 πŸ”— BlueMaxim has joined #archiveteam-bs
06:25 πŸ”— aschmitz has quit IRC (Read error: Operation timed out)
06:48 πŸ”— JesseW has quit IRC (Ping timeout: 370 seconds)
06:58 πŸ”— aschmitz has joined #archiveteam-bs
07:18 πŸ”— vtyl has joined #archiveteam-bs
07:18 πŸ”— lytv has quit IRC (Ping timeout: 258 seconds)
07:58 πŸ”— tomwsmf-a has quit IRC (Read error: Operation timed out)
08:25 πŸ”— schbirid has joined #archiveteam-bs
09:01 πŸ”— antomatic has joined #archiveteam-bs
09:01 πŸ”— swebb sets mode: +o antomatic
09:02 πŸ”— antomatic has quit IRC (Client Quit)
09:09 πŸ”— antomatic has joined #archiveteam-bs
09:09 πŸ”— swebb sets mode: +o antomatic
09:16 πŸ”— dashcloud has quit IRC (Read error: Operation timed out)
09:20 πŸ”— dashcloud has joined #archiveteam-bs
10:45 πŸ”— anjacks0n has joined #archiveteam-bs
10:53 πŸ”— anjacks0n has quit IRC (anjacks0n)
11:15 πŸ”— signius has quit IRC (Ping timeout: 260 seconds)
11:22 πŸ”— signius has joined #archiveteam-bs
11:40 πŸ”— anjacks0n has joined #archiveteam-bs
11:41 πŸ”— anjacks0n has quit IRC (Client Quit)
11:48 πŸ”— hook54321 has joined #archiveteam-bs
12:24 πŸ”— jut has quit IRC (Read error: Connection reset by peer)
12:37 πŸ”— anjacks0n has joined #archiveteam-bs
12:53 πŸ”— Boppen has joined #archiveteam-bs
12:58 πŸ”— BlueMaxim has quit IRC (Quit: Leaving)
13:02 πŸ”— anjacks0n has quit IRC (anjacks0n)
13:23 πŸ”— anjacks0n has joined #archiveteam-bs
13:41 πŸ”— anjacks0n has quit IRC (anjacks0n)
13:42 πŸ”— anjacks0n has joined #archiveteam-bs
13:47 πŸ”— kristian_ has joined #archiveteam-bs
13:47 πŸ”— anjacks0n has quit IRC (anjacks0n)
13:48 πŸ”— anjacks0n has joined #archiveteam-bs
13:49 πŸ”— anjacks0n has quit IRC (Client Quit)
13:50 πŸ”— anjacks0n has joined #archiveteam-bs
13:50 πŸ”— anjacks0n has quit IRC (Client Quit)
13:54 πŸ”— SketchCow So, don't spread to social media or post anywhere...
13:54 πŸ”— SketchCow ...there's a new beta version of the next iteration of the Wayback machine.
13:57 πŸ”— SketchCow https://web-beta.archive.org
13:57 πŸ”— SketchCow Please consider yourselves invited to bang the living shit out of it.
13:58 πŸ”— SketchCow If you hit something SUPER broken, mail Mark at mark@archive.org.
13:58 πŸ”— SketchCow He's head of Wayback
14:00 πŸ”— Atluxity cool
14:06 πŸ”— HCross it shows the source of the crawl, that is awesome. https://wayback-beta.archive.org/web/20160312075544/http://www.whtimes.co.uk/home :)
14:07 πŸ”— Frogging is that good? it may lead to people going after WARCs to get them darked
14:08 πŸ”— HCross wouldnt they just contact the IA and go "delete xxxx.co.uk please"
14:08 πŸ”— HCross anyway, without the warc
14:09 πŸ”— Frogging eh, maybe they'd just throw robots.txt at it
14:09 πŸ”— Frogging dunno, just idle speculation :p
14:11 πŸ”— hook54321 has quit IRC (Quit: Connection closed for inactivity)
14:13 πŸ”— DoomTay has joined #archiveteam-bs
14:17 πŸ”— SketchCow For everyone who wishes they could look at 1,500 of my Japan photos: https://www.flickr.com/photos/textfiles/albums/72157669136764700
14:18 πŸ”— anjacks0n has joined #archiveteam-bs
14:25 πŸ”— DoomTay So how's that GCI sweeping going in?
14:37 πŸ”— j08nY has joined #archiveteam-bs
14:38 πŸ”— anjacks0n has quit IRC (anjacks0n)
14:39 πŸ”— nickname_ has joined #archiveteam-bs
15:20 πŸ”— VADemon has joined #archiveteam-bs
15:40 πŸ”— Kenshin has quit IRC (Remote host closed the connection)
15:46 πŸ”— JesseW has joined #archiveteam-bs
15:48 πŸ”— kristian_ has quit IRC (Leaving)
15:48 πŸ”— nickname_ has quit IRC (Read error: Operation timed out)
15:50 πŸ”— Kenshin has joined #archiveteam-bs
15:53 πŸ”— nickname_ has joined #archiveteam-bs
16:11 πŸ”— JesseW has quit IRC (Ping timeout: 370 seconds)
16:13 πŸ”— RichardG has quit IRC (Quit: Keyboard not found, press F1 to continue)
16:16 πŸ”— nickname_ has quit IRC (Read error: Connection reset by peer)
16:21 πŸ”— RichardG has joined #archiveteam-bs
17:17 πŸ”— joepie91 SketchCow: minor UI bug that makes it nigh impossible to click the "why" items in the domain timeline because the chart is overlapping it.... should I report that via that email as well, or is that just for severe functionality breakage?
17:22 πŸ”— joepie91 meh, found some breakage, I'll just combine it into one email
17:23 πŸ”— metalcamp has joined #archiveteam-bs
17:24 πŸ”— joepie91 hm. Google is not visible due to robots.txt, in the current wayback machine? wut?
17:25 πŸ”— Stilett0 has quit IRC ()
17:30 πŸ”— DoomTay Works fine for me
17:34 πŸ”— * joepie91 plays QA engineer
17:34 πŸ”— joepie91 up to 6 issues: 2 functionality issues, 2 UI quirks, 2 possible UI improvements
17:52 πŸ”— VADemon has quit IRC (Quit: left4dead)
17:53 πŸ”— VADemon has joined #archiveteam-bs
18:05 πŸ”— JW_work has quit IRC (Quit: Leaving.)
18:07 πŸ”— mutoso has quit IRC (Read error: Operation timed out)
18:08 πŸ”— JW_work has joined #archiveteam-bs
18:09 πŸ”— mutoso has joined #archiveteam-bs
18:23 πŸ”— ris has joined #archiveteam-bs
18:27 πŸ”— luckcolor guys just installed latest update of wpull 2.0.1
18:27 πŸ”— luckcolor Traceback (most recent call last):
18:27 πŸ”— luckcolor File "/usr/local/bin/grab-site", line 4, in <module>
18:27 πŸ”— luckcolor main.main()
18:27 πŸ”— luckcolor File "/usr/local/lib/python3.4/site-packages/click/core.py", line 716, in __call__
18:27 πŸ”— luckcolor return self.main(*args, **kwargs)
18:27 πŸ”— luckcolor File "/usr/local/lib/python3.4/site-packages/click/core.py", line 696, in main
18:27 πŸ”— luckcolor rv = self.invoke(ctx)
18:28 πŸ”— luckcolor File "/usr/local/lib/python3.4/site-packages/click/core.py", line 889, in invoke
18:28 πŸ”— luckcolor return ctx.invoke(self.callback, **ctx.params)
18:28 πŸ”— luckcolor File "/usr/local/lib/python3.4/site-packages/click/core.py", line 534, in invoke
18:28 πŸ”— luckcolor return callback(*args, **kwargs)
18:28 πŸ”— luckcolor File "/usr/local/lib/python3.4/site-packages/libgrabsite/main.py", line 359, in main
18:28 πŸ”— luckcolor from wpull.app import Application
18:28 πŸ”— luckcolor ImportError: No module named 'wpull.app'
18:28 πŸ”— luckcolor any ideas?
18:28 πŸ”— luckcolor already tried to re install it
18:43 πŸ”— Meroje leftover .pyc files ?
18:44 πŸ”— luckcolor mmh
18:44 πŸ”— luckcolor i'll do a rm -r *.pyc
18:44 πŸ”— Meroje this is not recursive
18:45 πŸ”— Meroje I usually do `find . -name '*.pyc' -delete`
18:46 πŸ”— Meroje (I missed the -r in your command, sorry)
18:47 πŸ”— luckcolor nope
18:47 πŸ”— luckcolor doesn't work
18:47 πŸ”— luckcolor maybe it's grabsite?
18:48 πŸ”— luckcolor i'll reboot archivebot and se if it has errors
18:53 πŸ”— arrith has quit IRC (Read error: Operation timed out)
19:03 πŸ”— luckcolor ok archivebot works
19:03 πŸ”— luckcolor wich means is a a grab.site bug
19:03 πŸ”— luckcolor *grab-site
19:05 πŸ”— luckcolor will file issue if someone can make a patch soon it would be amazing
19:06 πŸ”— godane SketchCow: deadspin.com is up to 2016-05 now and all uploaded
19:06 πŸ”— godane i'm grabbing gizmodo.com right now
19:11 πŸ”— luckcolor https://github.com/ludios/grab-site/issues/92
19:11 πŸ”— luckcolor for who is interested
19:13 πŸ”— joepie91 [20:46] <Meroje> (I missed the -r in your command, sorry)
19:13 πŸ”— joepie91 still wouldn't make it recursive I think
19:14 πŸ”— joepie91 since you're only selecting *.pyc and not all folders
19:14 πŸ”— joepie91 you'd need something like... **/*.pyc?
19:14 πŸ”— Meroje yeah I thought of the expansion after that
19:14 πŸ”— joepie91 ("zero or more path segments containing anything, followed by *.pyc")
19:15 πŸ”— luckcolor anywatΓ¬ys that wasn't the problem
19:15 πŸ”— luckcolor well
19:15 πŸ”— luckcolor for what i know ofc
19:49 πŸ”— Stiletto has joined #archiveteam-bs
20:09 πŸ”— DoomTay Okay, I started getting WARCs of a site that is going to change in the 20th. What's the next step? How do I get these into Wayback Machine?
20:12 πŸ”— tomwsmf-a has joined #archiveteam-bs
20:15 πŸ”— schbirid has quit IRC (Quit: Leaving)
21:06 πŸ”— arkiver DoomTay: what site
21:06 πŸ”— DoomTay portalgraphics.net
21:07 πŸ”— DoomTay Yes, I sicced ArchiveBot on that site twice, though I remember that yipdw kinds threw a fit the second time
21:08 πŸ”— DoomTay Plusi f there's another adavantage to the way I'm doing it now, cookie injection means the site will always come out in English
21:09 πŸ”— arkiver why not just use /en/ for english?
21:10 πŸ”— DoomTay Hmm..lemme try that real quick....
21:12 πŸ”— DoomTay Agh, knew it. Did no good for http://www.portalgraphics.net/pg/illust/?image_id=90308. Putting "&lang=en" did no good either
21:14 πŸ”— arkiver What cookie are you using
21:14 πŸ”— arkiver (haven't had a look at them yet)
21:14 πŸ”— DoomTay langset=en
21:17 πŸ”— decay has quit IRC (Read error: Operation timed out)
21:17 πŸ”— decay has joined #archiveteam-bs
21:17 πŸ”— Lord_Nigh has quit IRC (Read error: Operation timed out)
21:17 πŸ”— arkiver And why English? It looks like the website wants to server Japanese by default.
21:18 πŸ”— arkiver so I'm not sure if you'd want to force english
21:18 πŸ”— Lord_Nigh has joined #archiveteam-bs
21:18 πŸ”— arkiver maybe do both
21:18 πŸ”— arkiver Also, a normal grab probably won't grab http://www.portalgraphics.net/pg/illust/?image_id=90301 correctly
21:19 πŸ”— ring has quit IRC (Read error: Operation timed out)
21:19 πŸ”— luckcolor has quit IRC (Read error: Operation timed out)
21:19 πŸ”— SilSte has quit IRC (Read error: Operation timed out)
21:19 πŸ”— j08nY has quit IRC (Read error: Operation timed out)
21:19 πŸ”— MrRadar has quit IRC (Read error: Operation timed out)
21:19 πŸ”— arkiver it looks like the flash player loads http://www.portalgraphics.net/pg/movie/address.php?image%5Fid=90301
21:19 πŸ”— DoomTay Oh, right
21:19 πŸ”— MrRadar has joined #archiveteam-bs
21:19 πŸ”— luckcolor has joined #archiveteam-bs
21:19 πŸ”— DoomTay Hmm...
21:19 πŸ”— chazchaz_ has quit IRC (Read error: Operation timed out)
21:19 πŸ”— arkiver which contains info, movie and image
21:19 πŸ”— chazchaz has joined #archiveteam-bs
21:19 πŸ”— DoomTay Apart from that, language selection seems to be random
21:20 πŸ”— DoomTay Hell, I don't know if it would even be possible to save both
21:20 πŸ”— Fletcher has quit IRC (Read error: Operation timed out)
21:20 πŸ”— DoomTay And havethem both on Wayback Machine
21:20 πŸ”— alfie has quit IRC (Read error: Operation timed out)
21:20 πŸ”— DoomTay Well, at least wget could pull them both
21:20 πŸ”— arkiver It is possible. But language selection in the Wayback Machine would be 'random' too
21:20 πŸ”— arkiver But it doesn't save the video items correctly
21:21 πŸ”— brayden has quit IRC (Read error: Operation timed out)
21:21 πŸ”— alfie has joined #archiveteam-bs
21:21 πŸ”— Fletcher has joined #archiveteam-bs
21:21 πŸ”— Baljem_ has quit IRC (Read error: Connection reset by peer)
21:22 πŸ”— ring has joined #archiveteam-bs
21:22 πŸ”— SilSte has joined #archiveteam-bs
21:22 πŸ”— Baljem has joined #archiveteam-bs
21:23 πŸ”— joepie91 has quit IRC (Excess Flood)
21:23 πŸ”— DoomTay At least we know the movie file is at http://www.portalgraphics.net/data/movie/90000/90301.mp4
21:23 πŸ”— DoomTay The URL would be pretty easy to guess for others
21:24 πŸ”— DoomTay I could probably fix the "not accessed properly" part on Wayback Machine with a userscript when it comes time
21:24 πŸ”— arkiver the movie path is in http://www.portalgraphics.net/pg/movie/address.php?image%5Fid=90301
21:24 πŸ”— arkiver <uri type="movie" href="http://www.portalgraphics.net/pg/movie/movie.php?movie_path=90000/90301" />
21:24 πŸ”— arkiver http://www.portalgraphics.net/pg/movie/movie.php?movie_path=90000/90301 redirects to http://www.portalgraphics.net/data/movie/90000/90301.mp4
21:25 πŸ”— joepie91 has joined #archiveteam-bs
21:25 πŸ”— midas sets mode: +o joepie91
21:29 πŸ”— arkiver also, URLs like http://www.portalgraphics.net/pg/movie/pg_player/res_movie_data.php?mid=90301&lang=en won't be extracted by wget or wpull from http://www.portalgraphics.net/pg/illust/?image_id=90301
21:29 πŸ”— Aranje has quit IRC (Quit: Three sheets to the wind)
21:29 πŸ”— arkiver they also contain URLs that should be grabbed
21:32 πŸ”— Stiletto has quit IRC (Ping timeout: 260 seconds)
21:33 πŸ”— DoomTay Hmm
21:38 πŸ”— Aranje has joined #archiveteam-bs
21:51 πŸ”— hook54321 has joined #archiveteam-bs
21:56 πŸ”— Stiletto has joined #archiveteam-bs
22:05 πŸ”— metalcamp has quit IRC (Ping timeout: 244 seconds)
22:16 πŸ”— HCross https://en.wikipedia.org/wiki/Wikipedia:TWA/Portal this is pretty cool
22:21 πŸ”— DoomTay Huh, there's http://www.portalgraphics.net/lang.php?lang=en&url=http://www.portalgraphics.net/pg/illust/?image_id=90301
22:22 πŸ”— DoomTay Okay, never mind, that completely failed
22:37 πŸ”— DoomTay Still, getting that "information page" and fullsize image for each thing would be miles better than nothing
22:39 πŸ”— JesseW has joined #archiveteam-bs
22:46 πŸ”— DoomTay Besides, the URL for each of those can be guessed easily for other images
22:46 πŸ”— Aranje has quit IRC (Quit: Three sheets to the wind)
22:51 πŸ”— arkiver when is portalgraphics closing
23:10 πŸ”— JesseW has quit IRC (Ping timeout: 370 seconds)
23:18 πŸ”— BlueMaxim has joined #archiveteam-bs
23:20 πŸ”— DoomTay has quit IRC (Ping timeout: 268 seconds)
23:39 πŸ”— DoomTay has joined #archiveteam-bs
23:39 πŸ”— DoomTay Well, it's not closing per se, by on 7/20, they will be deleting accound user data and associated data, according to http://www.portalgraphics.net/pg/guide/news20160520.html

irclogger-viewer