#archiveteam 2016-07-10,Sun

↑back Search

Time Nickname Message
00:01 πŸ”— DiscantX has joined #archiveteam
00:16 πŸ”— DoomTay has joined #archiveteam
00:17 πŸ”— namespace has joined #archiveteam
00:20 πŸ”— JesseW has quit IRC (Ping timeout: 370 seconds)
00:34 πŸ”— WinterFox has joined #archiveteam
00:38 πŸ”— rsanek has joined #archiveteam
00:39 πŸ”— rsanek WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD
00:42 πŸ”— namespace " Google Groups: "Gone within a year" (SketchCow, 2016-06-07). "
00:42 πŸ”— namespace Couldn't find anything with google.
00:42 πŸ”— namespace Source?
00:42 πŸ”— Frogging rsanek: What is your quest with the wiki, friend?
00:43 πŸ”— rsanek just wanted to edit a date, though I found the secret in an irc log
00:43 πŸ”— Frogging ah okay :p
00:44 πŸ”— Frogging i guess we're fine as long as spambots don't figure that one out
00:44 πŸ”— Frogging ;p
00:44 πŸ”— rsanek yeah lets hope
00:44 πŸ”— rsanek has quit IRC (Quit: Page closed)
00:44 πŸ”— Frogging yeah, bye
00:44 πŸ”— philpem has quit IRC (Remote host closed the connection)
00:46 πŸ”— Sue_ has quit IRC (Read error: Operation timed out)
00:48 πŸ”— namespace Do we have any crawlers that can do JS?
00:48 πŸ”— namespace Google Groups is pure JS slurry, at least to get the machine readable DOM part of it.
00:49 πŸ”— philpem has joined #archiveteam
00:50 πŸ”— Frogging we do, ArchiveBot does phantomJS but I think putting a general purpose crawler onto something as big as that would be asking for trouble
00:50 πŸ”— Frogging but it is possible, since that's what you're asking
00:51 πŸ”— namespace Noted.
00:51 πŸ”— namespace How would you handle a behemoth of that size then?
00:52 πŸ”— Frogging Warrior job
00:52 πŸ”— namespace (I wanted to do this in high school, but I was technically incapable at the time.)
00:52 πŸ”— namespace (I can probably actually write up the warrior scripts now.)
00:58 πŸ”— JesseW has joined #archiveteam
01:01 πŸ”— BlueMaxim has joined #archiveteam
01:01 πŸ”— SDr has quit IRC ()
01:17 πŸ”— JesseW has quit IRC (Quit: Leaving.)
01:17 πŸ”— JesseW has joined #archiveteam
01:36 πŸ”— DiscantX has quit IRC (Ping timeout: 244 seconds)
02:13 πŸ”— DiscantX has joined #archiveteam
02:20 πŸ”— DiscantX has quit IRC (Ping timeout: 244 seconds)
02:32 πŸ”— philpem has quit IRC (Ping timeout: 260 seconds)
02:55 πŸ”— DiscantX has joined #archiveteam
03:04 πŸ”— DiscantX has quit IRC (Ping timeout: 244 seconds)
03:12 πŸ”— ravetcofx has quit IRC (Ping timeout: 506 seconds)
03:20 πŸ”— ravetcofx has joined #archiveteam
03:22 πŸ”— Coderjoe has quit IRC (Read error: Connection reset by peer)
03:30 πŸ”— Coderjoe has joined #archiveteam
04:24 πŸ”— RichardG has quit IRC (Ping timeout: 258 seconds)
04:28 πŸ”— ravetcofx has quit IRC (Ping timeout: 506 seconds)
04:42 πŸ”— ravetcofx has joined #archiveteam
04:49 πŸ”— Kitaru has joined #archiveteam
04:54 πŸ”— ravetcofx has quit IRC (Read error: Operation timed out)
04:55 πŸ”— Kitaru has quit IRC (Quit: This computer has gone to sleep)
05:00 πŸ”— Sk1d has quit IRC (Ping timeout: 194 seconds)
05:03 πŸ”— SketchCow ahahahhahaha
05:03 πŸ”— SketchCow It's called someone leaked the info to me
05:04 πŸ”— metalcamp has joined #archiveteam
05:06 πŸ”— ravetcofx has joined #archiveteam
05:06 πŸ”— Sk1d has joined #archiveteam
05:08 πŸ”— DFJustin there's still a robots.txt bug preventing google groups from being viewable in wayback https://web.archive.org/web/20110514012530/http://groups.google.com/group/google.public.support.general/msg/d88f36fb3e2c0aac
05:09 πŸ”— DFJustin it does seem to be working better now on other sites though
05:10 πŸ”— DoomTay foxbox.tv seems to be "working"
05:10 πŸ”— DoomTay That is, it's not affected anymore, but it turns out that a good chunk of stuff is gone-gone
05:19 πŸ”— metal_cam has joined #archiveteam
05:20 πŸ”— metalcamp has quit IRC (Ping timeout: 244 seconds)
05:21 πŸ”— ndiddy has quit IRC (Quit: Leaving)
05:44 πŸ”— Jeroen52 has quit IRC (Ping timeout: 260 seconds)
05:48 πŸ”— Jeroen52 has joined #archiveteam
05:52 πŸ”— tomwsmf-a has joined #archiveteam
06:00 πŸ”— JesseW has quit IRC (Ping timeout: 370 seconds)
06:38 πŸ”— DoomTay has quit IRC (Quit: Page closed)
06:55 πŸ”— namespace SketchCow: K.
06:56 πŸ”— namespace Also wow I don't know if you tried scouting the directory structure of Groups, but it's really bad. All the top level categories have random numbers (at least in so far as I can tell, they're random). Then each post inside of a group has a unique (random?) ID.
06:56 πŸ”— namespace Wondering if it's not random and actually just a hex string or something.
06:57 πŸ”— PurpleSym namespace: We can use the JWT(?) API.
06:58 πŸ”— PurpleSym I’ve seen scripts on GitHub, but I can’t find them anymore.
06:59 πŸ”— PurpleSym *GWT
07:00 πŸ”— anjacks0n has joined #archiveteam
07:30 πŸ”— anjacks0n has quit IRC (anjacks0n)
07:33 πŸ”— ravetcofx has quit IRC (Read error: Operation timed out)
07:43 πŸ”— ravetcofx has joined #archiveteam
07:53 πŸ”— tomwsmf-a has quit IRC (Read error: Operation timed out)
07:54 πŸ”— anjacks0n has joined #archiveteam
07:57 πŸ”— anjacks0n has quit IRC (anjacks0n)
08:04 πŸ”— ravetcofx has quit IRC (Read error: Operation timed out)
08:17 πŸ”— ravetcofx has joined #archiveteam
08:36 πŸ”— ravetcofx has quit IRC (Remote host closed the connection)
09:04 πŸ”— robink has quit IRC (Ping timeout: 633 seconds)
09:13 πŸ”— robink has joined #archiveteam
09:32 πŸ”— pfallenop has quit IRC (Ping timeout: 244 seconds)
09:34 πŸ”— pfallenop has joined #archiveteam
09:41 πŸ”— Emcy has quit IRC (Read error: Operation timed out)
09:45 πŸ”— Emcy has joined #archiveteam
10:20 πŸ”— Tomcat_ has joined #archiveteam
10:38 πŸ”— Tomcat_ has quit IRC (Ping timeout: 258 seconds)
10:40 πŸ”— philpem has joined #archiveteam
10:48 πŸ”— kristian_ has joined #archiveteam
11:00 πŸ”— luckcolor PurpleSym: gggd actually only uses rss for updating exstisting crawls
11:00 πŸ”— luckcolor wrong chat
11:17 πŸ”— Tomcat_ has joined #archiveteam
11:35 πŸ”— Tomcat_ has quit IRC (Remote host closed the connection)
11:57 πŸ”— signius has quit IRC (Ping timeout: 260 seconds)
12:11 πŸ”— signius has joined #archiveteam
12:56 πŸ”— anjacks0n has joined #archiveteam
13:07 πŸ”— anjacks0n has quit IRC (anjacks0n)
13:14 πŸ”— dashcloud has quit IRC (Read error: Connection reset by peer)
13:15 πŸ”— dashcloud has joined #archiveteam
13:20 πŸ”— BartoCH has quit IRC (Ping timeout: 260 seconds)
13:20 πŸ”— BartoCH has joined #archiveteam
13:25 πŸ”— BartoCH has quit IRC (Ping timeout: 260 seconds)
13:52 πŸ”— anjacks0n has joined #archiveteam
13:57 πŸ”— anjacks0n has quit IRC (anjacks0n)
14:07 πŸ”— VADemon has joined #archiveteam
14:28 πŸ”— ndiddy has joined #archiveteam
14:45 πŸ”— WinterFox has quit IRC (Read error: Operation timed out)
14:53 πŸ”— anjacks0n has joined #archiveteam
15:12 πŸ”— BartoCH has joined #archiveteam
15:14 πŸ”— BlueMaxim has quit IRC (Quit: Leaving)
15:17 πŸ”— JesseW has joined #archiveteam
15:18 πŸ”— RichardG has joined #archiveteam
15:52 πŸ”— ravetcofx has joined #archiveteam
15:52 πŸ”— RichardG has quit IRC (Read error: Operation timed out)
15:53 πŸ”— RichardG has joined #archiveteam
16:00 πŸ”— anjacks0n has quit IRC (anjacks0n)
16:01 πŸ”— JesseW has quit IRC (Ping timeout: 370 seconds)
16:04 πŸ”— BartoCH has quit IRC (Ping timeout: 260 seconds)
16:22 πŸ”— BartoCH has joined #archiveteam
16:25 πŸ”— anjacks0n has joined #archiveteam
16:36 πŸ”— anjacks0n has quit IRC (anjacks0n)
16:44 πŸ”— Kitaru has joined #archiveteam
16:48 πŸ”— DoomTay has joined #archiveteam
16:50 πŸ”— Medowar_ has joined #archiveteam
16:51 πŸ”— Medowar_ has quit IRC (Remote host closed the connection)
16:52 πŸ”— namespace has quit IRC (Read error: Operation timed out)
17:00 πŸ”— banderas6 has joined #archiveteam
17:04 πŸ”— VADemon has quit IRC (Quit: left4dead)
17:05 πŸ”— kristian_ has quit IRC (Leaving)
17:05 πŸ”— banderas6 has quit IRC (Ping timeout: 268 seconds)
17:29 πŸ”— tomwsmf-a has joined #archiveteam
17:29 πŸ”— schbirid has joined #archiveteam
17:38 πŸ”— anjacks0n has joined #archiveteam
17:52 πŸ”— anjacks0n has quit IRC (anjacks0n)
17:58 πŸ”— db48x has quit IRC (Read error: Connection reset by peer)
17:59 πŸ”— anjacks0n has joined #archiveteam
18:30 πŸ”— db48x has joined #archiveteam
18:36 πŸ”— VADemon has joined #archiveteam
18:53 πŸ”— DiscantX has joined #archiveteam
18:58 πŸ”— Kitaru has quit IRC (Quit: This computer has gone to sleep)
19:00 πŸ”— DiscantX has quit IRC (Ping timeout: 244 seconds)
19:01 πŸ”— JesseW has joined #archiveteam
19:11 πŸ”— Kitaru has joined #archiveteam
19:14 πŸ”— JesseW has quit IRC (Ping timeout: 370 seconds)
19:16 πŸ”— DiscantX has joined #archiveteam
19:28 πŸ”— dashcloud has quit IRC (Read error: Operation timed out)
19:33 πŸ”— dashcloud has joined #archiveteam
19:36 πŸ”— REiN^ has quit IRC ()
19:51 πŸ”— tomwsmf-a has quit IRC (Read error: Operation timed out)
19:53 πŸ”— Kitaru has quit IRC (Quit: This computer has gone to sleep)
20:06 πŸ”— metal_cam has quit IRC (Ping timeout: 250 seconds)
20:07 πŸ”— metalcamp has joined #archiveteam
20:12 πŸ”— schbirid has quit IRC (Quit: Leaving)
20:29 πŸ”— DiscantX has quit IRC (Ping timeout: 244 seconds)
20:31 πŸ”— DoomTay has quit IRC (Quit: Page closed)
20:39 πŸ”— xXx_ndidd has joined #archiveteam
20:40 πŸ”— ndiddy has quit IRC (Ping timeout: 244 seconds)
21:01 πŸ”— REiN^ has joined #archiveteam
21:24 πŸ”— Kitaru has joined #archiveteam
21:27 πŸ”— JesseW has joined #archiveteam
21:27 πŸ”— metalcamp has quit IRC (Ping timeout: 244 seconds)
21:35 πŸ”— Kitaru has quit IRC (Quit: This computer has gone to sleep)
21:36 πŸ”— VADemon has quit IRC (Quit: left4dead)
21:38 πŸ”— Kitaru has joined #archiveteam
21:57 πŸ”— Start_ has joined #archiveteam
21:57 πŸ”— Start has quit IRC (Read error: Connection reset by peer)
22:30 πŸ”— dashcloud has quit IRC (Read error: Connection reset by peer)
22:32 πŸ”— dashcloud has joined #archiveteam
23:07 πŸ”— JesseW Anyone happen to have a copy of wikipedia-logs-2001-08-17.7z (used to be at http://noc.wikimedia.org/~tstarling/wikipedia-logs-2001-08-17.7z six years ago)? IA search doesn't turn up a copy...
23:15 πŸ”— db48x JesseW: https://web.archive.org/web/20130501000000*/http://noc.wikimedia.org/~tstarling/wikipedia-logs-2001-08-17.7z
23:17 πŸ”— JesseW strange, when I looked I didn't find that
23:22 πŸ”— DoomTay has joined #archiveteam
23:27 πŸ”— divingk has joined #archiveteam
23:28 πŸ”— divingk Good god.
23:28 πŸ”— divingk Digging through a bunch of games and finding their source code.
23:28 πŸ”— divingk And I've only dug through games on three platforms, tops.
23:30 πŸ”— JesseW divingk: say more?
23:30 πŸ”— divingk Well...it's interesting to say the least.
23:30 πŸ”— divingk What I've been doing is using Astrogrep over ROM collections.
23:31 πŸ”— divingk I knew I would find bits of code, but I wasn't aware of the potential scale behind this.
23:31 πŸ”— JesseW what do you mean "finding their source code" -- where are you finding it? Included in the ROMs, or?
23:31 πŸ”— divingk Yes, source code accidentally included in ROMs.
23:31 πŸ”— divingk I can provide a lot of examples of this.
23:31 πŸ”— JesseW neat!
23:31 πŸ”— divingk https://tcrf.net/Ometron
23:31 πŸ”— JesseW That's very good, no?
23:31 πŸ”— divingk https://tcrf.net/Invasion_(ZX_Spectrum,_Bulldog_Software)
23:32 πŸ”— JesseW More interesting data to examine and learn from.
23:32 πŸ”— divingk Good, but I can't help but think there are many games out there with this sort of thing/
23:32 πŸ”— divingk I haven't looked into the C64 library.
23:32 πŸ”— divingk No doubt it's one of the most interesting things to find in a game,
23:32 πŸ”— divingk that is depending on how long said fragments are.
23:33 πŸ”— divingk For instance, here's one case where most of the code was discovered: https://tcrf.net/Exodus_(ZX_Spectrum,_Firebird_Software)
23:33 πŸ”— divingk Whereas here, there's only a snippet: https://tcrf.net/Robotron:_2084_(ZX_Spectrum)
23:34 πŸ”— divingk Most of the ones found so far are on the ZX Spectrum.
23:34 πŸ”— divingk I've found some on the Amstrad CPC too, plus I wrote up one for the Supervision.
23:35 πŸ”— divingk https://tcrf.net/Arcade_Flight_Simulator_(ZX_Spectrum)
23:35 πŸ”— divingk A rare example of a Codemasters game with code sprawling about.
23:36 πŸ”— divingk Early Ocean games, like Hunchback or Eskimo Eddie, also have bits of code.
23:36 πŸ”— divingk https://tcrf.net/Hunchback_(ZX_Spectrum)
23:36 πŸ”— divingk But yeah, curious if anyone here knows about this...
23:36 πŸ”— JesseW (you may want to move this to #archiveteam-bs, as this channel is generally reserved for quick announcements, rather than longer discussions)
23:36 πŸ”— divingk Oh.
23:36 πŸ”— divingk Mind if I copy and paste what I said here over to there?
23:37 πŸ”— JesseW Better to just link it from the public log (which I'll do)
23:38 πŸ”— RichardG has quit IRC (Read error: Connection reset by peer)
23:38 πŸ”— RichardG has joined #archiveteam
23:42 πŸ”— JesseW Hm, the wiki doesn't seem to have an entry for "The Cutting Room Floor" (video game history site: https://tcrf.net/ ) yet -- someone should add one.
23:45 πŸ”— WinterFox has joined #archiveteam
23:47 πŸ”— BlueMaxim has joined #archiveteam

irclogger-viewer