Time |
Nickname |
Message |
00:01
π
|
|
DiscantX has joined #archiveteam |
00:16
π
|
|
DoomTay has joined #archiveteam |
00:17
π
|
|
namespace has joined #archiveteam |
00:20
π
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
00:34
π
|
|
WinterFox has joined #archiveteam |
00:38
π
|
|
rsanek has joined #archiveteam |
00:39
π
|
rsanek |
WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD |
00:42
π
|
namespace |
" Google Groups: "Gone within a year" (SketchCow, 2016-06-07). " |
00:42
π
|
namespace |
Couldn't find anything with google. |
00:42
π
|
namespace |
Source? |
00:42
π
|
Frogging |
rsanek: What is your quest with the wiki, friend? |
00:43
π
|
rsanek |
just wanted to edit a date, though I found the secret in an irc log |
00:43
π
|
Frogging |
ah okay :p |
00:44
π
|
Frogging |
i guess we're fine as long as spambots don't figure that one out |
00:44
π
|
Frogging |
;p |
00:44
π
|
rsanek |
yeah let's hope |
00:44
π
|
|
rsanek has quit IRC (Quit: Page closed) |
00:44
π
|
Frogging |
yeah, bye |
00:44
π
|
|
philpem has quit IRC (Remote host closed the connection) |
00:46
π
|
|
Sue_ has quit IRC (Read error: Operation timed out) |
00:48
π
|
namespace |
Do we have any crawlers that can do JS? |
00:48
π
|
namespace |
Google Groups is pure JS slurry, at least to get the machine readable DOM part of it. |
00:49
π
|
|
philpem has joined #archiveteam |
00:50
π
|
Frogging |
we do, ArchiveBot does phantomJS but I think putting a general purpose crawler onto something as big as that would be asking for trouble |
00:50
π
|
Frogging |
but it is possible, since that's what you're asking |
00:51
π
|
namespace |
Noted. |
00:51
π
|
namespace |
How would you handle a behemoth of that size then? |
00:52
π
|
Frogging |
Warrior job |
00:52
π
|
namespace |
(I wanted to do this in high school, but I was technically incapable at the time.) |
00:52
π
|
namespace |
(I can probably actually write up the warrior scripts now.) |
00:58
π
|
|
JesseW has joined #archiveteam |
01:01
π
|
|
BlueMaxim has joined #archiveteam |
01:01
π
|
|
SDr has quit IRC () |
01:17
π
|
|
JesseW has quit IRC (Quit: Leaving.) |
01:17
π
|
|
JesseW has joined #archiveteam |
01:36
π
|
|
DiscantX has quit IRC (Ping timeout: 244 seconds) |
02:13
π
|
|
DiscantX has joined #archiveteam |
02:20
π
|
|
DiscantX has quit IRC (Ping timeout: 244 seconds) |
02:32
π
|
|
philpem has quit IRC (Ping timeout: 260 seconds) |
02:55
π
|
|
DiscantX has joined #archiveteam |
03:04
π
|
|
DiscantX has quit IRC (Ping timeout: 244 seconds) |
03:12
π
|
|
ravetcofx has quit IRC (Ping timeout: 506 seconds) |
03:20
π
|
|
ravetcofx has joined #archiveteam |
03:22
π
|
|
Coderjoe has quit IRC (Read error: Connection reset by peer) |
03:30
π
|
|
Coderjoe has joined #archiveteam |
04:24
π
|
|
RichardG has quit IRC (Ping timeout: 258 seconds) |
04:28
π
|
|
ravetcofx has quit IRC (Ping timeout: 506 seconds) |
04:42
π
|
|
ravetcofx has joined #archiveteam |
04:49
π
|
|
Kitaru has joined #archiveteam |
04:54
π
|
|
ravetcofx has quit IRC (Read error: Operation timed out) |
04:55
π
|
|
Kitaru has quit IRC (Quit: This computer has gone to sleep) |
05:00
π
|
|
Sk1d has quit IRC (Ping timeout: 194 seconds) |
05:03
π
|
SketchCow |
ahahahhahaha |
05:03
π
|
SketchCow |
It's called someone leaked the info to me |
05:04
π
|
|
metalcamp has joined #archiveteam |
05:06
π
|
|
ravetcofx has joined #archiveteam |
05:06
π
|
|
Sk1d has joined #archiveteam |
05:08
π
|
DFJustin |
there's still a robots.txt bug preventing google groups from being viewable in wayback https://web.archive.org/web/20110514012530/http://groups.google.com/group/google.public.support.general/msg/d88f36fb3e2c0aac |
05:09
π
|
DFJustin |
it does seem to be working better now on other sites though |
05:10
π
|
DoomTay |
foxbox.tv seems to be "working" |
05:10
π
|
DoomTay |
That is, it's not affected anymore, but it turns out that a good chunk of stuff is gone-gone |
05:19
π
|
|
metal_cam has joined #archiveteam |
05:20
π
|
|
metalcamp has quit IRC (Ping timeout: 244 seconds) |
05:21
π
|
|
ndiddy has quit IRC (Quit: Leaving) |
05:44
π
|
|
Jeroen52 has quit IRC (Ping timeout: 260 seconds) |
05:48
π
|
|
Jeroen52 has joined #archiveteam |
05:52
π
|
|
tomwsmf-a has joined #archiveteam |
06:00
π
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
06:38
π
|
|
DoomTay has quit IRC (Quit: Page closed) |
06:55
π
|
namespace |
SketchCow: K. |
06:56
π
|
namespace |
Also wow I don't know if you tried scouting the directory structure of Groups, but it's really bad. All the top level categories have random numbers (at least in so far as I can tell, they're random). Then each post inside of a group has a unique (random?) ID. |
06:56
π
|
namespace |
Wondering if it's not random and actually just a hex string or something. |
06:57
π
|
PurpleSym |
namespace: We can use the JWT(?) API. |
06:58
π
|
PurpleSym |
I've seen scripts on GitHub, but I can't find them anymore. |
06:59
π
|
PurpleSym |
*GWT |
07:00
π
|
|
anjacks0n has joined #archiveteam |
07:30
π
|
|
anjacks0n has quit IRC (anjacks0n) |
07:33
π
|
|
ravetcofx has quit IRC (Read error: Operation timed out) |
07:43
π
|
|
ravetcofx has joined #archiveteam |
07:53
π
|
|
tomwsmf-a has quit IRC (Read error: Operation timed out) |
07:54
π
|
|
anjacks0n has joined #archiveteam |
07:57
π
|
|
anjacks0n has quit IRC (anjacks0n) |
08:04
π
|
|
ravetcofx has quit IRC (Read error: Operation timed out) |
08:17
π
|
|
ravetcofx has joined #archiveteam |
08:36
π
|
|
ravetcofx has quit IRC (Remote host closed the connection) |
09:04
π
|
|
robink has quit IRC (Ping timeout: 633 seconds) |
09:13
π
|
|
robink has joined #archiveteam |
09:32
π
|
|
pfallenop has quit IRC (Ping timeout: 244 seconds) |
09:34
π
|
|
pfallenop has joined #archiveteam |
09:41
π
|
|
Emcy has quit IRC (Read error: Operation timed out) |
09:45
π
|
|
Emcy has joined #archiveteam |
10:20
π
|
|
Tomcat_ has joined #archiveteam |
10:38
π
|
|
Tomcat_ has quit IRC (Ping timeout: 258 seconds) |
10:40
π
|
|
philpem has joined #archiveteam |
10:48
π
|
|
kristian_ has joined #archiveteam |
11:00
π
|
luckcolor |
PurpleSym: gggd actually only uses rss for updating existing crawls |
11:00
π
|
luckcolor |
wrong chat |
11:17
π
|
|
Tomcat_ has joined #archiveteam |
11:35
π
|
|
Tomcat_ has quit IRC (Remote host closed the connection) |
11:57
π
|
|
signius has quit IRC (Ping timeout: 260 seconds) |
12:11
π
|
|
signius has joined #archiveteam |
12:56
π
|
|
anjacks0n has joined #archiveteam |
13:07
π
|
|
anjacks0n has quit IRC (anjacks0n) |
13:14
π
|
|
dashcloud has quit IRC (Read error: Connection reset by peer) |
13:15
π
|
|
dashcloud has joined #archiveteam |
13:20
π
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
13:20
π
|
|
BartoCH has joined #archiveteam |
13:25
π
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
13:52
π
|
|
anjacks0n has joined #archiveteam |
13:57
π
|
|
anjacks0n has quit IRC (anjacks0n) |
14:07
π
|
|
VADemon has joined #archiveteam |
14:28
π
|
|
ndiddy has joined #archiveteam |
14:45
π
|
|
WinterFox has quit IRC (Read error: Operation timed out) |
14:53
π
|
|
anjacks0n has joined #archiveteam |
15:12
π
|
|
BartoCH has joined #archiveteam |
15:14
π
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
15:17
π
|
|
JesseW has joined #archiveteam |
15:18
π
|
|
RichardG has joined #archiveteam |
15:52
π
|
|
ravetcofx has joined #archiveteam |
15:52
π
|
|
RichardG has quit IRC (Read error: Operation timed out) |
15:53
π
|
|
RichardG has joined #archiveteam |
16:00
π
|
|
anjacks0n has quit IRC (anjacks0n) |
16:01
π
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
16:04
π
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
16:22
π
|
|
BartoCH has joined #archiveteam |
16:25
π
|
|
anjacks0n has joined #archiveteam |
16:36
π
|
|
anjacks0n has quit IRC (anjacks0n) |
16:44
π
|
|
Kitaru has joined #archiveteam |
16:48
π
|
|
DoomTay has joined #archiveteam |
16:50
π
|
|
Medowar_ has joined #archiveteam |
16:51
π
|
|
Medowar_ has quit IRC (Remote host closed the connection) |
16:52
π
|
|
namespace has quit IRC (Read error: Operation timed out) |
17:00
π
|
|
banderas6 has joined #archiveteam |
17:04
π
|
|
VADemon has quit IRC (Quit: left4dead) |
17:05
π
|
|
kristian_ has quit IRC (Leaving) |
17:05
π
|
|
banderas6 has quit IRC (Ping timeout: 268 seconds) |
17:29
π
|
|
tomwsmf-a has joined #archiveteam |
17:29
π
|
|
schbirid has joined #archiveteam |
17:38
π
|
|
anjacks0n has joined #archiveteam |
17:52
π
|
|
anjacks0n has quit IRC (anjacks0n) |
17:58
π
|
|
db48x has quit IRC (Read error: Connection reset by peer) |
17:59
π
|
|
anjacks0n has joined #archiveteam |
18:30
π
|
|
db48x has joined #archiveteam |
18:36
π
|
|
VADemon has joined #archiveteam |
18:53
π
|
|
DiscantX has joined #archiveteam |
18:58
π
|
|
Kitaru has quit IRC (Quit: This computer has gone to sleep) |
19:00
π
|
|
DiscantX has quit IRC (Ping timeout: 244 seconds) |
19:01
π
|
|
JesseW has joined #archiveteam |
19:11
π
|
|
Kitaru has joined #archiveteam |
19:14
π
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
19:16
π
|
|
DiscantX has joined #archiveteam |
19:28
π
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
19:33
π
|
|
dashcloud has joined #archiveteam |
19:36
π
|
|
REiN^ has quit IRC () |
19:51
π
|
|
tomwsmf-a has quit IRC (Read error: Operation timed out) |
19:53
π
|
|
Kitaru has quit IRC (Quit: This computer has gone to sleep) |
20:06
π
|
|
metal_cam has quit IRC (Ping timeout: 250 seconds) |
20:07
π
|
|
metalcamp has joined #archiveteam |
20:12
π
|
|
schbirid has quit IRC (Quit: Leaving) |
20:29
π
|
|
DiscantX has quit IRC (Ping timeout: 244 seconds) |
20:31
π
|
|
DoomTay has quit IRC (Quit: Page closed) |
20:39
π
|
|
xXx_ndidd has joined #archiveteam |
20:40
π
|
|
ndiddy has quit IRC (Ping timeout: 244 seconds) |
21:01
π
|
|
REiN^ has joined #archiveteam |
21:24
π
|
|
Kitaru has joined #archiveteam |
21:27
π
|
|
JesseW has joined #archiveteam |
21:27
π
|
|
metalcamp has quit IRC (Ping timeout: 244 seconds) |
21:35
π
|
|
Kitaru has quit IRC (Quit: This computer has gone to sleep) |
21:36
π
|
|
VADemon has quit IRC (Quit: left4dead) |
21:38
π
|
|
Kitaru has joined #archiveteam |
21:57
π
|
|
Start_ has joined #archiveteam |
21:57
π
|
|
Start has quit IRC (Read error: Connection reset by peer) |
22:30
π
|
|
dashcloud has quit IRC (Read error: Connection reset by peer) |
22:32
π
|
|
dashcloud has joined #archiveteam |
23:07
π
|
JesseW |
Anyone happen to have a copy of wikipedia-logs-2001-08-17.7z (used to be at http://noc.wikimedia.org/~tstarling/wikipedia-logs-2001-08-17.7z six years ago)? IA search doesn't turn up a copy... |
23:15
π
|
db48x |
JesseW: https://web.archive.org/web/20130501000000*/http://noc.wikimedia.org/~tstarling/wikipedia-logs-2001-08-17.7z |
23:17
π
|
JesseW |
strange, when I looked I didn't find that |
23:22
π
|
|
DoomTay has joined #archiveteam |
23:27
π
|
|
divingk has joined #archiveteam |
23:28
π
|
divingk |
Good god. |
23:28
π
|
divingk |
Digging through a bunch of games and finding their source code. |
23:28
π
|
divingk |
And I've only dug through games on three platforms, tops. |
23:30
π
|
JesseW |
divingk: say more? |
23:30
π
|
divingk |
Well...it's interesting to say the least. |
23:30
π
|
divingk |
What I've been doing is using Astrogrep over ROM collections. |
23:31
π
|
divingk |
I knew I would find bits of code, but I wasn't aware of the potential scale behind this. |
23:31
π
|
JesseW |
what do you mean "finding their source code" -- where are you finding it? Included in the ROMs, or? |
23:31
π
|
divingk |
Yes, source code accidentally included in ROMs. |
23:31
π
|
divingk |
I can provide a lot of examples of this. |
23:31
π
|
JesseW |
neat! |
23:31
π
|
divingk |
https://tcrf.net/Ometron |
23:31
π
|
JesseW |
That's very good, no? |
23:31
π
|
divingk |
https://tcrf.net/Invasion_(ZX_Spectrum,_Bulldog_Software) |
23:32
π
|
JesseW |
More interesting data to examine and learn from. |
23:32
π
|
divingk |
Good, but I can't help but think there are many games out there with this sort of thing. |
23:32
π
|
divingk |
I haven't looked into the C64 library. |
23:32
π
|
divingk |
No doubt it's one of the most interesting things to find in a game, |
23:32
π
|
divingk |
that is depending on how long said fragments are. |
23:33
π
|
divingk |
For instance, here's one case where most of the code was discovered: https://tcrf.net/Exodus_(ZX_Spectrum,_Firebird_Software) |
23:33
π
|
divingk |
Whereas here, there's only a snippet: https://tcrf.net/Robotron:_2084_(ZX_Spectrum) |
23:34
π
|
divingk |
Most of the ones found so far are on the ZX Spectrum. |
23:34
π
|
divingk |
I've found some on the Amstrad CPC too, plus I wrote up one for the Supervision. |
23:35
π
|
divingk |
https://tcrf.net/Arcade_Flight_Simulator_(ZX_Spectrum) |
23:35
π
|
divingk |
A rare example of a Codemasters game with code sprawling about. |
23:36
π
|
divingk |
Early Ocean games, like Hunchback or Eskimo Eddie, also have bits of code. |
23:36
π
|
divingk |
https://tcrf.net/Hunchback_(ZX_Spectrum) |
23:36
π
|
divingk |
But yeah, curious if anyone here knows about this... |
23:36
π
|
JesseW |
(you may want to move this to #archiveteam-bs, as this channel is generally reserved for quick announcements, rather than longer discussions) |
23:36
π
|
divingk |
Oh. |
23:36
π
|
divingk |
Mind if I copy and paste what I said here over to there? |
23:37
π
|
JesseW |
Better to just link it from the public log (which I'll do) |
23:38
π
|
|
RichardG has quit IRC (Read error: Connection reset by peer) |
23:38
π
|
|
RichardG has joined #archiveteam |
23:42
π
|
JesseW |
Hm, the wiki doesn't seem to have an entry for "The Cutting Room Floor" (video game history site: https://tcrf.net/ ) yet -- someone should add one. |
23:45
π
|
|
WinterFox has joined #archiveteam |
23:47
π
|
|
BlueMaxim has joined #archiveteam |