#archiveteam-bs 2020-09-17,Thu

↑back Search

Time Nickname Message
00:04 🔗 benjins has joined #archiveteam-bs
00:58 🔗 JAA OrIdow6: You asked for the Microsoft Download Center pages. Sorry, took a bit longer, but here they are: https://archive.org/details/microsoft_download_center_pages_20200804
01:03 🔗 JAA Oh wait, mixed up some files, more coming.
01:36 🔗 wyatt8750 has joined #archiveteam-bs
01:36 🔗 wyatt8740 has quit IRC (Read error: Operation timed out)
01:43 🔗 JAA OrIdow6: https://archive.org/download/microsoft_download_center_pages_202008
02:10 🔗 wyatt8750 has quit IRC (Read error: Operation timed out)
02:12 🔗 wyatt8740 has joined #archiveteam-bs
02:17 🔗 OrIdow6 JAA: Thanks
03:07 🔗 lennier2 has joined #archiveteam-bs
03:08 🔗 qw3rty_ has joined #archiveteam-bs
03:09 🔗 lennier1 has quit IRC (Read error: Operation timed out)
03:09 🔗 lennier2 is now known as lennier1
03:12 🔗 Ctrl has quit IRC (Read error: Operation timed out)
03:13 🔗 SJon_____ has quit IRC (Read error: Operation timed out)
03:14 🔗 SJon_____ has joined #archiveteam-bs
03:15 🔗 qw3rty__ has quit IRC (Read error: Operation timed out)
03:16 🔗 bsmith093 has quit IRC (Read error: Operation timed out)
03:18 🔗 bsmith093 has joined #archiveteam-bs
03:19 🔗 klg has joined #archiveteam-bs
03:20 🔗 jodizzle_ has joined #archiveteam-bs
03:27 🔗 klg_ has quit IRC (Read error: Operation timed out)
03:28 🔗 jodizzle has quit IRC (Read error: Operation timed out)
03:28 🔗 jodizzle_ is now known as jodizzle
04:33 🔗 Jonboy345 has quit IRC (Read error: Operation timed out)
04:35 🔗 mtntmnky_ has quit IRC (Remote host closed the connection)
04:35 🔗 mtntmnky_ has joined #archiveteam-bs
04:37 🔗 lennier2 has joined #archiveteam-bs
04:41 🔗 lennier1 has quit IRC (Ping timeout: 272 seconds)
04:41 🔗 lennier2 is now known as lennier1
04:48 🔗 mtntmnky_ has quit IRC (Remote host closed the connection)
04:49 🔗 mtntmnky_ has joined #archiveteam-bs
06:27 🔗 Ctrl has joined #archiveteam-bs
06:51 🔗 HP_Archiv has joined #archiveteam-bs
06:51 🔗 britmob has quit IRC (Read error: Operation timed out)
06:52 🔗 britmob has joined #archiveteam-bs
06:52 🔗 britmob4 has joined #archiveteam-bs
06:55 🔗 betamax has quit IRC (Read error: Operation timed out)
06:55 🔗 britm0b has quit IRC (Read error: Operation timed out)
06:57 🔗 betamax has joined #archiveteam-bs
07:01 🔗 betamax has quit IRC (Read error: Operation timed out)
07:01 🔗 betamax has joined #archiveteam-bs
07:20 🔗 HP_Archiv has quit IRC (Read error: Connection reset by peer)
07:57 🔗 MaximeleG has joined #archiveteam-bs
08:50 🔗 BlueMax has quit IRC (Read error: Connection reset by peer)
09:02 🔗 wyatt8740 has quit IRC (Ping timeout: 260 seconds)
09:02 🔗 wyatt8740 has joined #archiveteam-bs
11:03 🔗 MaximeleG has quit IRC (Quit: MaximeleG)
11:07 🔗 MaximeleG has joined #archiveteam-bs
11:29 🔗 jshoard has joined #archiveteam-bs
12:16 🔗 dashcloud has quit IRC (Read error: Operation timed out)
14:03 🔗 britmob_ has joined #archiveteam-bs
14:09 🔗 britmob has quit IRC (Read error: Operation timed out)
14:09 🔗 britmob4 is now known as britmob
16:07 🔗 Arcorann has quit IRC (Read error: Connection reset by peer)
16:22 🔗 SketchCow I am PRETTY sure this is the case already but we should probably make all new project channels go to hackint.
16:22 🔗 SketchCow But explicitly mark so in announcements and wiki
16:50 🔗 dashcloud has joined #archiveteam-bs
17:00 🔗 britmob_ has quit IRC (Read error: Connection reset by peer)
17:05 🔗 britmob_ has joined #archiveteam-bs
17:06 🔗 Mateon1 has quit IRC (Quit: Mateon1)
17:06 🔗 Mateon1 has joined #archiveteam-bs
17:24 🔗 dashcloud has quit IRC (Read error: Operation timed out)
17:25 🔗 lennier2 has joined #archiveteam-bs
17:28 🔗 lennier1 has quit IRC (Ping timeout: 260 seconds)
17:28 🔗 lennier2 is now known as lennier1
17:45 🔗 lunik1 has joined #archiveteam-bs
17:50 🔗 JAA cm: Did you mean -k rather than -x on wget?
17:51 🔗 JAA If -k doesn't handle the percent-encoding, that sounds like a bug to me.
17:53 🔗 JAA It looks like it's supposed to convert ? to %3F per http://git.savannah.gnu.org/cgit/wget.git/tree/src/convert.c?id=314a4f42be3c969aadc1cef9f5859f8a61b7ca82#n722
17:54 🔗 JAA But only when --adjust-extension (-E) is also used.
18:01 🔗 cm the problem is how to store the files while maintain a mapping with the original url
18:02 🔗 JAA Do you need that mapping, or do you just need internal links to work?
18:02 🔗 cm wget -x creates directories so that the output file path mirrors the url (idk how reliably)
18:03 🔗 cm i would like to keep the mapping so i know which url was requested
18:03 🔗 JAA Well, then you need to do some web server magic.
18:03 🔗 cm so i can avoid requesting urls i already have
18:04 🔗 cm web server magic isn't an option since this is controlled by my web host
18:04 🔗 SJon_____ has quit IRC (Read error: Connection reset by peer)
18:04 🔗 JAA Web server magic on serving the archive, I mean.
18:05 🔗 cm yeah
18:05 🔗 JAA Ah right
18:05 🔗 JAA Well, you can't have both.
18:05 🔗 SJon_____ has joined #archiveteam-bs
18:05 🔗 cm well i dont need the original url to be perfectly preserved within the archive url
18:06 🔗 cm i just need it to be stored somehow, so i can tell if i've already requested a given url
18:06 🔗 JAA Right. wget's filename conversion with -k should be deterministic.
18:10 🔗 cm the man page doesn't fully explain what is done
18:10 🔗 cm i could read the code i guess
18:10 🔗 cm but i would need some way to convert the urls to what wget -k would produce, in order to check if i have gotten the URL already
18:11 🔗 MaximeleG has quit IRC (Quit: MaximeleG)
18:11 🔗 JAA Yeah, that's in convert.c, but I don't know all the details.
18:12 🔗 JAA Should be possible to write a little tool around that which just does the conversion.
18:12 🔗 brayden has quit IRC (Ping timeout: 272 seconds)
18:12 🔗 Aoede has quit IRC (Ping timeout: 272 seconds)
18:12 🔗 Laverne has quit IRC (Ping timeout: 272 seconds)
18:12 🔗 sHATNER has quit IRC (Ping timeout: 272 seconds)
18:13 🔗 Aoede has joined #archiveteam-bs
18:13 🔗 cm well i probably wont be using wget -k anyway
18:13 🔗 cm im archiving podcasts and i don't think wget parses rss feeds
18:14 🔗 cm so i could use my own implementation of the wget -k conversion algo, or just use something like hex encoding for the filenames
18:43 🔗 TC01 has quit IRC (Read error: Operation timed out)
18:46 🔗 TC01 has joined #archiveteam-bs
18:51 🔗 scorche` has joined #archiveteam-bs
18:52 🔗 scorche has quit IRC (Read error: Operation timed out)
18:52 🔗 scorche` is now known as scorche
19:16 🔗 sHATNER has joined #archiveteam-bs
19:17 🔗 brayden has joined #archiveteam-bs
19:17 🔗 Laverne has joined #archiveteam-bs
20:34 🔗 mtntmnky_ has quit IRC (Remote host closed the connection)
20:34 🔗 mtntmnky_ has joined #archiveteam-bs
20:42 🔗 dashcloud has joined #archiveteam-bs
21:11 🔗 lennier1 has quit IRC (Ping timeout: 265 seconds)
21:13 🔗 lennier1 has joined #archiveteam-bs
21:16 🔗 lennier2 has joined #archiveteam-bs
21:18 🔗 BlueMax has joined #archiveteam-bs
21:21 🔗 lennier1 has quit IRC (Read error: Operation timed out)
21:21 🔗 lennier2 is now known as lennier1
21:29 🔗 jshoard has quit IRC (Quit: Leaving)
21:41 🔗 BlueMax has quit IRC (Quit: Leaving)
23:06 🔗 larryv has joined #archiveteam-bs
23:07 🔗 Arcorann has joined #archiveteam-bs
23:25 🔗 Arcorann has quit IRC (Remote host closed the connection)
23:26 🔗 Arcorann has joined #archiveteam-bs
23:50 🔗 HP_Archiv has joined #archiveteam-bs
23:51 🔗 atbk has quit IRC (Quit: ZNC - https://znc.in)
23:54 🔗 atbk has joined #archiveteam-bs

irclogger-viewer