Time |
Nickname |
Message |
00:10
🔗
|
|
BlueMax has joined #archiveteam-bs |
00:43
🔗
|
|
deevious has joined #archiveteam-bs |
01:13
🔗
|
|
ShellyRol has quit IRC (Remote host closed the connection) |
01:13
🔗
|
|
ShellyRol has joined #archiveteam-bs |
01:18
🔗
|
|
ShellyRol has quit IRC (Read error: Connection reset by peer) |
01:20
🔗
|
|
ShellyRol has joined #archiveteam-bs |
01:23
🔗
|
|
ShellyRol has quit IRC (Read error: Connection reset by peer) |
01:24
🔗
|
|
ShellyRol has joined #archiveteam-bs |
01:32
🔗
|
|
ShellyRol has quit IRC (Read error: Connection reset by peer) |
01:35
🔗
|
|
ShellyRol has joined #archiveteam-bs |
01:40
🔗
|
|
BartoCH has joined #archiveteam-bs |
02:03
🔗
|
|
wyatt8750 has quit IRC (Read error: Operation timed out) |
02:37
🔗
|
|
ivan_ has quit IRC (Leaving) |
02:38
🔗
|
|
ivan_ has joined #archiveteam-bs |
02:38
🔗
|
|
Fusl sets mode: +o ivan_ |
02:38
🔗
|
|
Fusl_ sets mode: +o ivan_ |
02:38
🔗
|
|
svchfoo3 sets mode: +o ivan_ |
02:41
🔗
|
|
ShellyRol has quit IRC (Read error: Connection reset by peer) |
02:43
🔗
|
|
ShellyRol has joined #archiveteam-bs |
02:55
🔗
|
|
ivan- has joined #archiveteam-bs |
02:55
🔗
|
|
ivan_ has quit IRC (Read error: Operation timed out) |
02:55
🔗
|
|
Fusl sets mode: +o ivan- |
02:55
🔗
|
|
Fusl_ sets mode: +o ivan- |
03:00
🔗
|
|
killsushi has quit IRC (Quit: Leaving) |
03:01
🔗
|
|
ivan- is now known as ivan_ |
03:17
🔗
|
|
Dj-Wawa has quit IRC (Quit: Connection closed for inactivity) |
03:27
🔗
|
|
qw3rty113 has joined #archiveteam-bs |
03:31
🔗
|
|
qw3rty112 has quit IRC (Ping timeout: 600 seconds) |
03:49
🔗
|
|
odemgi has joined #archiveteam-bs |
03:50
🔗
|
|
odemg has quit IRC (Read error: Operation timed out) |
03:54
🔗
|
|
odemgi_ has quit IRC (Read error: Operation timed out) |
04:05
🔗
|
|
odemg has joined #archiveteam-bs |
05:03
🔗
|
|
Stiletto has quit IRC (Read error: Operation timed out) |
05:06
🔗
|
|
fuzzy8021 has quit IRC (Ping timeout: 258 seconds) |
05:07
🔗
|
|
fuzzy8021 has joined #archiveteam-bs |
05:13
🔗
|
|
Mateon1 has quit IRC (Ping timeout: 252 seconds) |
05:14
🔗
|
|
Mateon1 has joined #archiveteam-bs |
05:14
🔗
|
|
Stiletto has joined #archiveteam-bs |
05:17
🔗
|
|
d5f4a3622 has quit IRC (Read error: Operation timed out) |
05:20
🔗
|
|
d5f4a3622 has joined #archiveteam-bs |
05:31
🔗
|
|
asdf0101 has quit IRC (The Lounge - https://thelounge.chat) |
05:31
🔗
|
|
markedL has quit IRC (The Lounge - https://thelounge.chat) |
05:31
🔗
|
|
markedL has joined #archiveteam-bs |
05:31
🔗
|
|
asdf0101 has joined #archiveteam-bs |
05:34
🔗
|
|
systwi_ has joined #archiveteam-bs |
05:40
🔗
|
|
systwi has quit IRC (Read error: Operation timed out) |
05:42
🔗
|
|
Dragnog2 has joined #archiveteam-bs |
05:55
🔗
|
|
systwi has joined #archiveteam-bs |
05:59
🔗
|
|
boramalpe has joined #archiveteam-bs |
06:00
🔗
|
|
systwi_ has quit IRC (Read error: Operation timed out) |
06:27
🔗
|
|
Ivy has quit IRC (Quit: Connection closed for inactivity) |
06:42
🔗
|
|
boramalpe has quit IRC (Leaving) |
07:05
🔗
|
|
jspiros_ has joined #archiveteam-bs |
07:05
🔗
|
|
jspiros has quit IRC (Read error: Connection reset by peer) |
07:18
🔗
|
|
fuzzy8021 has quit IRC (Read error: Operation timed out) |
07:19
🔗
|
|
fuzzy8021 has joined #archiveteam-bs |
08:42
🔗
|
|
Dragnog2 has quit IRC (Quit: Connection closed for inactivity) |
09:19
🔗
|
|
h3ndr1k has quit IRC (Quit: h3ndr1k) |
09:20
🔗
|
|
h3ndr1k has joined #archiveteam-bs |
09:27
🔗
|
|
h3ndr1k has quit IRC (Quit: h3ndr1k) |
09:28
🔗
|
|
h3ndr1k has joined #archiveteam-bs |
09:30
🔗
|
|
h3ndr1k has quit IRC (Client Quit) |
09:31
🔗
|
|
h3ndr1k has joined #archiveteam-bs |
09:37
🔗
|
|
h3ndr1k has quit IRC (Quit: h3ndr1k) |
09:38
🔗
|
|
h3ndr1k has joined #archiveteam-bs |
09:42
🔗
|
|
BlueMax has quit IRC (Read error: Connection reset by peer) |
09:58
🔗
|
|
bluefoo has quit IRC (Read error: Operation timed out) |
10:20
🔗
|
|
Video has joined #archiveteam-bs |
10:23
🔗
|
|
bluefoo has joined #archiveteam-bs |
10:31
🔗
|
Video |
So I need some help with some ideas for archiving a certain forum I am a part of. |
10:32
🔗
|
|
bluefoo has quit IRC (Remote host closed the connection) |
10:32
🔗
|
Igloo |
Hi Video |
10:32
🔗
|
Igloo |
Sure |
10:32
🔗
|
Video |
The forum runs on ProBoards, and it contains 7 years worth of threads, replies, attachments, and users. |
10:33
🔗
|
Video |
I don't know how I'd approach saving every single page, because I don't know if it's possible to login as a forum user in order to access certain threads and just user pages in general. |
10:36
🔗
|
|
godane has quit IRC (Ping timeout: 246 seconds) |
10:39
🔗
|
Video |
What would be good software to allow for pages to be accessed while logged in as a forum account? |
10:51
🔗
|
|
godane has joined #archiveteam-bs |
10:53
🔗
|
Igloo |
You'd have to use something like grab-site and provide it the cookies |
11:13
🔗
|
|
godane has quit IRC (Quit: Leaving.) |
11:15
🔗
|
Video |
Alright, thanks! |
11:16
🔗
|
|
Video has quit IRC (Quit: Page closed) |
11:17
🔗
|
|
Soni has quit IRC (se.hub irc.homelien.no) |
11:19
🔗
|
|
Dragnog2 has joined #archiveteam-bs |
11:29
🔗
|
|
Soni has joined #archiveteam-bs |
11:38
🔗
|
odemgi |
magnet:?xt=urn:btih:f871522dac2c3a94f26c9ed79c09b4b1d4633aa7&dn=Epitech |
11:38
🔗
|
odemgi |
https://www.zataz.com/epitek-reveal-des-anonymous-diffusent-des-infos-internes-depitech/ |
11:54
🔗
|
|
odemg has quit IRC (Read error: Operation timed out) |
11:54
🔗
|
|
odemg has joined #archiveteam-bs |
12:14
🔗
|
Flashfire |
Translation |
12:27
🔗
|
eythian |
https://www.linuxjournal.com/content/linux-journal-ceases-publication-awkward-goodbye |
12:28
🔗
|
eythian |
"The website will continue to stay up for the next few weeks, hopefully longer for archival purposes if we can make it happen." |
12:40
🔗
|
|
deevious has quit IRC (Quit: deevious) |
13:04
🔗
|
|
godane has joined #archiveteam-bs |
13:05
🔗
|
|
deevious has joined #archiveteam-bs |
13:09
🔗
|
|
Mateon1 has quit IRC (Mateon1) |
13:10
🔗
|
|
Mateon1 has joined #archiveteam-bs |
13:13
🔗
|
|
bluefoo has joined #archiveteam-bs |
14:39
🔗
|
|
RichardG_ has joined #archiveteam-bs |
14:43
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
15:25
🔗
|
|
omglolba- has quit IRC (Read error: No route to host) |
15:30
🔗
|
|
omglolbah has joined #archiveteam-bs |
15:48
🔗
|
|
marked has joined #archiveteam-bs |
16:05
🔗
|
h3ndr1k |
Is it even possible to archive all linux journal magazines? It seems you need a subscription to access them, but does that include access to all old ones? |
16:14
🔗
|
marked |
The subscribe page implies as such "Current Subscribers - Download back issues of Linux Journal" and links to that download page with pdf's that goes as far back as issue/volume 1 |
16:14
🔗
|
|
marked is now known as marked1 |
16:22
🔗
|
|
Dj-Wawa has joined #archiveteam-bs |
16:25
🔗
|
marked1 |
the subscribe links point to pubservice.com but just says "This Publication is currently not taking orders online. Please try later." |
16:36
🔗
|
marked1 |
weird, sometimes it gives me a pdf and sometimes that link invalid |
17:08
🔗
|
markedL |
seems like a server hiccup then. without changing scripts got 171x pdfs already |
17:49
🔗
|
|
killsushi has joined #archiveteam-bs |
18:11
🔗
|
|
Pixi` has joined #archiveteam-bs |
18:15
🔗
|
|
Pixi has quit IRC (Read error: Operation timed out) |
18:38
🔗
|
|
marked1 has quit IRC (Read error: Operation timed out) |
19:13
🔗
|
|
MrRadar2 has quit IRC (Read error: Operation timed out) |
19:17
🔗
|
|
bithippo has joined #archiveteam-bs |
19:35
🔗
|
|
DigiDigi has quit IRC (Read error: Operation timed out) |
19:40
🔗
|
|
DigiDigi has joined #archiveteam-bs |
19:48
🔗
|
nyany |
h3ndr1k: not sure if anyone sent this to you or not but https://secure2.linuxjournal.com/pdf/dljdownload.php |
19:48
🔗
|
nyany |
knock yourself out |
19:50
🔗
|
h3ndr1k |
nyany: Thanks, that link was posted here multiple times, yesterday I clicked some of the PDFs, but alwasy got redirected to the sebscriptions service. Not sure what I did wrong. A few minutes ago, I downloaded all pdfs, epubs and mobis from there. |
19:51
🔗
|
h3ndr1k |
Currently running httrack on https://secure2.linuxjournal.com/ljarchive/LJ/ |
19:51
🔗
|
h3ndr1k |
It seems to contains all previous issues |
19:52
🔗
|
nyany |
yeah i can tell you that they probably dont give a fuck at this point |
19:55
🔗
|
JAA |
I think they also released a zip with all issues on the last shutdown announcement in Dec 2017. |
19:56
🔗
|
nyany |
That IDK |
19:57
🔗
|
nyany |
Did you see this? https://www.linuxjournal.com/content/linux-journal-ceases-publication-awkward-goodbye |
19:58
🔗
|
JAA |
https://secure2.linuxjournal.com/LJArchive2017.zip is where it was. |
20:04
🔗
|
Igloo |
OK so https://secure2.linuxjournal.com/pdf/dljdownload.php has been opened up since I last checked it. |
20:15
🔗
|
h3ndr1k |
Igloo: Oh, so I did nothing wrong, but they changed it. Yesterday it just forwarded to a subscriptions service I think. |
20:15
🔗
|
Igloo |
Yeah same here h3ndr1k |
20:16
🔗
|
|
bithippo has quit IRC (Textual IRC Client: www.textualapp.com) |
20:40
🔗
|
|
katocala has joined #archiveteam-bs |
20:47
🔗
|
|
mls_ has joined #archiveteam-bs |
20:57
🔗
|
SketchCow |
Boop |
20:57
🔗
|
Igloo |
THere this window is. |
20:57
🔗
|
JAA |
We're grabbing some images, but not all the resolutions, and those are the ones which actually matter. |
20:57
🔗
|
Igloo |
Ah. |
20:58
🔗
|
JAA |
Specifically, there's a <source> tag with a data-srcset (why not just a srcset?) attribute that has the URLs that virtually all users will see, including in the WBM. |
20:58
🔗
|
JAA |
And there's an <img> tag as well, whose src we are grabbing. |
20:58
🔗
|
JAA |
I wish JS had never been invented. |
21:03
🔗
|
SketchCow |
I'm almost inclined for us to do a screenshot-bot here. |
21:03
🔗
|
SketchCow |
Just screenshot the whole fucking thing |
21:04
🔗
|
SketchCow |
Are we in fact grabbing the images, though? A hack can grab them? |
21:04
🔗
|
SketchCow |
Can I see an example |
21:04
🔗
|
SketchCow |
I know, here I am wandering in cold, sorry |
21:05
🔗
|
SketchCow |
But I'm feeling really poorly this week and I at least can aim my ire at something productive |
21:05
🔗
|
Igloo |
The problem is that the tools we have right now can't run javascript properly |
21:05
🔗
|
SketchCow |
I get that |
21:05
🔗
|
JAA |
Don't have any examples handy. I just know it did grab images before, and I had to ignore the improperly parsed srcsets. |
21:05
🔗
|
JAA |
It would probably work fine if wpull's srcset parser wasn't broken. |
21:05
🔗
|
SketchCow |
And I don't care if it is fucking up, as long as someone can make an archive from what we grabbed |
21:05
🔗
|
Igloo |
https://psmag.com/.image/ar_16:9%2Cc_fill%2Ccs_srgb%2Cfl_progressive%2Cg_faces:center%2Cq_auto:good%2Cw_620/MTQ3OTQ3MDE1MDg5Njk0MjY4/1972july-01.jpg |
21:06
🔗
|
Igloo |
(I should probably have checked that first...) |
21:06
🔗
|
JAA |
Well, we're definitely not grabbing the highest-resolution images. |
21:06
🔗
|
JAA |
Not a picture for ants either though, something intermediate. |
21:10
🔗
|
SketchCow |
I realize I'm asking this before a very small deadline |
21:10
🔗
|
SketchCow |
But psmag.com is almost absolutely going to be 100% taken down next week |
21:30
🔗
|
|
Ceiro has joined #archiveteam-bs |
21:53
🔗
|
SketchCow |
In short: i dunked on a handle and he's selling a thing |
21:53
🔗
|
SketchCow |
A twist I didn't expect |
21:53
🔗
|
SketchCow |
M. Night Archives |
21:54
🔗
|
|
super3 has joined #archiveteam-bs |
21:54
🔗
|
super3 |
@SketchCow, hello again! |
21:55
🔗
|
SketchCow |
You SAY that |
22:03
🔗
|
super3 |
@SketchCow, what do you think would be useful to publicly archive? ideally stuff that non-copyrighted |
22:03
🔗
|
SketchCow |
Have you... been in archive team before |
22:04
🔗
|
super3 |
I ran a warrior once for Tumblr, that it. |
22:06
🔗
|
super3 |
I'm a newbie, trying to figure out where is best to start. |
22:07
🔗
|
|
wyatt8750 has joined #archiveteam-bs |
22:12
🔗
|
|
ShellyRol has quit IRC (Read error: Operation timed out) |
22:20
🔗
|
|
DogsRNice has joined #archiveteam-bs |
22:27
🔗
|
super3 |
@kisspunch, are you around? |
22:27
🔗
|
|
ShellyRol has joined #archiveteam-bs |
22:41
🔗
|
|
super3 has quit IRC (Read error: Operation timed out) |