#archiveteam-ot 2019-03-02,Sat

↑back Search

Time Nickname Message
02:05 🔗 Stilett0 has joined #archiveteam-ot
02:06 🔗 Stiletto has quit IRC (Read error: Operation timed out)
02:11 🔗 S1mpbrain has joined #archiveteam-ot
02:11 🔗 SimpBrain has quit IRC (Remote host closed the connection)
02:16 🔗 BlueMax has quit IRC (Read error: Connection reset by peer)
02:29 🔗 BlueMax has joined #archiveteam-ot
03:08 🔗 nataraj has joined #archiveteam-ot
03:27 🔗 nataraj has quit IRC (Read error: Operation timed out)
03:27 🔗 S1mpbrain has quit IRC (Read error: Connection reset by peer)
03:27 🔗 S1mpbrain has joined #archiveteam-ot
03:56 🔗 lag__ has joined #archiveteam-ot
03:56 🔗 S1mpbrain has quit IRC (Remote host closed the connection)
04:00 🔗 VerifiedJ has quit IRC (Ping timeout: 252 seconds)
04:04 🔗 VerifiedJ has joined #archiveteam-ot
04:19 🔗 odemg has quit IRC (Ping timeout: 615 seconds)
04:26 🔗 odemg has joined #archiveteam-ot
05:34 🔗 Stilett0 is now known as Stiletto
05:36 🔗 Despatche has joined #archiveteam-ot
05:51 🔗 wp494 has quit IRC (Read error: Operation timed out)
05:52 🔗 wp494 has joined #archiveteam-ot
05:52 🔗 Despatche has quit IRC (Ping timeout: 624 seconds)
06:53 🔗 VerifiedJ has quit IRC (Ping timeout: 252 seconds)
06:54 🔗 VerifiedJ has joined #archiveteam-ot
07:52 🔗 Oddly has joined #archiveteam-ot
08:31 🔗 Oddly has quit IRC (Ping timeout: 255 seconds)
08:45 🔗 VerifiedJ has quit IRC (Ping timeout: 252 seconds)
08:54 🔗 VerifiedJ has joined #archiveteam-ot
09:30 🔗 LFlare has quit IRC (Quit: Ping timeout (120 seconds))
09:31 🔗 LFlare has joined #archiveteam-ot
12:22 🔗 BlueMax has quit IRC (Quit: Leaving)
14:54 🔗 wp494 has quit IRC (Read error: Operation timed out)
14:54 🔗 wp494 has joined #archiveteam-ot
15:24 🔗 VerifiedJ has quit IRC (Ping timeout: 252 seconds)
16:13 🔗 VerifiedJ has joined #archiveteam-ot
16:23 🔗 SimpBrain has joined #archiveteam-ot
16:23 🔗 lag__ has quit IRC (Read error: Connection reset by peer)
16:23 🔗 SimpBrain has quit IRC (Remote host closed the connection)
17:40 🔗 Kaz Anyone here magic with grafana and wouldn't mind helping me out for a couple of mins please?
17:44 🔗 Mateon1 has quit IRC (Ping timeout: 255 seconds)
17:44 🔗 Mateon1 has joined #archiveteam-ot
17:45 🔗 Fusl Kaz: here
17:46 🔗 Kaz pm?
18:24 🔗 t3 So this question might have been asked many times. Can anyone point me to a general purpose web scraper to collect URLs of PDF files?
18:28 🔗 t3 So if you look at the source for http://www.rubycon.co.jp/en/catalog/capacitors.html, you will find that the links to the PDF files are coded as the following example: `<area shape="rect" coords="41, 239, 108, 290" href="e_pdfs/pzcap/e_PEV.pdf">`.
18:29 🔗 t3 Those are area tags.
18:29 🔗 t3 It seems as though ArchiveBot does not pick them up.
18:30 🔗 t3 Or at least not grab-site.
18:36 🔗 ivan wpull has 'area': {'href': ATTR_HTML},
18:38 🔗 t3 ivan: So I should somehow use wpull?
18:39 🔗 ivan nah I mean grab-site _should_ have worked maybe
18:42 🔗 t3 ivan: Oh, it worked using grab-site for http://www.rubycon.co.jp/.
18:42 🔗 t3 Thanks.
18:49 🔗 kiska1 has quit IRC (Ping timeout (120 seconds))
18:50 🔗 kiska1 has joined #archiveteam-ot
18:56 🔗 kiska1 has quit IRC (Ping timeout (120 seconds))
18:59 🔗 kiska1 has joined #archiveteam-ot
18:59 🔗 kiska1 has quit IRC (Remote host closed the connection)
19:01 🔗 kiska1 has joined #archiveteam-ot
19:01 🔗 JAA Hmm, are you sure about ArchiveBot not picking them up? That would be a bug.
19:01 🔗 JAA t3: ^
19:03 🔗 kiska1 has quit IRC (Remote host closed the connection)
19:04 🔗 kiska1 has joined #archiveteam-ot
19:18 🔗 NickN00b has joined #archiveteam-ot
19:24 🔗 t3 JAA: Well I used `!ao`. Maybe that is why it did not archive the PDFs.
19:33 🔗 Fusl t3: afaik !ao only recurses into inline links, not into html links
19:33 🔗 MR9K has quit IRC (Ping timeout: 264 seconds)
19:36 🔗 MR9K has joined #archiveteam-ot
20:04 🔗 Hani has quit IRC (Read error: Operation timed out)
20:05 🔗 mr_archiv has quit IRC (WeeChat 1.6)
20:10 🔗 mr_archiv has joined #archiveteam-ot
20:42 🔗 Hani has joined #archiveteam-ot
21:11 🔗 Despatche has joined #archiveteam-ot
21:18 🔗 nataraj has joined #archiveteam-ot
21:20 🔗 JAA t3: Yup, Fusl is right. That's a link, so it isn't followed. An !a job (at the right directory level) should retrieve them.
21:53 🔗 nataraj has quit IRC (Read error: Operation timed out)
22:03 🔗 LFlare has quit IRC (west.us.hub irc.mzima.net)
22:03 🔗 Fusl has quit IRC (west.us.hub irc.mzima.net)
22:03 🔗 Soni has quit IRC (west.us.hub irc.mzima.net)
22:03 🔗 fuzzy8021 has quit IRC (west.us.hub irc.mzima.net)
22:03 🔗 tjg1_ has quit IRC (west.us.hub irc.mzima.net)
22:06 🔗 VerifiedJ has quit IRC (Ping timeout: 252 seconds)
22:07 🔗 LFlare has joined #archiveteam-ot
22:07 🔗 Fusl has joined #archiveteam-ot
22:07 🔗 Soni has joined #archiveteam-ot
22:07 🔗 fuzzy8021 has joined #archiveteam-ot
22:07 🔗 tjg1_ has joined #archiveteam-ot
22:11 🔗 Mateon1 has quit IRC (Mateon1)
22:11 🔗 VerifiedJ has joined #archiveteam-ot
22:11 🔗 Mateon1 has joined #archiveteam-ot
22:35 🔗 BlueMax has joined #archiveteam-ot
22:49 🔗 VerifiedJ has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Despatche has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 justas has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Flashfire has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 ranma has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 kiska has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 BlueMax has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 odemg has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Jens has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 bztoot has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 phuz has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 apache2 has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 SketchCow has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 arkiver has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 acridAxid has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 tuluu has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Sanqui has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Jon has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 t3 has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 gandalf has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 ephemer0l has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 jeekl has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 eientei95 has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 hook54321 has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 horkermon has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Fusl_ has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 revi has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 chr1sm has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 HCross has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Vito` has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 bitspill has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 pnJay has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 diggan has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 bakJAA has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Ctrl-S_ has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 deathy has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 DrasticAc has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Meroje has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 kpcyrd has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Hecatz has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Muad-Dib has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 mgrytbak has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 LFlare has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Fusl has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Soni has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 fuzzy8021 has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 tjg1_ has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Hani has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 MR9K has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Frogging has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 noirscape has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 argus has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 jodizzle has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 N4Y has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 VoynichCr has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 MrRadar2 has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Tenebrae has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 BnAboyZ has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 bithippo has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 sknebel has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 robogoat has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 ivan has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Polylith_ has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 JAA has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 simon816 has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Stiletto has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 marked has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 mr_archiv has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 m007a83 has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 kbtoo_ has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 schbirid has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 slyphic has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 kode54 has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 betamax has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 chirlu` has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 MrRadar has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 ats has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 cf has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 voltagex has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 swebb has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 jrwr has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 jspiros has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 erin has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 chfoo has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 zino has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 dxrt has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 _niklas has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Igloo has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 nightpool has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Soulflare has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Mateon1 has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 NickN00b has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 kiska1 has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 benjins has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Odd0002 has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 mal has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 yano has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 sep332 has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 dxrt_ has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 step has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 GLaDOS has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 paul2520 has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Albardin has quit IRC (efnet.portlane.se se.hub)
22:49 🔗 Aoede has quit IRC (efnet.portlane.se se.hub)
22:50 🔗 BlueMax has joined #archiveteam-ot
22:50 🔗 Mateon1 has joined #archiveteam-ot
22:50 🔗 VerifiedJ has joined #archiveteam-ot
22:50 🔗 tjg1_ has joined #archiveteam-ot
22:50 🔗 fuzzy8021 has joined #archiveteam-ot
22:50 🔗 Soni has joined #archiveteam-ot
22:50 🔗 Fusl has joined #archiveteam-ot
22:50 🔗 LFlare has joined #archiveteam-ot
22:50 🔗 Despatche has joined #archiveteam-ot
22:50 🔗 Hani has joined #archiveteam-ot
22:50 🔗 mr_archiv has joined #archiveteam-ot
22:50 🔗 MR9K has joined #archiveteam-ot
22:50 🔗 NickN00b has joined #archiveteam-ot
22:50 🔗 kiska1 has joined #archiveteam-ot
22:50 🔗 odemg has joined #archiveteam-ot
22:50 🔗 Stiletto has joined #archiveteam-ot
22:50 🔗 justas has joined #archiveteam-ot
22:50 🔗 benjins has joined #archiveteam-ot
22:50 🔗 bithippo has joined #archiveteam-ot
22:50 🔗 Flashfire has joined #archiveteam-ot
22:50 🔗 ranma has joined #archiveteam-ot
22:50 🔗 kiska has joined #archiveteam-ot
22:50 🔗 m007a83 has joined #archiveteam-ot
22:50 🔗 Odd0002 has joined #archiveteam-ot
22:50 🔗 mal has joined #archiveteam-ot
22:50 🔗 kbtoo_ has joined #archiveteam-ot
22:50 🔗 marked has joined #archiveteam-ot
22:50 🔗 Jens has joined #archiveteam-ot
22:50 🔗 Muad-Dib has joined #archiveteam-ot
22:50 🔗 Hecatz has joined #archiveteam-ot
22:50 🔗 kpcyrd has joined #archiveteam-ot
22:50 🔗 Meroje has joined #archiveteam-ot
22:50 🔗 DrasticAc has joined #archiveteam-ot
22:50 🔗 deathy has joined #archiveteam-ot
22:50 🔗 Ctrl-S_ has joined #archiveteam-ot
22:50 🔗 bakJAA has joined #archiveteam-ot
22:50 🔗 diggan has joined #archiveteam-ot
22:50 🔗 bitspill has joined #archiveteam-ot
22:50 🔗 Vito` has joined #archiveteam-ot
22:50 🔗 HCross has joined #archiveteam-ot
22:50 🔗 chr1sm has joined #archiveteam-ot
22:50 🔗 pnJay has joined #archiveteam-ot
22:50 🔗 revi has joined #archiveteam-ot
22:50 🔗 Fusl_ has joined #archiveteam-ot
22:50 🔗 horkermon has joined #archiveteam-ot
22:50 🔗 se.hub sets mode: +oooo Muad-Dib bakJAA HCross horkermon
22:50 🔗 hook54321 has joined #archiveteam-ot
22:50 🔗 eientei95 has joined #archiveteam-ot
22:50 🔗 jeekl has joined #archiveteam-ot
22:50 🔗 ephemer0l has joined #archiveteam-ot
22:50 🔗 gandalf has joined #archiveteam-ot
22:50 🔗 t3 has joined #archiveteam-ot
22:50 🔗 Jon has joined #archiveteam-ot
22:50 🔗 Sanqui has joined #archiveteam-ot
22:50 🔗 tuluu has joined #archiveteam-ot
22:50 🔗 acridAxid has joined #archiveteam-ot
22:50 🔗 arkiver has joined #archiveteam-ot
22:50 🔗 SketchCow has joined #archiveteam-ot
22:50 🔗 apache2 has joined #archiveteam-ot
22:50 🔗 phuz has joined #archiveteam-ot
22:50 🔗 bztoot has joined #archiveteam-ot
22:50 🔗 yano has joined #archiveteam-ot
22:50 🔗 sep332 has joined #archiveteam-ot
22:50 🔗 schbirid has joined #archiveteam-ot
22:50 🔗 dxrt_ has joined #archiveteam-ot
22:50 🔗 step has joined #archiveteam-ot
22:50 🔗 GLaDOS has joined #archiveteam-ot
22:50 🔗 paul2520 has joined #archiveteam-ot
22:50 🔗 Albardin has joined #archiveteam-ot
22:50 🔗 sknebel has joined #archiveteam-ot
22:50 🔗 BnAboyZ has joined #archiveteam-ot
22:50 🔗 Tenebrae has joined #archiveteam-ot
22:50 🔗 MrRadar2 has joined #archiveteam-ot
22:50 🔗 VoynichCr has joined #archiveteam-ot
22:50 🔗 N4Y has joined #archiveteam-ot
22:50 🔗 jodizzle has joined #archiveteam-ot
22:50 🔗 argus has joined #archiveteam-ot
22:50 🔗 noirscape has joined #archiveteam-ot
22:50 🔗 Frogging has joined #archiveteam-ot
22:50 🔗 simon816 has joined #archiveteam-ot
22:50 🔗 slyphic has joined #archiveteam-ot
22:50 🔗 kode54 has joined #archiveteam-ot
22:50 🔗 robogoat has joined #archiveteam-ot
22:50 🔗 mgrytbak has joined #archiveteam-ot
22:50 🔗 ivan has joined #archiveteam-ot
22:50 🔗 betamax has joined #archiveteam-ot
22:50 🔗 chirlu` has joined #archiveteam-ot
22:50 🔗 Polylith_ has joined #archiveteam-ot
22:50 🔗 JAA has joined #archiveteam-ot
22:50 🔗 MrRadar has joined #archiveteam-ot
22:50 🔗 ats has joined #archiveteam-ot
22:50 🔗 cf has joined #archiveteam-ot
22:50 🔗 voltagex has joined #archiveteam-ot
22:50 🔗 se.hub sets mode: +oo dxrt_ JAA
22:50 🔗 zino has joined #archiveteam-ot
22:50 🔗 dxrt has joined #archiveteam-ot
22:50 🔗 _niklas has joined #archiveteam-ot
22:50 🔗 Igloo has joined #archiveteam-ot
22:50 🔗 Soulflare has joined #archiveteam-ot
22:50 🔗 nightpool has joined #archiveteam-ot
22:50 🔗 chfoo has joined #archiveteam-ot
22:50 🔗 erin has joined #archiveteam-ot
22:50 🔗 jspiros has joined #archiveteam-ot
22:50 🔗 jrwr has joined #archiveteam-ot
22:50 🔗 swebb has joined #archiveteam-ot
22:50 🔗 Aoede has joined #archiveteam-ot
22:50 🔗 se.hub sets mode: +oooo zino dxrt chfoo Aoede
22:51 🔗 kode54 wow
22:51 🔗 kode54 SJIS encoded
22:52 🔗 VerifiedJ has quit IRC (Ping timeout: 252 seconds)
22:55 🔗 VerifiedJ has joined #archiveteam-ot
23:49 🔗 wp494 has quit IRC (Ping timeout: 265 seconds)
23:50 🔗 wp494 has joined #archiveteam-ot

irclogger-viewer