#archiveteam-ot 2020-02-14,Fri


Time Nickname Message
00:16 🔗 Flashfire has joined #archiveteam-ot
00:17 🔗 britmob_ has quit IRC (Read error: Operation timed out)
00:21 🔗 britmob has joined #archiveteam-ot
00:53 🔗 dashcloud has quit IRC (Ping timeout: 258 seconds)
01:05 🔗 ShellyRol has quit IRC (Read error: Connection reset by peer)
01:16 🔗 logchfoo3 starts logging #archiveteam-ot at Fri Feb 14 01:16:19 2020
01:16 🔗 logchfoo3 has joined #archiveteam-ot
01:31 🔗 OrIdow6 has joined #archiveteam-ot
01:50 🔗 BlueMax has joined #archiveteam-ot
02:28 🔗 dashcloud has joined #archiveteam-ot
02:39 🔗 martini has joined #archiveteam-ot
02:40 🔗 martini has quit IRC (Client Quit)
03:57 🔗 thuban1 has quit IRC (Ping timeout: 258 seconds)
03:58 🔗 thuban1 has joined #archiveteam-ot
03:59 🔗 DogsRNice has quit IRC (Read error: Connection reset by peer)
04:06 🔗 qw3rty_ has joined #archiveteam-ot
04:07 🔗 ShellyRol has quit IRC (Remote host closed the connection)
04:07 🔗 HP_Archiv has quit IRC (Remote host closed the connection)
04:08 🔗 HP_Archiv has joined #archiveteam-ot
04:08 🔗 ShellyRol has joined #archiveteam-ot
04:09 🔗 HP_Archiv has quit IRC (Remote host closed the connection)
04:09 🔗 HP_Archiv has joined #archiveteam-ot
04:14 🔗 qw3rty has quit IRC (Read error: Operation timed out)
04:20 🔗 HP_Archiv has quit IRC (Ping timeout: 610 seconds)
04:21 🔗 HP_Archiv has joined #archiveteam-ot
04:34 🔗 qw3rty__ has joined #archiveteam-ot
04:42 🔗 qw3rty_ has quit IRC (Read error: Operation timed out)
04:44 🔗 BlueMaxim has joined #archiveteam-ot
04:44 🔗 BlueMax has quit IRC (Read error: Connection reset by peer)
04:56 🔗 Panasonic has joined #archiveteam-ot
04:57 🔗 britmob_ has joined #archiveteam-ot
04:58 🔗 Stilett0 has joined #archiveteam-ot
04:59 🔗 benjinss has joined #archiveteam-ot
05:00 🔗 Stiletto has quit IRC (Ping timeout: 317 seconds)
05:00 🔗 jodizzle has quit IRC (Ping timeout: 317 seconds)
05:00 🔗 jodizzle_ has joined #archiveteam-ot
05:01 🔗 dxrt has joined #archiveteam-ot
05:01 🔗 dxrt- has quit IRC (Ping timeout: 316 seconds)
05:01 🔗 kiska18 has quit IRC (Ping timeout: 316 seconds)
05:01 🔗 JAA has quit IRC (Ping timeout: 316 seconds)
05:01 🔗 eythian has quit IRC (Ping timeout: 316 seconds)
05:01 🔗 Larsenv has quit IRC (Ping timeout: 316 seconds)
05:01 🔗 _niklas has quit IRC (Ping timeout: 316 seconds)
05:01 🔗 svchfoo1 has quit IRC (Ping timeout: 316 seconds)
05:01 🔗 simon816 has quit IRC (Ping timeout: 316 seconds)
05:01 🔗 _niklas has joined #archiveteam-ot
05:01 🔗 jodizzle_ is now known as jodizzle
05:02 🔗 svchfoo3 sets mode: +o dxrt
05:02 🔗 eythian has joined #archiveteam-ot
05:04 🔗 britmob has quit IRC (Ping timeout: 538 seconds)
05:04 🔗 Ravenloft has quit IRC (Ping timeout: 538 seconds)
05:04 🔗 wp494 has quit IRC (Ping timeout: 538 seconds)
05:04 🔗 benjinsmi has quit IRC (Ping timeout: 538 seconds)
05:05 🔗 Larsenv has joined #archiveteam-ot
05:06 🔗 dashcloud has quit IRC (Ping timeout: 538 seconds)
05:08 🔗 dashcloud has joined #archiveteam-ot
05:10 🔗 Stiletto has joined #archiveteam-ot
05:10 🔗 svchfoo1 has joined #archiveteam-ot
05:10 🔗 simon816 has joined #archiveteam-ot
05:11 🔗 svchfoo3 sets mode: +o svchfoo1
05:15 🔗 JAA has joined #archiveteam-ot
05:15 🔗 svchfoo3 sets mode: +o JAA
05:15 🔗 AlsoJAA sets mode: +o JAA
05:15 🔗 svchfoo1 sets mode: +o JAA
05:17 🔗 Stilett0 has quit IRC (Ping timeout: 745 seconds)
05:57 🔗 nataraj_ has joined #archiveteam-ot
06:18 🔗 HP_Archiv has quit IRC (Ping timeout: 276 seconds)
06:20 🔗 HP_Archiv has joined #archiveteam-ot
06:43 🔗 Mateon1 has quit IRC (Remote host closed the connection)
06:45 🔗 Mateon1 has joined #archiveteam-ot
06:50 🔗 Flashfire has joined #archiveteam-ot
07:15 🔗 wp494 has joined #archiveteam-ot
07:39 🔗 systwi marked1: Would you happen to know how one would go about deduping a collection of WARCs?
08:15 🔗 DLoader_ has joined #archiveteam-ot
08:27 🔗 DLoader has quit IRC (Ping timeout: 745 seconds)
08:27 🔗 DLoader_ is now known as DLoader
09:10 🔗 jake_test Is there any way to get grab-site to scroll down pages to get more content? I'm attempting to back up a custom forum which only shows one page of content by default.
09:23 🔗 atphoenix infinite scroll sucks both for bots and humans. It along with low contrast flat design are 2 of the worst design memes of the 2010s
11:17 🔗 BlueMaxim has quit IRC (Quit: Leaving)
11:21 🔗 marked1 systwi are you asking about comparing between multiple .warc or within a single file? some of the warrior code has a dedup function using python/warcio
11:56 🔗 NIC007a83 has quit IRC (Read error: Operation timed out)
11:58 🔗 NIC007a83 has joined #archiveteam-ot
12:24 🔗 JAA jake_test: Generally speaking, grab-site (or rather, wpull) can't do that. You'll have to either use something browser-based (e.g. brozzler, crocoite, webrecorder) or reverse-engineer how the scrolling works and then emulate that with the tool of your choice.
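A rough sketch of the "reverse-engineer and emulate" route JAA mentions, assuming (hypothetically) that the forum loads further content through a page-number query parameter. The parameter name `page`, the helper `page_urls`, and the example URL are illustrative only and not taken from any real site; the resulting list would then be handed to whichever crawler is in use.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def page_urls(base_url, first_page, last_page, param="page"):
    """Yield one URL per page number.

    Assumption: the site exposes each "scroll" step as a plain
    page-number query parameter. Real sites may use offsets, cursors,
    or POST requests instead.
    """
    parts = urlsplit(base_url)
    # Keep existing query parameters, dropping any stale page parameter.
    query = [(k, v) for k, v in parse_qsl(parts.query, keep_blank_values=True)
             if k != param]
    for n in range(first_page, last_page + 1):
        new_query = urlencode(query + [(param, str(n))])
        yield urlunsplit((parts.scheme, parts.netloc, parts.path,
                          new_query, parts.fragment))


if __name__ == "__main__":
    for url in page_urls("https://forum.example/thread?id=7", 1, 3):
        print(url)
```

If the site uses a cursor or token returned by the previous response, a static list like this is not enough and each request has to be derived from the one before it.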
13:11 🔗 eythian has quit IRC (Read error: Connection reset by peer)
13:13 🔗 eythian has joined #archiveteam-ot
13:17 🔗 Dallas has joined #archiveteam-ot
13:21 🔗 Stilett0 has joined #archiveteam-ot
13:27 🔗 Stiletto has quit IRC (Read error: Operation timed out)
13:29 🔗 ranma_ has quit IRC ()
13:35 🔗 Ravenloft has joined #archiveteam-ot
13:35 🔗 Panasonic has quit IRC (Read error: Connection reset by peer)
14:20 🔗 synm0nger has quit IRC (Quit: Wait, what?)
14:20 🔗 SynMonger has joined #archiveteam-ot
14:46 🔗 josey Anyone know how to use user-service-manuals.com? I'm guessing it's an FTP server.
15:54 🔗 superkuh has quit IRC (Quit: the neuronal action potential is an electrical manipulation of reversible abrupt phase changes in the lipid bilaye)
15:57 🔗 DogsRNice has joined #archiveteam-ot
16:10 🔗 davis1 has joined #archiveteam-ot
16:17 🔗 Ryz !ignore c0eu41udh9y34pqnajti5o5bn ^https?://talk\.sonymobile\.com/t5/media/v4/gallerypage\.liabase
16:17 🔗 Ryz Ugh~
16:31 🔗 josey has quit IRC (Quit: WeeChat 2.7)
16:35 🔗 scorche has quit IRC (Read error: Operation timed out)
16:37 🔗 scorche has joined #archiveteam-ot
17:22 🔗 thuban2 has joined #archiveteam-ot
17:25 🔗 thuban1 has quit IRC (Ping timeout: 258 seconds)
17:26 🔗 systwi marked1: I'm looking to dedupe for both, I guess. The WARCs I have will be played back using openwayback and I thought it would save space to only keep one copy of a file inside of the entire collection of WARCs (e.g. the Archive Team logo will only be kept once throughout all WARCs in my collection, so if one WARC from 2017 and another from 2018 both use the logo, only one copy is kept.)
17:26 🔗 systwi I hope that makes sense
17:26 🔗 superkuh has joined #archiveteam-ot
17:27 🔗 systwi I think of how MAME works with parent ROMs. In order to use a specific revision of a game, you need the files for the parent as well.
18:19 🔗 Frogging https://old.reddit.com/r/dataisbeautiful/comments/ez13dv/oc_quadratic_coronavirus_epidemic_growth_model/fh7c1uk/
18:19 🔗 Frogging um, oops
18:19 🔗 Frogging I didn't mean to post that here
18:19 🔗 Frogging but I guess it's generally interesting, so have fun?
19:13 🔗 icedice2 has joined #archiveteam-ot
19:16 🔗 MRX3 has joined #archiveteam-ot
19:17 🔗 icedice has quit IRC (Read error: Operation timed out)
19:19 🔗 icedice has joined #archiveteam-ot
19:20 🔗 icedice2 has quit IRC (Ping timeout: 276 seconds)
19:21 🔗 icedice2 has joined #archiveteam-ot
19:22 🔗 icedice2 has quit IRC (Client Quit)
19:22 🔗 MRX3 has quit IRC (Ping timeout: 276 seconds)
19:26 🔗 icedice has quit IRC (Read error: Operation timed out)
19:30 🔗 Stilett0 is now known as Stiletto
19:30 🔗 thuban2 has quit IRC (Read error: Operation timed out)
20:16 🔗 thuban2 has joined #archiveteam-ot
20:50 🔗 icedice has joined #archiveteam-ot
21:03 🔗 SJon___ has joined #archiveteam-ot
21:06 🔗 SJon__ has quit IRC (Read error: Operation timed out)
21:06 🔗 SJon___ is now known as SJon__
21:39 🔗 marked1 systwi : what exists reads one file and writes one file https://github.com/ArchiveTeam/tinypic-grab/blob/2f985620e64b56488dc27af70410785ad7d03000/pipeline.py#L135
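A minimal sketch of the cross-WARC idea systwi describes, not the code in the linked pipeline.py. It only plans the dedup: each payload is hashed, the first occurrence is kept, and later identical payloads are marked to become WARC `revisit` records pointing back at that first copy. The function name `plan_dedup` and the `(warc_name, uri, payload)` tuples are assumptions for illustration; actually reading and rewriting WARCs would need a library such as warcio, and whether a given playback tool resolves revisit records across files should be checked separately.

```python
import hashlib


def plan_dedup(records):
    """Decide, per record, whether to keep the payload or reference an earlier copy.

    records: iterable of (warc_name, uri, payload_bytes), in the order
             the collection should be processed (hypothetical input shape).
    Returns a list of (warc_name, uri, action, original), where action is
    "keep" or "revisit" and original is the (warc_name, uri) of the first
    copy for revisits, else None.
    """
    first_seen = {}  # payload digest -> (warc_name, uri) of first occurrence
    plan = []
    for warc_name, uri, payload in records:
        digest = hashlib.sha1(payload).hexdigest()
        if digest in first_seen:
            plan.append((warc_name, uri, "revisit", first_seen[digest]))
        else:
            first_seen[digest] = (warc_name, uri)
            plan.append((warc_name, uri, "keep", None))
    return plan


if __name__ == "__main__":
    logo = b"\x89PNG fake logo bytes"
    sample = [
        ("crawl-2017.warc.gz", "http://example.org/logo.png", logo),
        ("crawl-2017.warc.gz", "http://example.org/index.html", b"<html>2017</html>"),
        ("crawl-2018.warc.gz", "http://example.org/logo.png", logo),
    ]
    for row in plan_dedup(sample):
        print(row)
```

Like MAME's parent ROMs, the WARC holding the first copy has to stay in the collection for the revisit records to remain replayable.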
21:52 🔗 ivan has quit IRC (Quit: Leaving)
21:55 🔗 ivan has joined #archiveteam-ot
21:56 🔗 Larsenv trying to upload some crap to archive.org but it shows "Please wait while your page is being created"
21:56 🔗 Larsenv and a very large amount of tasks to come that are growing and shrinking
21:57 🔗 Larsenv SketchCow do you know?
21:57 🔗 Larsenv This is why I hate uploading to archive.org
21:57 🔗 Raccoon are you sure it's crap and not something useful?
21:58 🔗 Larsenv https://p33.f1.n0.cdn.getcloudapp.com/items/6quBRe1B/Image+2020-02-14+at+3.57.22+PM.png?v=7fec9b05a3e554dcb52970ef50694aed
21:58 🔗 Larsenv no it's useful *rolls eyes*
22:04 🔗 Larsenv is it ok to leave the tab when it's running the queue?
22:15 🔗 BlueMax has joined #archiveteam-ot
22:31 🔗 SootBectr Anyone know a way to use a firefox keyword with WBM? I made a bookmark in my usual fashion https://web.archive.org/web/*/%s but using it urlencodes the string, e.g. https://web.archive.org/web/*/http%3A%2F%2Fexample.com%2F which is no good.
22:32 🔗 SootBectr Perhaps a better question: does WBM search accept a ?query string somehow?
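To illustrate what goes wrong with the keyword bookmark: percent-encoding the whole target URL turns `:` and `/` into `%3A` and `%2F`, which reproduces the broken link quoted above, and decoding it again restores the working form. This is only a sketch with Python's standard library; the helper name `calendar_url` is made up here, and it says nothing about how Firefox or the Wayback Machine handle the string internally.

```python
from urllib.parse import quote, unquote

WAYBACK_PREFIX = "https://web.archive.org/web/*/"


def calendar_url(target):
    """Build a Wayback calendar URL from a target that may be percent-encoded."""
    return WAYBACK_PREFIX + unquote(target)


if __name__ == "__main__":
    target = "http://example.com/"
    # What a fully URL-encoding keyword substitution produces:
    encoded = WAYBACK_PREFIX + quote(target, safe="")
    print(encoded)
    # Decoding the substituted part gives the form that works:
    print(calendar_url(quote(target, safe="")))
```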
22:38 🔗 asie4 has joined #archiveteam-ot
23:19 🔗 godane has quit IRC (Quit: Leaving.)
23:22 🔗 nataraj_ has quit IRC (Read error: Operation timed out)
23:27 🔗 godane has joined #archiveteam-ot
23:33 🔗 thuban3 has joined #archiveteam-ot
23:37 🔗 thuban2 has quit IRC (Ping timeout: 276 seconds)
