#archiveteam-bs 2020-02-09,Sun

↑back Search

Time Nickname Message
00:26 🔗 Raccoon has joined #archiveteam-bs
00:26 🔗 Craigle has quit IRC (Quit: The Lounge - https://thelounge.chat)
00:30 🔗 RichardG has quit IRC (Ping timeout: 615 seconds)
00:36 🔗 RichardG has joined #archiveteam-bs
01:00 🔗 fredgido has quit IRC (Remote host closed the connection)
01:02 🔗 fredgido has joined #archiveteam-bs
01:38 🔗 HP_Archiv has joined #archiveteam-bs
01:47 🔗 synm0nger has quit IRC (Quit: Wait, what?)
01:48 🔗 SynMonger has joined #archiveteam-bs
02:35 🔗 superkuh_ is now known as superkuh
02:50 🔗 brayden has joined #archiveteam-bs
02:52 🔗 Craigle has joined #archiveteam-bs
02:56 🔗 Craigle has quit IRC (Remote host closed the connection)
03:09 🔗 BlueMax has quit IRC (Remote host closed the connection)
03:10 🔗 BlueMax has joined #archiveteam-bs
03:11 🔗 BlueMax has quit IRC (Remote host closed the connection)
03:14 🔗 BlueMax has joined #archiveteam-bs
03:15 🔗 BlueMax has quit IRC (Remote host closed the connection)
03:15 🔗 BlueMax has joined #archiveteam-bs
03:16 🔗 BlueMax has quit IRC (Remote host closed the connection)
03:17 🔗 BlueMax has joined #archiveteam-bs
03:18 🔗 BlueMax has quit IRC (Remote host closed the connection)
04:20 🔗 odemgi_ has joined #archiveteam-bs
04:23 🔗 odemgi has quit IRC (Ping timeout: 276 seconds)
04:29 🔗 Raccoon has quit IRC (Read error: Connection reset by peer)
04:31 🔗 Raccoon has joined #archiveteam-bs
04:39 🔗 qw3rty_ has joined #archiveteam-bs
04:43 🔗 qw3rty__ has quit IRC (Ping timeout: 276 seconds)
04:50 🔗 synm0nger has joined #archiveteam-bs
04:52 🔗 SynMonger has quit IRC (Read error: Connection reset by peer)
05:05 🔗 Ctrl has quit IRC (Read error: Operation timed out)
05:05 🔗 DogsRNice has quit IRC (Read error: Connection reset by peer)
05:11 🔗 VADemon_ has joined #archiveteam-bs
05:12 🔗 Pixi has joined #archiveteam-bs
05:13 🔗 Pixi` has quit IRC (Read error: Operation timed out)
05:15 🔗 VADemon has quit IRC (Read error: Operation timed out)
05:15 🔗 VADemon has joined #archiveteam-bs
05:23 🔗 VADemon_ has quit IRC (Read error: Operation timed out)
05:24 🔗 Ctrl has joined #archiveteam-bs
05:29 🔗 OrIdow6 I've been making an estimate of the size of BitBucket's Mercurial repositories (goes down June 1), &, barring extreme outliers or anything like that, there look to be maybe a few hundred thousand at the most, with I'd say a few TiB repository data (not sure about the issue tracker &c.)
05:37 🔗 OrIdow6 Looks implausible to play back the repository data itself, in full (i.e., barring the zip or tar files, which don't have everything in them), unless there's some horrible way to capture it all through the web interface (& even if that was possible, it would still be bad not to let people download the .hg file) - this is because the hg clone command uses a fairly complex protocol that only runs on top of HTTP (https://www.mercuria
05:37 🔗 OrIdow6 l-scm.org/wiki/HttpCommandProtocol), and many options (e.g. the --stream option, which has sped up my test downloads by perhaps an order of magnitude, though this still needs more investigation), and I would presume software changes, would break `hg clone https://web.archive.org/...`
05:39 🔗 OrIdow6 The best thing I can think of is to create an IA item for each repo, but I don't know how much they'd like that (& that is not a "I don't think they'd like that" - I don't know, though I obviously have my doubts)
05:45 🔗 Raccoon has quit IRC (Read error: Connection reset by peer)
05:48 🔗 SketchCo1 has joined #archiveteam-bs
05:48 🔗 ranma has joined #archiveteam-bs
05:49 🔗 Raccoon has joined #archiveteam-bs
05:53 🔗 ranma_ has quit IRC (hub.efnet.us efnet.deic.eu)
05:53 🔗 Mateon1 has quit IRC (hub.efnet.us efnet.deic.eu)
05:53 🔗 SketchCow has quit IRC (hub.efnet.us efnet.deic.eu)
05:53 🔗 Maylay has quit IRC (hub.efnet.us efnet.deic.eu)
05:53 🔗 ctrl_ has quit IRC (hub.efnet.us efnet.deic.eu)
05:58 🔗 nicolas17 OrIdow6: IIRC you can "hg clone" from a static bundle file
05:59 🔗 Craigle has joined #archiveteam-bs
06:04 🔗 nicolas17 OrIdow6: ok, turns out you can't "hg clone http://example.com/repo.bundle" but you can "wget http://example.com/repo.bundle && hg clone repo.bundle repo"
06:04 🔗 nicolas17 example: https://nicolas17.s3.amazonaws.com/testhg.bundle
06:07 🔗 OrIdow6 nicolas17: Do you know if BitBucket has public bundle files? They would certainly help implement the scenario I've described (of creating IA items), but my primary question is whether it can be averted
06:08 🔗 nicolas17 I don't think so, but if you can get the normal-format repos you can use 'hg bundle' to turn them into static files you can easily put in IA items
06:12 🔗 Mateon1 has joined #archiveteam-bs
06:21 🔗 ctrl_ has joined #archiveteam-bs
06:25 🔗 HP_Archiv Hey all, wondering if someone might be able to grab each blog entry on this page, eg; from each drop down group. I need a way to pull each entry and easily print out without manually selecting each group, clicking out to each entry, printing, etc.
06:25 🔗 HP_Archiv https://www.joshuakennon.com/i-will-be-re-opening-selected-posts-from-the-private-archives/
06:25 🔗 HP_Archiv Any ideas?
06:28 🔗 NIC007a83 has quit IRC (Ping timeout: 276 seconds)
06:29 🔗 NIC007a83 has joined #archiveteam-bs
06:55 🔗 ctrl_ has quit IRC (hub.efnet.us efnet.deic.eu)
07:03 🔗 Maylay_ has joined #archiveteam-bs
07:20 🔗 d5f4a3622 has quit IRC (https://i.imgur.com/xacQ09F.mp4)
07:21 🔗 marked1 I don't see anything on that blog called a "group", do you mean Category?
07:21 🔗 ctrl_ has joined #archiveteam-bs
07:21 🔗 HP_Archiv marked1, scroll down more
07:22 🔗 HP_Archiv it's after the entry, Group 1 - 9
07:22 🔗 marked1 ok I see it. hold a bit while I see what's triggering that.
07:22 🔗 HP_Archiv Okay sure
07:26 🔗 marked1 Q: you just want a list of urls like https://www.joshuakennon.com/the-power-of-marketing/
07:27 🔗 d5f4a3622 has joined #archiveteam-bs
07:27 🔗 HP_Archiv Well I'm not sure. Maybe - the problem is it's very tedious to open up each one and print. I need to print out all of these
07:27 🔗 HP_Archiv Any idea how to make that process more simple?
07:27 🔗 marked1 print to paper?
07:27 🔗 HP_Archiv Yeah, it's actually work related
07:28 🔗 HP_Archiv I was thinking maybe save each page as a pdf then que the lot of pds for printing, not sure if that's possible though to do batch pdf printing
07:29 🔗 marked1 let's move to #archiveteam-ot
07:29 🔗 HP_Archiv Okay ^^
08:01 🔗 nicolas17 has quit IRC (Ping timeout: 745 seconds)
10:06 🔗 thuban2 is now known as thuban
10:47 🔗 kiska has quit IRC (Remote host closed the connection)
10:47 🔗 Flashfire has quit IRC (Remote host closed the connection)
10:47 🔗 kiska has joined #archiveteam-bs
10:47 🔗 Flashfire has joined #archiveteam-bs
10:48 🔗 svchfoo3 sets mode: +o kiska
10:49 🔗 svchfoo1 sets mode: +o kiska
11:37 🔗 thuban1 has joined #archiveteam-bs
11:39 🔗 thuban has quit IRC (Read error: Operation timed out)
12:14 🔗 klg has quit IRC (brb)
12:23 🔗 klg has joined #archiveteam-bs
13:54 🔗 kiska3 has joined #archiveteam-bs
15:09 🔗 Pixi` has joined #archiveteam-bs
15:10 🔗 fredgido_ has joined #archiveteam-bs
15:14 🔗 fredgido has quit IRC (Ping timeout: 360 seconds)
15:14 🔗 chfoo has quit IRC (Ping timeout: 360 seconds)
15:14 🔗 ranma has quit IRC (Read error: Operation timed out)
15:14 🔗 ranma has joined #archiveteam-bs
15:15 🔗 Pixi has quit IRC (Read error: Operation timed out)
15:15 🔗 PurpleSym has quit IRC (Read error: Connection reset by peer)
15:15 🔗 pie_[bnc] has quit IRC (Ping timeout: 360 seconds)
15:15 🔗 chfoo has joined #archiveteam-bs
15:15 🔗 MrRadar2 has quit IRC (Read error: Connection reset by peer)
15:16 🔗 Igloo has quit IRC (Read error: Connection reset by peer)
15:17 🔗 Ctrl has quit IRC (Ping timeout: 864 seconds)
15:18 🔗 pie_[bnc] has joined #archiveteam-bs
15:18 🔗 nyany has quit IRC (Ping timeout: 360 seconds)
15:19 🔗 voltagex has quit IRC (Ping timeout: 264 seconds)
15:19 🔗 me_ has quit IRC (Read error: Operation timed out)
15:20 🔗 VADemon_ has joined #archiveteam-bs
15:22 🔗 qw3rty_ has quit IRC (Ping timeout: 276 seconds)
15:23 🔗 MrRadar2 has joined #archiveteam-bs
15:23 🔗 qw3rty has joined #archiveteam-bs
15:24 🔗 HP_Archiv has quit IRC (Ping timeout: 276 seconds)
15:24 🔗 PurpleSym has joined #archiveteam-bs
15:26 🔗 HP_Archiv has joined #archiveteam-bs
15:26 🔗 atphoenix has quit IRC (Ping timeout: 276 seconds)
15:28 🔗 alex73_ has joined #archiveteam-bs
15:28 🔗 cf has quit IRC (Read error: Operation timed out)
15:28 🔗 atphoenix has joined #archiveteam-bs
15:30 🔗 superkuh has quit IRC (Excess Flood)
15:30 🔗 Raccoon has quit IRC (Ping timeout: 276 seconds)
15:31 🔗 VADemon has quit IRC (Read error: Operation timed out)
15:31 🔗 Raccoon has joined #archiveteam-bs
15:31 🔗 RKenshin has joined #archiveteam-bs
15:31 🔗 Kenshin has quit IRC (Ping timeout: 276 seconds)
15:31 🔗 odemgi_ has quit IRC (Remote host closed the connection)
15:31 🔗 RKenshin is now known as Kenshin
15:32 🔗 MrRadar2 has quit IRC (Read error: Operation timed out)
15:32 🔗 MrRadar2 has joined #archiveteam-bs
15:32 🔗 odemgi has joined #archiveteam-bs
15:32 🔗 Lord_Nigh has quit IRC (Ping timeout: 268 seconds)
15:33 🔗 Craigle has quit IRC (Ping timeout: 276 seconds)
15:33 🔗 equant_ has joined #archiveteam-bs
15:34 🔗 Dallas has quit IRC (Ping timeout: 276 seconds)
15:34 🔗 Flashfire has quit IRC (Ping timeout: 276 seconds)
15:34 🔗 equant has quit IRC (Ping timeout: 276 seconds)
15:34 🔗 Fionera has quit IRC (Ping timeout: 276 seconds)
15:34 🔗 yano has quit IRC (Read error: Connection reset by peer)
15:35 🔗 yano has joined #archiveteam-bs
15:36 🔗 antomati_ has joined #archiveteam-bs
15:36 🔗 antomatic has quit IRC (Ping timeout: 276 seconds)
15:36 🔗 Hooloovoo has quit IRC (Ping timeout: 276 seconds)
15:36 🔗 dxrt- has joined #archiveteam-bs
15:36 🔗 Hoolootwo has joined #archiveteam-bs
15:36 🔗 Lord_Nigh has joined #archiveteam-bs
15:37 🔗 thuban1 has quit IRC (Ping timeout: 276 seconds)
15:37 🔗 ats_ has quit IRC (Ping timeout: 276 seconds)
15:37 🔗 Fionera has joined #archiveteam-bs
15:38 🔗 dxrt has quit IRC (Ping timeout: 276 seconds)
15:38 🔗 cf has joined #archiveteam-bs
15:38 🔗 thuban1 has joined #archiveteam-bs
15:38 🔗 MrRadar has quit IRC (Read error: Operation timed out)
15:39 🔗 MrRadar2_ has joined #archiveteam-bs
15:39 🔗 Stiletto has quit IRC (Ping timeout: 276 seconds)
15:40 🔗 Stiletto has joined #archiveteam-bs
15:42 🔗 robogoat_ has quit IRC (Ping timeout: 276 seconds)
15:43 🔗 robogoat has joined #archiveteam-bs
15:43 🔗 Stilett0 has joined #archiveteam-bs
15:44 🔗 MrRadar2_ has quit IRC (Remote host closed the connection)
15:44 🔗 MrRadar2 has quit IRC (Read error: Connection reset by peer)
15:44 🔗 MrRadar2_ has joined #archiveteam-bs
15:44 🔗 me- has joined #archiveteam-bs
15:44 🔗 brayden has quit IRC (Read error: Connection reset by peer)
15:44 🔗 OrIdow6 has quit IRC (Ping timeout: 276 seconds)
15:44 🔗 SJon_ has quit IRC (Ping timeout: 276 seconds)
15:44 🔗 superkuh has joined #archiveteam-bs
15:44 🔗 brayden has joined #archiveteam-bs
15:44 🔗 katocala has joined #archiveteam-bs
15:45 🔗 katocala has left
15:46 🔗 Ctrl has joined #archiveteam-bs
15:47 🔗 VoynichCr has quit IRC (Ping timeout: 276 seconds)
15:48 🔗 VoynichCr has joined #archiveteam-bs
15:50 🔗 OrIdow6 has joined #archiveteam-bs
15:50 🔗 Dallas has joined #archiveteam-bs
15:52 🔗 Stiletto has quit IRC (Ping timeout: 745 seconds)
15:57 🔗 thuban2 has joined #archiveteam-bs
15:57 🔗 ats has joined #archiveteam-bs
16:01 🔗 voltagex has joined #archiveteam-bs
16:03 🔗 nyany has joined #archiveteam-bs
16:05 🔗 dashcloud has quit IRC (http://quassel-irc.org - Chat comfortably. Anywhere.)
16:07 🔗 thuban1 has quit IRC (Ping timeout: 745 seconds)
16:10 🔗 MrRadar has joined #archiveteam-bs
16:13 🔗 Igloo has joined #archiveteam-bs
16:14 🔗 svchfoo1 sets mode: +o Igloo
16:14 🔗 svchfoo3 sets mode: +o Igloo
16:55 🔗 thuban3 has joined #archiveteam-bs
16:57 🔗 thuban2 has quit IRC (Ping timeout: 276 seconds)
16:58 🔗 TC01 has joined #archiveteam-bs
17:06 🔗 Dallas has quit IRC (Ping timeout: 276 seconds)
17:06 🔗 MrRadar2_ has quit IRC (Remote host closed the connection)
17:06 🔗 MrRadar2 has joined #archiveteam-bs
17:12 🔗 MrRadar2_ has joined #archiveteam-bs
17:12 🔗 Ctrl has quit IRC (Read error: Operation timed out)
17:12 🔗 MrRadar2 has quit IRC (Remote host closed the connection)
17:13 🔗 Dallas has joined #archiveteam-bs
17:18 🔗 Dallas has quit IRC (Ping timeout: 276 seconds)
17:24 🔗 Craigle has joined #archiveteam-bs
17:31 🔗 Ctrl has joined #archiveteam-bs
17:39 🔗 OrIdow6 has quit IRC (Ping timeout: 276 seconds)
17:40 🔗 OrIdow6 has joined #archiveteam-bs
18:05 🔗 dashcloud has joined #archiveteam-bs
18:19 🔗 synm0nger has quit IRC (Wait, what?)
18:20 🔗 Raccoon` has joined #archiveteam-bs
18:20 🔗 Raccoon has quit IRC (Ping timeout: 258 seconds)
18:20 🔗 Laverne has quit IRC (Ping timeout: 258 seconds)
18:20 🔗 Raccoon` is now known as Raccoon
18:21 🔗 SynMonger has joined #archiveteam-bs
18:22 🔗 sHATNER has quit IRC (Ping timeout: 258 seconds)
18:23 🔗 Gfy has quit IRC (se.hub efnet.portlane.se)
18:23 🔗 SynMonger has quit IRC (Client Quit)
18:24 🔗 SynMonger has joined #archiveteam-bs
18:29 🔗 sHATNER has joined #archiveteam-bs
18:30 🔗 cppchrisc has quit IRC (Ping timeout: 496 seconds)
18:34 🔗 qw3rty_ has joined #archiveteam-bs
18:35 🔗 pie_[bnc] has quit IRC (Read error: Operation timed out)
18:35 🔗 pie_[bnc] has joined #archiveteam-bs
18:35 🔗 Stiletto has joined #archiveteam-bs
18:36 🔗 chfoo has quit IRC (Write error: Broken pipe)
18:36 🔗 chfoo has joined #archiveteam-bs
18:37 🔗 alex73__ has joined #archiveteam-bs
18:37 🔗 Fionera has quit IRC (Read error: Operation timed out)
18:38 🔗 ctrl_ has quit IRC (Read error: Operation timed out)
18:38 🔗 odemgi has quit IRC (Write error: Broken pipe)
18:38 🔗 equant has joined #archiveteam-bs
18:39 🔗 godane has quit IRC (Ping timeout: 360 seconds)
18:39 🔗 ranma_ has joined #archiveteam-bs
18:40 🔗 SynMonger has quit IRC (hub.efnet.us efnet.deic.eu)
18:40 🔗 dashcloud has quit IRC (hub.efnet.us efnet.deic.eu)
18:40 🔗 Stilett0 has quit IRC (hub.efnet.us efnet.deic.eu)
18:40 🔗 equant_ has quit IRC (hub.efnet.us efnet.deic.eu)
18:40 🔗 alex73_ has quit IRC (hub.efnet.us efnet.deic.eu)
18:40 🔗 HP_Archiv has quit IRC (hub.efnet.us efnet.deic.eu)
18:40 🔗 qw3rty has quit IRC (hub.efnet.us efnet.deic.eu)
18:40 🔗 ranma has quit IRC (hub.efnet.us efnet.deic.eu)
18:40 🔗 kiska3 has quit IRC (hub.efnet.us efnet.deic.eu)
18:40 🔗 MeeDee has joined #archiveteam-bs
18:41 🔗 synm0nger has joined #archiveteam-bs
18:42 🔗 odemgi has joined #archiveteam-bs
18:42 🔗 cppchrisc has joined #archiveteam-bs
18:44 🔗 morgandaw has quit IRC (Read error: Operation timed out)
19:00 🔗 dashcloud has joined #archiveteam-bs
19:04 🔗 ctrl_ has joined #archiveteam-bs
19:07 🔗 Kliment has quit IRC (Read error: Connection reset by peer)
19:10 🔗 dashcloud has quit IRC (http://quassel-irc.org - Chat comfortably. Anywhere.)
19:19 🔗 dashcloud has joined #archiveteam-bs
19:22 🔗 dashcloud has quit IRC (Client Quit)
19:30 🔗 Raccoon has quit IRC (Read error: Connection reset by peer)
19:32 🔗 Raccoon has joined #archiveteam-bs
19:37 🔗 thuban4 has joined #archiveteam-bs
19:38 🔗 halt has joined #archiveteam-bs
19:38 🔗 OrIdow6 has quit IRC (Ping timeout: 276 seconds)
19:39 🔗 thuban3 has quit IRC (Ping timeout: 276 seconds)
19:40 🔗 Gfy has joined #archiveteam-bs
19:43 🔗 kiska3 has joined #archiveteam-bs
19:43 🔗 superkuh has quit IRC (hub.efnet.us irc.Prison.NET)
19:43 🔗 Pixi` has quit IRC (hub.efnet.us irc.Prison.NET)
19:43 🔗 SketchCo1 has quit IRC (hub.efnet.us irc.Prison.NET)
19:43 🔗 scorche has quit IRC (hub.efnet.us irc.Prison.NET)
19:43 🔗 Somebody2 has quit IRC (hub.efnet.us irc.Prison.NET)
19:43 🔗 achip has quit IRC (hub.efnet.us irc.Prison.NET)
19:43 🔗 alex73__ Hi. <atphoenix> said that it's right place to ask about warc uploading. I prepared warc of my site that will be offline some time later, and uploaded it into https://archive.org/details/sites-mova.biel-2020-02-09.warc, tagged it as "archiveteam", and by FAQ(https://www.archiveteam.org/index.php?title=Frequently_Asked_Questions) should "let us know so we can move it under the ArchiveTeam collection". So, what is the process to move warc item into
19:43 🔗 alex73__ Archiveteam collection for automatic uploading into wayback machine ?
19:47 🔗 VoynichCr has quit IRC (Ping timeout: 276 seconds)
19:47 🔗 pew has quit IRC (Ping timeout: 276 seconds)
19:47 🔗 atphoenix alex73__, I don't know the proper answer, but I'm sure more senior folks here do know. Responses may take a few hours if they're away for IRL activites.
19:49 🔗 SJon__ has joined #archiveteam-bs
19:52 🔗 Laverne has joined #archiveteam-bs
19:52 🔗 Fionera_ has joined #archiveteam-bs
19:54 🔗 alex73__ thank you
20:20 🔗 superkuh has joined #archiveteam-bs
20:20 🔗 Pixi` has joined #archiveteam-bs
20:20 🔗 SketchCo1 has joined #archiveteam-bs
20:20 🔗 scorche has joined #archiveteam-bs
20:20 🔗 Somebody2 has joined #archiveteam-bs
20:20 🔗 achip has joined #archiveteam-bs
20:20 🔗 irc.Prison.NET sets mode: +o Somebody2
20:23 🔗 VoynichCr has joined #archiveteam-bs
20:27 🔗 Craigle I also don't have the answer to your question, but could you use the Wayback Machine's Save Page Now feature?
20:27 🔗 Craigle https://web.archive.org/save/
20:29 🔗 Craigle I took from your question that the site was still up, so this would be an easy way to add it to the Wayback without having to deal with a manual upload.
20:43 🔗 TC01 has quit IRC (Read error: Operation timed out)
20:43 🔗 TC01 has joined #archiveteam-bs
20:46 🔗 alex73__ Well, this site is not so big, and I can add some hundreds of urls via email, but I'm going to archive some additional sites with tens of thousands pages. It will take too much resources from Save Page processing.
20:47 🔗 JAA We could throw it into ArchiveBot.
20:49 🔗 alex73__ Yes, probably it will be good way. But I would like to know - is process working as described on the https://www.archiveteam.org/index.php?title=Frequently_Asked_Questions ?
20:50 🔗 alex73__ I have enough fast internet connection and could crawl some important sites myself.
21:09 🔗 Stiletto has quit IRC (Ping timeout: 360 seconds)
21:39 🔗 ShellyRol has quit IRC (Read error: Connection reset by peer)
21:41 🔗 ShellyRol has joined #archiveteam-bs
22:04 🔗 nicolas17 has joined #archiveteam-bs
22:09 🔗 mtntmnky_ is now known as mtntmnky
22:23 🔗 mtntmnky has quit IRC (Remote host closed the connection)
22:29 🔗 mtntmnky has joined #archiveteam-bs
23:03 🔗 BlueMax has joined #archiveteam-bs
23:15 🔗 dashcloud has joined #archiveteam-bs
23:21 🔗 JAA alex73__: Yeah, that looks fine. Jason Scott is the person to ask to move things into the AT collection on IA. jason@textfiles.com
23:23 🔗 dashcloud has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.)
23:40 🔗 dashcloud has joined #archiveteam-bs
23:44 🔗 dashcloud has quit IRC (Client Quit)

irclogger-viewer