Time |
Nickname |
Message |
00:13
π
|
|
cerca has joined #archiveteam-bs |
00:30
π
|
|
mtntmnky has quit IRC (Remote host closed the connection) |
00:30
π
|
|
mtntmnky has joined #archiveteam-bs |
00:43
π
|
|
Atom__ has quit IRC (Ping timeout: 276 seconds) |
00:44
π
|
|
Atom__ has joined #archiveteam-bs |
01:00
π
|
godane |
SketchCow: i'm uploading the original cbz files that i made the mobile beat pdfs from |
01:09
π
|
|
ShellyRol has quit IRC (Ping timeout: 496 seconds) |
01:22
π
|
|
ShellyRol has joined #archiveteam-bs |
01:25
π
|
|
mtntmnky has quit IRC (Remote host closed the connection) |
01:26
π
|
|
mtntmnky has joined #archiveteam-bs |
01:29
π
|
|
VerifiedJ has quit IRC (Quit: Leaving) |
01:37
π
|
|
britmob has quit IRC (Read error: Connection reset by peer) |
01:40
π
|
|
britmob has joined #archiveteam-bs |
01:58
π
|
|
Jens has quit IRC (Remote host closed the connection) |
01:58
π
|
|
Jens has joined #archiveteam-bs |
02:08
π
|
|
godane has quit IRC (Ping timeout: 246 seconds) |
02:23
π
|
|
godane has joined #archiveteam-bs |
02:28
π
|
|
odemg has quit IRC (Quit: Leaving) |
02:41
π
|
atphoenix |
nicolas17, OrIdow6 from my quick look at po.st, it didn't appear to offer shortened URLs to the general public. Rather it seemed to be more so a traffic-tracker, possibly for emails in inside a company's own website. If a company that used po.st was still maintaining it's website, I would imagine they'll fix their URLs. I think losing bit.ly or tinyurl.com would be easily be worse. |
02:42
π
|
OrIdow6 |
I don't think they were for emails |
02:42
π
|
OrIdow6 |
It was primarily to track links from social media etc. |
02:45
π
|
OrIdow6 |
-> a lot of public links |
02:46
π
|
hook54321 |
OrIdow6: do you know who owns the company? |
02:46
π
|
OrIdow6 |
Big loss, I should expect, is from customers using a custom domain, e.g. tylt.it (which not returns 404s on everything) |
02:46
π
|
OrIdow6 |
*now |
02:48
π
|
OrIdow6 |
hook54321: https://www.rhythmone.com/ - I do not have any special knowledge about them except by poking about their website |
02:55
π
|
hook54321 |
I think it's this https://en.wikipedia.org/wiki/RadiumOne |
02:59
π
|
|
Maylay has joined #archiveteam-bs |
03:00
π
|
atphoenix |
https://web.archive.org/web/20180813105249/https://blog.po.st/ is RadiumOne |
03:02
π
|
atphoenix |
BUT this references po.st also https://www.rhythmone.com/privacy-policy |
03:02
π
|
atphoenix |
cross ownership or ownership changes likely |
03:05
π
|
OrIdow6 |
As I said a day ago in #urlteam, the shutdown notice said that RythemOne were the owners at that time |
03:06
π
|
OrIdow6 |
"that time" being a day ago |
03:09
π
|
atphoenix |
here is the answer: https://www.cmo.com.au/article/621238/adtech-company-rhythmone-acquires-radiumone/ |
03:11
π
|
hook54321 |
I'll send them an email |
03:24
π
|
|
Raccoon` has joined #archiveteam-bs |
03:24
π
|
|
Raccoon has quit IRC (Ping timeout: 258 seconds) |
03:24
π
|
|
Raccoon` is now known as Raccoon |
03:29
π
|
|
AeonG has quit IRC (Read error: Operation timed out) |
03:43
π
|
|
Raccoon has quit IRC (Ping timeout: 258 seconds) |
04:40
π
|
|
odemgi has joined #archiveteam-bs |
04:44
π
|
|
odemgi_ has quit IRC (Read error: Operation timed out) |
04:58
π
|
|
qw3rty__ has joined #archiveteam-bs |
05:05
π
|
|
qw3rty_ has quit IRC (Read error: Operation timed out) |
05:14
π
|
|
kiska18 has quit IRC (Read error: Operation timed out) |
05:14
π
|
|
kiska18 has joined #archiveteam-bs |
05:15
π
|
|
svchfoo3 sets mode: +o kiska18 |
05:15
π
|
|
svchfoo1 sets mode: +o kiska18 |
05:32
π
|
hook54321 |
atphoenix: do you know what the storage limit is for users using Frontier's FTP service? |
05:32
π
|
atphoenix |
I think 25 mb FTP and 25 mb web |
05:33
π
|
atphoenix |
Every Frontier customer starts off with 25MB for their web space and 25MB for their emailβdepending on their service plan. |
05:33
π
|
atphoenix |
per https://frontier.com/helpcenter/categories/online-services/advanced-features/upload-my-web-site |
05:33
π
|
atphoenix |
that URL lists the naming patterns to expect |
05:33
π
|
atphoenix |
and other related domains too |
05:34
π
|
atphoenix |
If your email address ends with⦠Your public files are available at⦠|
05:34
π
|
atphoenix |
@frontier.com ftp.frontier.com/pub/users/yourusername |
05:34
π
|
atphoenix |
@frontiernet.net ftp.frontiernet.net/pub/users/yourusername |
05:34
π
|
atphoenix |
@citlink.net ftp.citlink.net/pub/users/yourusername |
05:34
π
|
atphoenix |
@newnorth.net ftp.newnorth.net/pub/users/yourusername |
05:34
π
|
atphoenix |
@epix.net ftp1.epix.net/pub/users/yourusername |
05:34
π
|
atphoenix |
@gvni.com ftp://username@gvni.com/pub/users/yourusername |
05:34
π
|
atphoenix |
I recognize some of those various domains as companies Frontier ingested |
05:36
π
|
atphoenix |
so far all the sites I've found hosted on Frontier are simple pages that should be AB-friendly |
05:37
π
|
atphoenix |
except for the ftp, that is |
05:37
π
|
atphoenix |
and that caveat about the site throwing errors on read attempts |
05:37
π
|
atphoenix |
(sometimes) |
05:42
π
|
OrIdow6 |
https://transfer.notkiska.pw/FMcrL/Frontier_myplace_all_users_pages https://transfer.notkiska.pw/L0owC/Frontiernet_all_users_pages |
05:43
π
|
OrIdow6 |
Userlists so far |
05:43
π
|
OrIdow6 |
Not all are still alive |
05:47
π
|
atphoenix |
frontier ftp is slow. 13 mb file 4 minutes eta |
05:48
π
|
OrIdow6 |
What are you downloading? |
05:49
π
|
atphoenix |
I'm poking around in ftp://ftp.frontier.com/pub/users/usnraptor/Fighters%20Anthology/Music/ |
05:52
π
|
atphoenix |
the big file is a sound mod for the game. The small file contains many MIDIs. |
06:00
π
|
|
cerca has quit IRC (Remote host closed the connection) |
06:14
π
|
|
Raccoon has joined #archiveteam-bs |
06:23
π
|
|
ShellyRol has quit IRC (Read error: Connection reset by peer) |
06:23
π
|
|
ShellyRol has joined #archiveteam-bs |
06:32
π
|
|
Atom-- has joined #archiveteam-bs |
06:36
π
|
|
Atom__ has quit IRC (Ping timeout: 276 seconds) |
06:42
π
|
|
ShellyRol has quit IRC (Ping timeout: 610 seconds) |
06:42
π
|
|
ShellyRol has joined #archiveteam-bs |
06:46
π
|
|
nicolas17 has quit IRC (Ping timeout: 746 seconds) |
06:50
π
|
|
RichardG has quit IRC (Ping timeout: 615 seconds) |
07:51
π
|
atphoenix |
SketchCow, tubeup means throw into youtubearchive? |
07:53
π
|
dxrt |
I imagine they need to actually be on IA, so tubeup. |
07:54
π
|
atphoenix |
https://www.youtube.com/user/madbitcoins is currently on YT |
07:55
π
|
atphoenix |
all 3 are on YT currently |
07:55
π
|
dxrt |
are you familiar with tubeup? https://github.com/bibanon/tubeup |
07:55
π
|
dxrt |
It rips them from youtube and uploads to IA |
07:56
π
|
atphoenix |
no, not familiar with that. Only familiar with ivan's archive |
07:57
π
|
atphoenix |
well, I mean I've heard that YT->IA is on hold because junk was getting sent in |
07:59
π
|
|
closure has quit IRC (Read error: Connection reset by peer) |
08:01
π
|
dxrt |
yeah, risky to rely on it IMO. |
08:03
π
|
atphoenix |
I'll submit to Ivan's youtubearchiver under assumption these are at risk. |
08:04
π
|
atphoenix |
"tubeup uses youtube-dl to download a Youtube video (or any other provider supported by youtube-dl), and then uploads it with all metadata to the Internet Archive." |
08:07
π
|
atphoenix |
1 of the 3 channels was already in ivan's archive. I don't know how to move anything to IA, so I'll leave that to someone else. |
09:11
π
|
|
amelia386 has quit IRC () |
09:11
π
|
|
amelia386 has joined #archiveteam-bs |
10:23
π
|
|
picklefac has quit IRC () |
10:23
π
|
|
picklefac has joined #archiveteam-bs |
10:26
π
|
|
LowLevelM has quit IRC (Read error: Operation timed out) |
10:31
π
|
|
ibachandl has quit IRC (Remote host closed the connection) |
10:31
π
|
|
Dallas has quit IRC (Read error: Connection reset by peer) |
10:32
π
|
|
ibachandl has joined #archiveteam-bs |
10:34
π
|
|
marked1 has quit IRC (Read error: Connection reset by peer) |
10:34
π
|
|
Maylay has quit IRC (Ping timeout: 276 seconds) |
10:36
π
|
|
atphoenix has quit IRC (Ping timeout: 276 seconds) |
10:37
π
|
|
RichardG has joined #archiveteam-bs |
10:37
π
|
|
atphoenix has joined #archiveteam-bs |
10:39
π
|
|
Maylay has joined #archiveteam-bs |
10:39
π
|
|
OrIdow6 has quit IRC (Ping timeout: 276 seconds) |
10:52
π
|
|
BlueMax has quit IRC (Read error: Connection reset by peer) |
10:58
π
|
|
DFJustin has quit IRC (Ping timeout: 745 seconds) |
11:01
π
|
|
OrIdow6 has joined #archiveteam-bs |
11:31
π
|
|
ShellyRol has quit IRC (Read error: Connection reset by peer) |
11:35
π
|
|
ShellyRol has joined #archiveteam-bs |
12:04
π
|
|
OrIdow6 has quit IRC (Ping timeout: 276 seconds) |
12:21
π
|
|
ibachandl has quit IRC (Ping timeout: 610 seconds) |
13:22
π
|
|
OrIdow6 has joined #archiveteam-bs |
13:46
π
|
|
closure has joined #archiveteam-bs |
15:13
π
|
|
schbirid has joined #archiveteam-bs |
15:14
π
|
|
VerifiedJ has joined #archiveteam-bs |
15:15
π
|
|
X-Scale has quit IRC (Read error: Operation timed out) |
15:35
π
|
|
marked1 has joined #archiveteam-bs |
15:40
π
|
|
anarcat has quit IRC (se.hub irc.efnet.nl) |
15:40
π
|
|
d5f4a3622 has quit IRC (se.hub irc.efnet.nl) |
15:40
π
|
|
brayden_ has quit IRC (se.hub irc.efnet.nl) |
15:40
π
|
|
Tenebrae has quit IRC (se.hub irc.efnet.nl) |
15:40
π
|
|
PurpleSym has quit IRC (se.hub irc.efnet.nl) |
15:40
π
|
|
MrRadar2 has quit IRC (se.hub irc.efnet.nl) |
15:40
π
|
|
anarcat has joined #archiveteam-bs |
15:40
π
|
|
d5f4a3622 has joined #archiveteam-bs |
15:40
π
|
|
brayden_ has joined #archiveteam-bs |
15:40
π
|
|
Tenebrae has joined #archiveteam-bs |
15:40
π
|
|
PurpleSym has joined #archiveteam-bs |
15:40
π
|
|
MrRadar2 has joined #archiveteam-bs |
15:40
π
|
|
irc.efnet.nl sets mode: +o PurpleSym |
16:02
π
|
|
Ryz has quit IRC (Remote host closed the connection) |
16:02
π
|
|
kiska18 has quit IRC (Remote host closed the connection) |
16:03
π
|
|
kiska18 has joined #archiveteam-bs |
16:03
π
|
|
svchfoo3 sets mode: +o kiska18 |
16:03
π
|
|
Ryz has joined #archiveteam-bs |
16:03
π
|
|
svchfoo1 sets mode: +o kiska18 |
16:29
π
|
|
Dallas has joined #archiveteam-bs |
17:15
π
|
SketchCow |
What is all this |
17:24
π
|
|
ibachandl has joined #archiveteam-bs |
17:42
π
|
astrid |
you remember how you darked a bunch of youtube uploads |
17:42
π
|
astrid |
well, noindex'd |
18:05
π
|
|
Harzilein has joined #archiveteam-bs |
18:34
π
|
|
ibachand has joined #archiveteam-bs |
18:34
π
|
|
ibachandl has quit IRC (Read error: Connection reset by peer) |
19:00
π
|
|
DLoader_ has joined #archiveteam-bs |
19:08
π
|
|
DLoader has quit IRC (Ping timeout: 745 seconds) |
19:08
π
|
|
DLoader_ is now known as DLoader |
19:24
π
|
|
DFJustin has joined #archiveteam-bs |
19:36
π
|
|
icedice has joined #archiveteam-bs |
19:37
π
|
|
Craigle has quit IRC (Quit: The Lounge - https://thelounge.chat) |
19:37
π
|
|
Craigle has joined #archiveteam-bs |
20:06
π
|
|
ibachandl has joined #archiveteam-bs |
20:12
π
|
|
kiska has quit IRC (Remote host closed the connection) |
20:13
π
|
|
kiska has joined #archiveteam-bs |
20:13
π
|
|
ibachand has quit IRC (Read error: Operation timed out) |
20:13
π
|
|
Flashfire has joined #archiveteam-bs |
20:13
π
|
|
svchfoo1 sets mode: +o kiska |
20:13
π
|
|
svchfoo3 sets mode: +o kiska |
20:16
π
|
|
britmob has quit IRC (Read error: Connection reset by peer) |
20:29
π
|
abstract |
arduino is a pretty widely used framework for iot stuff, both hobbyist and production |
20:29
π
|
atphoenix |
^context? |
20:29
π
|
abstract |
it has a central archive of packages like cpan/pypi, but given iot stuff has a tendency to break/abandon/shutdown, it might be worth grabbing these packages |
20:29
π
|
|
is- has quit IRC (Ping timeout: 496 seconds) |
20:29
π
|
abstract |
what do people think about that? |
20:30
π
|
abstract |
i have a line to download them all but i dont have any archival disk space |
20:31
π
|
abstract |
(ie i can put in the effort, i just need to know where is appropriate to put them, im not sure IA is the right place to save artefacts of code) |
20:31
π
|
atphoenix |
OrIdow6, this looks like a useful tool to find Frontier customer homepages. http://scraperr.com/ |
20:31
π
|
abstract |
left-pad but for iot is probably a bad thing |
20:32
π
|
atphoenix |
abstract, IA has github stuff and other code stuff in it |
20:32
π
|
abstract |
neat |
20:32
π
|
atphoenix |
if something is web-scrapable, IA SPN http://web.archive.org/save can be used by anyone to archive URLs |
20:34
π
|
abstract |
it's not, its an index of zips containing code, examples, help files, etc |
20:34
π
|
atphoenix |
if you have a link to the ZIP, you can put the link into SPN |
20:35
π
|
abstract |
i have a 11k links |
20:35
π
|
abstract |
s/a// |
20:36
π
|
atphoenix |
there is an SPN email submission option too. We also have tools that can take in long lists of links |
20:37
π
|
abstract |
$ wget -q -O - https://downloads.arduino.cc/libraries/library_index.json | jq ".libraries[].url" | wc -l |
20:37
π
|
abstract |
11012 |
20:37
π
|
abstract |
each libraries entry also has metadata like author, short description, homepage, repo, etc |
20:40
π
|
atphoenix |
https://blog.archive.org/2019/10/23/the-wayback-machines-save-page-now-is-new-and-improved/ says Have you ever wanted to archive all the web pages linked from an email message? Well, you are in luck because now you can forward that email to βsavepagenow@archive.orgβ and after a few minutes you will get an email back filled with Wayback Machine playback URLs. |
20:42
π
|
abstract |
cool, so i can spam it with 11,000 links, but will they be properly archived under a collection with metadata? im down for this i just dont know the tooling for doing so |
20:43
π
|
abstract |
https://paste.debian.net/1126573/ |
20:43
π
|
abstract |
bad archiving is good but good quality archiving is surely best |
20:43
π
|
atphoenix |
I do not know the limits to the email-based submission |
20:44
π
|
atphoenix |
I have heard it takes longer to reply for long lists |
20:44
π
|
atphoenix |
you might try lists of say 100 and see how it goes |
20:44
π
|
atphoenix |
and work upwards from there if it works as expected |
20:45
π
|
atphoenix |
items submitted via SPN end up in the Wayback Machine |
21:00
π
|
atphoenix |
abstract, what were you trying to demonstrate with the paste.debian link? |
21:00
π
|
abstract |
all the metadata i have |
21:00
π
|
abstract |
* 11k |
21:01
π
|
abstract |
https://archive.org/services/docs/api/internetarchive/index.html looks useful |
21:09
π
|
marked1 |
is there no html index of all the downloads? |
21:12
π
|
|
nicolas17 has joined #archiveteam-bs |
21:17
π
|
atphoenix |
OrIdow6, seems the search scraper I listed above isn't working very well right now. I have found something else (python script) intended for the same purpose https://github.com/NikolaiT/GoogleScraper . That github links what I guess is a commercial implementation https://scrapeulous.com/ that offers 500 free searches per month |
21:23
π
|
atphoenix |
https://scrapeulous.com/about/ says As of 2019, GoogleScraper is replaced by a modern successor named se-scraper that builds on top of puppeteer and headless Chromium browser. |
21:23
π
|
atphoenix |
https://github.com/NikolaiT/se-scraper |
21:24
π
|
|
Raccoon` has joined #archiveteam-bs |
21:25
π
|
|
OrIdow6 has quit IRC (Ping timeout: 276 seconds) |
21:26
π
|
|
Flashfire has quit IRC (Ping timeout: 276 seconds) |
21:26
π
|
|
Dallas has quit IRC (Ping timeout: 276 seconds) |
21:27
π
|
abstract |
marked1, nah, they have a custom tool in the IDE for managing them |
21:28
π
|
|
Raccoon has quit IRC (Ping timeout: 610 seconds) |
21:28
π
|
|
Raccoon` is now known as Raccoon |
21:30
π
|
|
marked1 has quit IRC (Ping timeout: 276 seconds) |
21:32
π
|
|
X-Scale has joined #archiveteam-bs |
21:32
π
|
|
Atom__ has joined #archiveteam-bs |
21:35
π
|
|
pew has quit IRC (Ping timeout: 276 seconds) |
21:35
π
|
|
purplebot has quit IRC (Ping timeout: 276 seconds) |
21:35
π
|
|
foureyes_ has joined #archiveteam-bs |
21:35
π
|
|
Frogging has quit IRC (Quit: Close the World, Open the nExt) |
21:35
π
|
|
Hoolootwo has joined #archiveteam-bs |
21:36
π
|
|
Atom-- has quit IRC (Ping timeout: 276 seconds) |
21:36
π
|
|
Hooloovoo has quit IRC (Ping timeout: 276 seconds) |
21:36
π
|
|
foureyes has quit IRC (Ping timeout: 276 seconds) |
21:37
π
|
|
Frogging has joined #archiveteam-bs |
21:38
π
|
|
purplebot has joined #archiveteam-bs |
21:45
π
|
|
britmob has joined #archiveteam-bs |
21:48
π
|
|
DiscantX has joined #archiveteam-bs |
21:54
π
|
|
Dallas has joined #archiveteam-bs |
21:54
π
|
|
pew has joined #archiveteam-bs |
21:55
π
|
|
marked1 has joined #archiveteam-bs |
21:56
π
|
|
Flashfire has joined #archiveteam-bs |
22:01
π
|
|
qw3rty has joined #archiveteam-bs |
22:01
π
|
|
ibachand has joined #archiveteam-bs |
22:02
π
|
|
britmob_ has joined #archiveteam-bs |
22:02
π
|
|
Stiletto has joined #archiveteam-bs |
22:02
π
|
|
OrIdow6 has joined #archiveteam-bs |
22:03
π
|
|
benjins has joined #archiveteam-bs |
22:04
π
|
|
Fionera_ has joined #archiveteam-bs |
22:05
π
|
|
britmob has quit IRC (hub.efnet.us efnet.deic.eu) |
22:05
π
|
|
Atom__ has quit IRC (hub.efnet.us efnet.deic.eu) |
22:05
π
|
|
X-Scale has quit IRC (hub.efnet.us efnet.deic.eu) |
22:05
π
|
|
ibachandl has quit IRC (hub.efnet.us efnet.deic.eu) |
22:05
π
|
|
VerifiedJ has quit IRC (hub.efnet.us efnet.deic.eu) |
22:05
π
|
|
ShellyRol has quit IRC (hub.efnet.us efnet.deic.eu) |
22:05
π
|
|
Maylay has quit IRC (hub.efnet.us efnet.deic.eu) |
22:05
π
|
|
qw3rty__ has quit IRC (hub.efnet.us efnet.deic.eu) |
22:05
π
|
|
Fionera has quit IRC (hub.efnet.us efnet.deic.eu) |
22:05
π
|
|
Stilett0 has quit IRC (hub.efnet.us efnet.deic.eu) |
22:05
π
|
|
HP_Archiv has quit IRC (hub.efnet.us efnet.deic.eu) |
22:05
π
|
|
benjinsmi has quit IRC (hub.efnet.us efnet.deic.eu) |
22:05
π
|
|
obskyr has quit IRC (hub.efnet.us efnet.deic.eu) |
22:05
π
|
|
ctrl_ has quit IRC (hub.efnet.us efnet.deic.eu) |
22:05
π
|
|
kiska3 has quit IRC (hub.efnet.us efnet.deic.eu) |
22:08
π
|
|
Maylay_ has joined #archiveteam-bs |
22:08
π
|
|
asdf0101 has quit IRC (The Lounge - https://thelounge.chat) |
22:08
π
|
|
marked1 has quit IRC (Quit: The Lounge - https://thelounge.chat) |
22:10
π
|
|
asdf0101 has joined #archiveteam-bs |
22:10
π
|
|
marked1 has joined #archiveteam-bs |
22:15
π
|
|
actually_ has joined #archiveteam-bs |
22:21
π
|
|
HP_Archiv has joined #archiveteam-bs |
22:21
π
|
|
ShellyRol has joined #archiveteam-bs |
22:24
π
|
|
BlueMax has joined #archiveteam-bs |
22:29
π
|
|
schbirid has quit IRC (Read error: Operation timed out) |
22:39
π
|
|
X-Scale has joined #archiveteam-bs |
22:41
π
|
|
ctrl_ has joined #archiveteam-bs |
22:46
π
|
|
DiscantX has quit IRC (Remote host closed the connection) |
23:20
π
|
|
af10b3e5e has joined #archiveteam-bs |
23:20
π
|
|
d5f4a3622 has quit IRC (Read error: Connection reset by peer) |
23:40
π
|
|
dewdrop3 has joined #archiveteam-bs |
23:49
π
|
|
dewdrop has quit IRC (Ping timeout: 745 seconds) |
23:49
π
|
|
dewdrop3 is now known as dewdrop |