#archiveteam-bs 2017-10-19,Thu

↑back Search ←Prev date (last date) Show only urls(Click on time to select a line by its url)

WhoWhatWhen
***aschmitz has joined #archiveteam-bs [00:19]
.......... (idle for 48mn)
Stilett0 has joined #archiveteam-bs
Mateon1 has quit IRC (Read error: Operation timed out)
[01:07]
robink has joined #archiveteam-bs [01:20]
........ (idle for 38mn)
Stilett0 has quit IRC (Read error: Operation timed out) [01:58]
pizzaiolo has quit IRC (Remote host closed the connection) [02:12]
robink has quit IRC (Read error: Connection reset by peer)
robink has joined #archiveteam-bs
[02:18]
tuluu_ has quit IRC (Read error: Operation timed out) [02:25]
SilSte has quit IRC (Ping timeout: 492 seconds)
SilSte has joined #archiveteam-bs
tuluu has joined #archiveteam-bs
[02:30]
Stilett0 has joined #archiveteam-bs
drumstick has quit IRC (Ping timeout: 255 seconds)
[02:45]
Pixi has quit IRC (Ping timeout: 255 seconds) [02:59]
Pixi has joined #archiveteam-bs [03:04]
qw3rty11 has joined #archiveteam-bs [03:17]
qw3rty10 has quit IRC (Read error: Operation timed out)
Aoede has quit IRC (Ping timeout: 250 seconds)
[03:22]
Aoede has joined #archiveteam-bs
Aoede has quit IRC (Connection closed)
Aoede has joined #archiveteam-bs
Stilett0 is now known as Stiletto
[03:31]
........ (idle for 37mn)
Sk1d has quit IRC (Ping timeout: 250 seconds) [04:09]
Sk1d has joined #archiveteam-bs
Sk1d has quit IRC (Connection Closed)
Sk1d has joined #archiveteam-bs
[04:16]
porky has joined #archiveteam-bs [04:23]
.... (idle for 19mn)
Somebody2Hm, http://archiveteam.org/index.php?title=Textfiles is a blank page -- it probbbably shouldn't be [04:42]
porkyI want to save myself to the harddisk for reading later
mirrors are cool, but I need the files for my personal library
[04:46]
Somebody2try this: https://archive.org/details/textfiles-dot-com-2011
specifically, this torrent: https://archive.org/download/textfiles-dot-com-2011/textfiles-dot-com-2011_archive.torrent
[04:46]
porkyo I downloaded this already [04:47]
Somebody2Ah, what was missing? [04:47]
porkybut is it old already, in 2011, a lot of files have been added since that time?
I would have fresh ways)
but if there is not one, I'll download the archives from the site
[04:48]
Somebody2Hm, dig around on archive.org, and please report what you find.
I'm not sure how much has been added since 2011, frankly.
[04:51]
***Asparagir has joined #archiveteam-bs [04:52]
porkyWell, that's the problem, it seems, there was a lot added from that time [04:52]
Somebody2Hm. [04:53]
porkyin any case, thanks, buddy, I'll download from the main site, it's good that there is not much work [04:53]
Somebody2Sure. [04:54]
porkyI have one more question
on another issue
about Mozilla Addons
http://www.archiveteam.org/index.php?title=Mozilla_Addons
[04:54]
Somebody2What about? [04:57]
porkyDoes the status of the project mean that the archiving was canceled? [04:57]
Somebody2I think we tried grabbing it with ArchiveBot, but it didn't finish. So further effort would be welcomed! [04:58]
porkywith Mozilla such a difficult situation, that around the summer of 2018 old addons will be removed from Amo [04:59]
***drumstick has joined #archiveteam-bs [04:59]
porkySo before the b-day is there a chance that you will have time to save Amo? [05:01]
Somebody2We includes *you* -- will *you* have time?
A good first step would be making a list of them, probalby through the API.
[05:02]
porkyunderstand
i'm trying to save now using offline explorer enterprise
but this is not very productive
[05:03]
Somebody2What makes it ineffective? [05:04]
porkyWell, there are about a million files on the site, it seems that only light themes are almost 500,000 [05:06]
***Asparagir has quit IRC (Asparagir) [05:06]
Somebody2AFAIK, themes aren't scheduled to be removed, though. [05:08]
porkyfirstly slowly save, and has a limit on the number of downloaded files [05:08]
Somebody2Ah, I see. [05:08]
porkyI know, I gave an example that there are a lot of files [05:08]
Somebody2Are you able to write code yourself? [05:09]
porkyno, nfortunately [05:11]
Somebody2Ah, that does make it more difficult. [05:13]
***Mateon1 has joined #archiveteam-bs [05:13]
porkyarchiveteam has more productive tools [05:14]
Somebody2Few that don't require at least some programming to get going, though. :-/ [05:16]
porkyTell me how I can use this archiveteam tools or how I can help to save addons
I just at the moment have completely saved almost 900 firefox add-ons, I used scrapbook and unmht add-ons
[05:16]
Somebody2Well, the start would be generating a list of the old-style addons that are supposed to get removed. [05:18]
porkybut in terms of time, this is hellish work [05:18]
Somebody2porky: I'm delighted to hear you've got 900 of them -- please upload them to archive.org!
You can create an account at archive.org with any email address, and upload them that way.
[05:18]
porkyso to join the team you need to be able to write code? [05:20]
Somebody2No, but to start a new project, it greatly helps.
The team is ... rather informal. You can participate by as little as mentioning sites that are in danger of going away in the IRC channel.
Or helping to investigate url shortening services so urlteam can scrape them.
Or running the ArchiveTeam Warrior, a virtual machine that helps us with bigger projects.
Or processing suggestions over in the #archivebot channel, and making sure jobs being worked on there have proper ignore sets applied.
[05:20]
porkyWell, I understood, Ie for current projects [05:23]
Somebody2So there are a lot of ways to help that don't require writing code -- but starting new projects is kinda tricky without it.
For AMO, it looks like this is the API query to start with: https://addons-server.readthedocs.io/en/latest/topics/api/addons.html#search
[05:23]
porkyunderstood [05:27]
mozilla thankless cattle) [05:33]
Somebody2hm [05:33]
***porky has quit IRC (Quit: ChatZilla 0.9.92 [Firefox 28.0/20140314220517]) [05:35]
kisspunchhmm, i wrote a mozilla addon scraper for quixey, which is now out of business--I doubt they'll send me the code but I can ask [05:47]
Somebody2kisspunch: might as well ask!
I'm about to start grabbing all 839 25-item pages of API results for firefox extensions; so that'll at least get us a list to work from.
[05:47]
kisspunchthat's most of what you'd get from me anyway, but i asked
don't expect an answer in any relevant timespan
[05:48]
Somebody2nods [05:48]
kisspunchdoes anyone have hard data on cold storage bit rot or mechanical lifetimes [05:49]
Somebody2well it seems to be going rather fast, so that's nice at aleast [05:49]
kisspunchi'm wondering what the lifetime of cold storage hard drives is [05:49]
Somebody2ok, that finished
Here's how I did it, for reference: for n in {1..839}; do curl -L 'https://addons.mozilla.org/api/v3/addons/search?app=firefox&type=extension&sort=created&page='$n | tee results_$n; date; done
It's 92MB of results; I'll send them to anyone who asks.
[05:55]
And it looks like there are 17,293 (out of the 20,962 total) that are not WebExtensions (and so will be trashed, in theory).
which I generated with: jq -r '.results[] | select(.current_version.files[].is_webextension != true)| .url' results_* | wc -l
We should probably make a separate channel for this; name suggestions?
[06:05]
It looks like the total size is only 4GB. [06:12]
OK, now dowloading them; about 800 downloaded so far; should be done in a day or less. [06:22]
.... (idle for 19mn)
***fie has quit IRC (Read error: Operation timed out) [06:41]
fie has joined #archiveteam-bs [06:52]
porky has joined #archiveteam-bs [06:58]
schbirid has joined #archiveteam-bs [07:03]
porkySomebody2 hi, sorry for disappear [07:13]
***icedice has joined #archiveteam-bs [07:14]
porkyabout my addons, I because of the lack of time will not be able to put them on the archive.org
But I can specifically send them to you via torrent
for example
although
maybe i need to figure it out
[07:14]
***icedice has quit IRC (Ping timeout: 260 seconds) [07:20]
porkyjust you can find this application better than me [07:22]
***schbirid has quit IRC (Quit: Leaving) [07:28]
.................... (idle for 1h36mn)
porky has quit IRC (Quit: ChatZilla 0.9.92 [Firefox 28.0/20140314220517]) [09:04]

↑back Search ←Prev date (last date) Show only urls(Click on time to select a line by its url)