| Time |
Nickname |
Message |
|
00:02
π
|
Cowering |
oh.. maybe not, i mistook those for http://www.parodius.com/ |
|
00:07
π
|
arkhive |
S |
|
00:07
π
|
arkhive |
SketchCow: Do you have plans for speaking/presenting in Colorado in 2014? |
|
00:53
π
|
ivan` |
Lord_Nigh: how big is this site in GB? |
|
00:54
π
|
ivan` |
also, do you need all of the uptobox.com files? |
|
00:54
π
|
ivan` |
grabbing from a bunch of different file hosts could be annoying |
|
00:55
π
|
ivan` |
Lord_Nigh: achivebot is grabbing the site itself http://archivebot.at.ninjawedding.org:4567/ |
|
01:16
π
|
Lord_Nigh |
Cowering: the nesdev.parodius.com forums were backed up and rehosted by tepples at nesdev.com when parodius shut down |
|
01:16
π
|
Lord_Nigh |
ivan`: i'm not sure whether the uptobox files are needed ornot but i'd err on the side of yes |
|
01:18
π
|
SketchCow |
In general? No idea. |
|
01:23
π
|
ivan` |
Lord_Nigh: you might want to get them all with jdownloader or similar |
|
01:44
π
|
DragonDon |
sooo, "ArchiveTeam's Choice" chooses 'TinyBack' which fails retreiving anything. Should I be sticking with one of the other 3 options? (formspring, URLTeam, blip"? |
|
01:45
π
|
ivan` |
http://pastebin.com/j05z2Bnx external links for the 69 pages on 64scener |
|
01:47
π
|
DragonDon |
actually....nothing is working. Tried the others, "No item received" message for URLTeam and blip |
|
11:08
π
|
antomatic |
Clearly everything has been archived already, then! :) |
|
11:09
π
|
antomatic |
It feels like we should always make sure that there's at least ONE project available at all times in the warrior, even if it's only pre-emptive archiving or crawling or similar. |
|
11:11
π
|
antomatic |
There IS stuff that needs to be done - e.g. the warhammer forums closing in December - but I think that project may be in need of some setup assistance? |
|
11:36
π
|
ersi |
DragonDon: Don't worry, it'll start doing project work. |
|
11:36
π
|
Nemo_bis |
antomatic: isn't urlteam running? |
|
11:36
π
|
ersi |
Nemo_bis: TinyBack == URLTeam |
|
11:36
π
|
Nemo_bis |
so? |
|
11:37
π
|
ersi |
It fucks up occationally |
|
12:34
π
|
DragonDon |
pccasionally? |
|
12:35
π
|
DragonDon |
guess I just don't want something sitting that will 'eventually' do something... |
|
12:35
π
|
Nemo_bis |
SketchCow: turns out the discs I have, alone, are almost 8 kg... let's see how much they cost |
|
12:35
π
|
Nemo_bis |
DragonDon: orly? isn't that what daemons are for? :) |
|
12:36
π
|
DragonDon |
warrior != daemon |
|
12:37
π
|
DragonDon |
does that mean you are ok with asking your print daemon to print and then hoping it'll print 'eventually'? |
|
13:04
π
|
ersi |
Well, we've had some problems with that specific project - which currently, is the only one active that you can use in the ArchiveTeam Warrior. |
|
13:08
π
|
* |
joepie93 reminds people that Hyves needs urgent backing up and that no code exists for it yet, and that there is #angerthehyve for that |
|
13:08
π
|
joepie93 |
(and it should probably be a warrior project because they have LOTS of data) |
|
13:11
π
|
ersi |
Oh yes. |
|
13:13
π
|
Foxboron |
joepie93: happy now? |
|
13:13
π
|
joepie93 |
Foxboron: ohai, wrong channel |
|
13:13
π
|
joepie93 |
#angerthehyve |
|
13:13
π
|
joepie93 |
:P |
|
13:13
π
|
joepie93 |
cc _46bit |
|
13:14
π
|
Foxboron |
ohh |
|
13:15
π
|
_46bit |
Hey guys |
|
13:16
π
|
_46bit |
joepie93: Yup I saw, that's why I connected :) |
|
13:18
π
|
DragonDon |
so then...nothing to save right now then huh? ok, will check in some other time. |
|
13:24
π
|
joepie93 |
DragonDon: not -yet- |
|
13:25
π
|
joepie93 |
through the warrior anyway |
|
13:25
π
|
joepie93 |
in a few days at most, some code should materialize for hyves at least |
|
13:25
π
|
ersi |
DragonDon: Feel free to hang around and/or come around some other time :) |
|
13:28
π
|
godane |
looks like the buck sexton show is now 6 days a week |
|
13:31
π
|
joepie93 |
ATTENTION PYTHON DEVELOPERS: developers needed to write pipeline code for archiving Hyves, a massive Dutch social network; please join #angerthehyve |
|
13:31
π
|
joepie93 |
shutdown expected in under a month |
|
13:34
π
|
Nemo_bis |
DragonDon: if you don't want to have something (the warrior) sitting there doing something (or maybe not), you can run the scripts directly |
|
13:34
π
|
Nemo_bis |
the warrior is to avoid worrying |
|
13:36
π
|
DragonDon |
Nemo_bis, I'm cool with letting things run in the background. While not overly versed in scripts(something I am learning more about this year) but if it's not too much work/great a learning curve, I'll be game. |
|
13:58
π
|
ersi |
No need to be 'versed'. It's just "running programs" really |
|
13:58
π
|
ersi |
But the whole point of the ArchiveTeam warrior is to be as little hassle as possible with regards to running projects :) |
|
14:04
π
|
DragonDon |
I haven't looked but is the setup instructions in the wiki? |
|
14:06
π
|
ersi |
Of what? Running the scripts standalone? |
|
14:08
π
|
DragonDon |
running the scripts |
|
14:09
π
|
ersi |
No, there's no general installation instruction. But they are usually available in the README file for each projects source code. |
|
14:09
π
|
ersi |
They require to be run in a Linux environment though. |
|
14:10
π
|
DragonDon |
oh, ok. Having never looked at any of the source files, where do I find them? |
|
14:10
π
|
DragonDon |
I run Linux Mint 15 :) So we're good |
|
14:10
π
|
ersi |
They're all available at https://github.com/ArchiveTeam/ |
|
14:10
π
|
antomatic |
A good place to look is github.com/ArchiveTeam |
|
14:10
π
|
antomatic |
doh, stereo :) |
|
14:10
π
|
ersi |
Usually, the ones called something with -grab are projects. There's a bunch of misc. source repositories there as well |
|
14:11
π
|
ersi |
With the exception of tinyback (which is the urlteam project) |
|
14:11
π
|
antomatic |
Have a look at puush-grab, for example. That project is still live and runs pre-emptively with new work units every hour. |
|
14:12
π
|
DragonDon |
oh, ok, these seem pretty straightfoward to setup and run. |
|
14:12
π
|
antomatic |
You can run that (as a script) now, today. It sits idle when there's nothing to do, and springs into action each hour to get new stuff |
|
14:12
π
|
DragonDon |
I'll look into it tomorrow after I got to bed. nearly 11:30pm here |
|
14:12
π
|
ersi |
Sure thing. :) |
|
14:14
π
|
DragonDon |
Question: I see "./get-wget-lua.sh" then I see "# Start downloading with: screen ~/.local/bin/run-pipeline --disable-web-server pipeline.py YOURNICKNAME" does that mean I need to edit the script with that info? run both commands consecutively? |
|
14:15
π
|
antomatic |
No, you can just type it as a command |
|
14:15
π
|
antomatic |
so usually something like... |
|
14:15
π
|
antomatic |
cd whatever-grab |
|
14:15
π
|
DragonDon |
but there are TWO things to type.... |
|
14:15
π
|
antomatic |
./get-wget-lua.sh |
|
14:15
π
|
antomatic |
run-pipeline --concurrent 9000 pipeline.py AntIsFantastic --disable-web-server |
|
14:15
π
|
antomatic |
or screen run-pipeline --concurrent 9000 pipeline.py AntIsFantastic --disable-web-server for extra funk |
|
14:15
π
|
DragonDon |
ah, the .sh is just to set it up right? got it |
|
14:15
π
|
* |
antomatic nods |
|
14:17
π
|
DragonDon |
ok, screw it....doing it now |
|
14:22
π
|
BiggieJon |
concurrent 9000 ??!? |
|
14:23
π
|
ersi |
Sounds like a bad idea :) |
|
14:23
π
|
joepie93 |
that's what I tried to tell him during isoprey |
|
14:23
π
|
joepie93 |
lol |
|
14:23
π
|
BiggieJon |
need like 1TB ram to run that many threads |
|
14:26
π
|
DragonDon |
ok, script running now...but "No item received. Retrying after 30 seconds..." will it keep trying every 30 seconds till it finds something then switch to once an hour or the like? |
|
14:27
π
|
BiggieJon |
new items are added to teh tracker at teh top of each hour, takes about 15-20 min to run thru the additions |
|
14:29
π
|
ersi |
If you leave it running, it'll pick up work |
|
14:30
π
|
DragonDon |
ok, will do |
|
15:44
π
|
joepie93 |
hey look, it's a website shutdown hotline: https://docs.google.com/forms/d/1jAzdEfsAGNzzVQpDisNDuJHk_kjCUIXV2nCpKsPAL0I/viewform |
|
16:02
π
|
Lord_Nigh |
joepie93: nice! |
|
16:03
π
|
Lord_Nigh |
64scener already got archived though (except for external links) so... |
|
17:30
π
|
joepie93 |
hotline responses minus the e-mail addresses go here: https://docs.google.com/a/cryto.net/spreadsheet/ccc?key=0Aj7l5eFy3CKsdDFWRUxwMGVjTmhYc291ZXlCdk1zOWc#gid=0 |
|
19:50
π
|
Nemo_bis |
is there a way to use wget --continue on web.archive.org? ΓΒ«Note that -c only works with FTP servers and with HTTP servers that support the "Range" header.ΓΒ» |
|
20:10
π
|
Nemo_bis |
https://archive.org/post/1003894/wayback-machine-doesnt-support-the-range-header-aka-wget-continue-doesnt-work |
|
21:32
π
|
_46bit |
Hey guys |
|
21:33
π
|
_46bit |
Can I setup a server to just archive whatever happens to be going on, so I don't have to babysit it over time? |
|
21:33
π
|
_46bit |
I've only helped with #isoprey before so don't know much about this. |
|
21:38
π
|
DFJustin |
that's basically the warrior http://archiveteam.org/index.php?title=ArchiveTeam_Warrior |
|
21:38
π
|
DFJustin |
not every project is hooked up to that though |
|
21:40
π
|
_46bit |
Oh I see, the ArchiveTeamΓ’ΒΒs Choice option. Thanks DFJustin. Is there a guide to setting that up outside the VM? |
|
21:42
π
|
DFJustin |
there might be but I don't know where |
|
21:45
π
|
ersi |
_46bit: Yes, for every project there's a README who documents how to get going. All of the projects source code are on https://github.com/ArchiveTeam/ and the projects usually have "-grab" in their name |
|
21:47
π
|
_46bit |
ersi: Yeah, I meant getting the ArchiveTeam's Choice working outside it - thanks tohugh :-) |
|
21:48
π
|
ersi |
Oh, ah. Not in a simple way, no. |
|
21:48
π
|
ersi |
You could actually do that though. But that'd mean running every script that make up the Warrior :) |
|
21:48
π
|
ersi |
Should make a guide for that anyway though |
|
21:50
π
|
_46bit |
ersi: Okay, thanks. I suppose I'll just go for one of the longer-term projects for now. |
|
22:44
π
|
touya |
Blue Max was a good game. |
|
22:45
π
|
BlueMax |
>___> |
|
22:46
π
|
touya |
i think it was about the first games i ever played. that and decathlon. and this probably belongs into -bs. |
|
22:46
π
|
* |
ersi nods and pats touya |