#urlteam 2018-12-06,Thu

Time Nickname Message
02:17 🔗 klg has joined #urlteam
04:11 🔗 dashcloud has quit IRC (Ping timeout: 633 seconds)
04:15 🔗 dashcloud has joined #urlteam
04:17 🔗 odemg has quit IRC (Ping timeout: 265 seconds)
04:29 🔗 odemg has joined #urlteam
06:05 🔗 dashcloud has quit IRC (No Ping reply in 210 seconds.)
06:09 🔗 dashcloud has joined #urlteam
06:28 🔗 dashcloud has quit IRC (Read error: Connection reset by peer)
06:31 🔗 dashcloud has joined #urlteam
06:32 🔗 bacchusm has joined #urlteam
06:33 🔗 bacchusm Hi, I have some extra bandwidth, lots of disk space and an extra Raspberry Pi 3+. What would be the best team to join with that?
06:34 🔗 teej_ bacchusm: Team?
06:35 🔗 bacchusm ftp team, Newsgrabber, URL etc...
06:35 🔗 bacchusm Sorry, projects
06:39 🔗 teej_ Oh. Well you can get started with URLTeam's TerrorOfTinyTown project. It uses very little bandwidth and space.
06:41 🔗 teej_ That's an easy one to get started with. I haven't done the others yet.
06:41 🔗 teej_ bacchusm: Firstly, you need to have Python installed. Use `pip install --upgrade seesaw requests`.
06:42 🔗 bacchusm Okay, that is easy
06:42 🔗 teej_ Then you go to a directory... for example `cd Downloads`.
06:44 🔗 teej_ Then you want to get the official git repo by typing `git clone --recurse-submodules https://github.com/ArchiveTeam/terroroftinytown-client-grab.git`
06:44 🔗 teej_ That will get the URLTeam's code.
06:45 🔗 teej_ Then you go into the directory: `cd terroroftinytown-client-grab/`
06:47 🔗 teej_ Whenever you want to run the code, just type `run-pipeline --concurrent 6 --auto-update pipeline.py USERNAME` and replace the USERNAME with anything you like.
06:49 🔗 teej_ You can set the concurrent flag to 3 if you want to save resources. I guess it doesn't matter much with bitly and tinyurl queries.
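
The setup steps teej_ lists above can be gathered into one shell sketch. The commands and flags are the ones quoted in the log; the function names `setup_tinytown` and `tinytown_run_command` and the `NICK` and `CONCURRENT` variables are hypothetical names chosen for illustration, and the sketch assumes `pip` and `git` are already installed.

```shell
# Sketch only: collects the commands quoted in the log into two helpers.
# NICK, CONCURRENT and both function names are illustrative assumptions.
NICK="${NICK:-YOURNICKHERE}"
CONCURRENT="${CONCURRENT:-6}"
REPO_URL="https://github.com/ArchiveTeam/terroroftinytown-client-grab.git"

# Print the run command for a given concurrency ($1) and nickname ($2)
# without executing it, so it can be inspected first.
tinytown_run_command() {
    printf 'run-pipeline --concurrent %s --auto-update pipeline.py %s\n' "$1" "$2"
}

# One-time setup: install the Python packages, clone the repo with its
# submodules, and enter the checkout. Defined here but not called.
setup_tinytown() {
    pip install --upgrade seesaw requests &&
    git clone --recurse-submodules "$REPO_URL" &&
    cd terroroftinytown-client-grab/
}

# Show the command that would be run from inside the checkout.
tinytown_run_command "$CONCURRENT" "$NICK"
```

Calling `setup_tinytown` once and then running the printed command from inside the checkout follows the same order as the chat; setting `CONCURRENT=3` gives the lighter configuration mentioned above.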
06:49 🔗 bacchusm I put a new SD card I will follow directions shortly.
06:49 🔗 bacchusm I did a screen grab
06:49 🔗 dashcloud has quit IRC (Quit: No Ping reply in 180 seconds.)
06:51 🔗 teej_ You can then go to https://tracker.archiveteam.org:1338 to see your progress.
06:51 🔗 bacchusm ok install some now.
06:51 🔗 teej_ I have mine running 24/7. It's pretty easy.
06:53 🔗 bacchusm It says bash: run-pipeline: command not found
06:54 🔗 teej_ Did you do `pip install --upgrade seesaw requests`?
06:54 🔗 dashcloud has joined #urlteam
06:55 🔗 bacchusm Yes I did
06:56 🔗 teej_ Type `python --version` what does it say?
06:56 🔗 bacchusm 2.7.13
06:57 🔗 bacchusm Does it need 3.7?
06:59 🔗 teej_ Hmm... well you can use python3 as well...
07:00 🔗 teej_ You can do `pip3 install --upgrade seesaw requests` and then `run-pipeline3 --concurrent 6 --auto-update pipeline.py USERNAME`.
07:01 🔗 teej_ I'm not sure why your `run-pipeline` command isn't there.
07:02 🔗 bacchusm Do I need the TinyTown grab using Python 3?
07:03 🔗 bacchusm Where does 'Pipeline' get installed is that same as Seesaw?
07:06 🔗 teej_ So when you use the pip install thing, I think it would install the run-pipeline command as well... I remember having the same issue as you did when I first set it up.
07:07 🔗 teej_ You can run the bash script, `bash warrior-install.sh`.
07:07 🔗 teej_ Maybe try closing the terminal and opening it again...
07:08 🔗 bacchusm I will try closing and opening.
07:08 🔗 bacchusm Bash warrior didn't work
07:11 🔗 teej_ Hmmm... So when you do `pip install --upgrade seesaw requests`, what does it say?
07:14 🔗 bacchusm Collecting seesaw, Collecting requests...
07:15 🔗 bacchusm then a whole bunch of packages... but I don't see pipeline. Weird, but it might have scrolled by
07:16 🔗 teej_ Oh... if it's collecting, doesn't that mean it just finished installing?
07:16 🔗 teej_ If you type run-pip and then press tab, does it autocomplete to run-pipeline?
07:17 🔗 bacchusm nope. It installed six, singledispatch, certifi, backports-abc, tornado, socksjs, seesaw, urllib3, idna, chardet, requests
07:17 🔗 bacchusm Thank you for helping me, I really appreciate it
07:18 🔗 bacchusm run and tab... it has runcon, runlevel, run-parts, run-with-aspell, run-erl, run-mailcap, runuser
07:20 🔗 teej_ Hmm...
07:21 🔗 teej_ I'm trying to remember what I did to fix the same issue.
07:22 🔗 bacchusm I used this and it started to work
07:22 🔗 bacchusm ~/.local/bin/run-pipeline pipeline.py --concurrent 2 YOURNICKHERE
07:22 🔗 bacchusm I believe it is working at least
07:25 🔗 teej_ Oh! That means pip installed run-pipeline into your local user directory!
07:26 🔗 teej_ So you can probably try `sudo pip install --upgrade seesaw requests`.
07:26 🔗 bacchusm So stop it and use the sudo?
07:27 🔗 teej_ Try `pip install --global --upgrade seesaw requests` first.
07:29 🔗 teej_ Actually... I think you're just using your standard python. So everything is pretty much perfectly fine. My system uses pyenv for python, so that's probably why it's a little different.
07:30 🔗 bacchusm oh. okay. So I can just let it run. I saw my name flash by in the stats so it seems to be working.
07:30 🔗 bacchusm Is there another project to run on a Pi or is this the only one you tried?
07:31 🔗 teej_ So... recap: whenever you want to run the URLTeam code, just `cd` to the directory and type `~/.local/bin/run-pipeline pipeline.py --concurrent 2 YOURNICKHERE` and just let it run.
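
As an alternative to typing the full `~/.local/bin/run-pipeline` path each time, the directory can be put on `PATH` for the current shell. This is a minimal sketch, not something from the conversation: `path_contains` and `path_prepend` are hypothetical helper names, and it assumes pip placed the script in `~/.local/bin`, as bacchusm's working command suggests.

```shell
# Return success if directory $1 is already listed in PATH.
path_contains() {
    case ":$PATH:" in
        *":$1:"*) return 0 ;;
        *) return 1 ;;
    esac
}

# Prepend directory $1 to PATH for the current shell only,
# unless it is already present.
path_prepend() {
    if ! path_contains "$1"; then
        PATH="$1:$PATH"
        export PATH
    fi
}

# Assumption: pip installed run-pipeline into ~/.local/bin.
path_prepend "$HOME/.local/bin"

# Report where run-pipeline resolves, or say that it is still missing.
command -v run-pipeline || echo "run-pipeline still not found on PATH"
```

The change lasts only for the current shell session; adding the same `PATH` line to a shell startup file would make it persistent.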
07:31 🔗 teej_ Well... this is the only one I tried so far.
07:32 🔗 bacchusm Yeah. I am going to set it up at my office tomorrow and let it continually run.
07:32 🔗 bacchusm Cool! Once again, thank you for your help!
07:32 🔗 teej_ You could also set up the ArchiveBot which can archive thousands of pages, but if you do, your IP might get accidentally banned on some sites.
07:33 🔗 teej_ That's why I haven't done that.
07:33 🔗 bacchusm Does that work on a Pi? I didn't see any install.
07:33 🔗 bacchusm They all need VirtualBox or VMware, which the Pi doesn't run. At least not that I know of
07:34 🔗 bacchusm I'm okay with getting banned on some sites.
07:34 🔗 teej_ I don't think it will run on Pi. You can see what's being archived here: http://dashboard.at.ninjawedding.org/
07:36 🔗 teej_ Some jobs last for months. They eventually get uploaded to the Internet Archive so people can view websites using the Wayback Machine.
07:37 🔗 bacchusm Very cool. Do you know what type of machine I need for an ArchiveBot? I have a couple of old ones sitting around and plenty of HD space
07:38 🔗 teej_ https://www.archiveteam.org/index.php?title=ArchiveBot#Volunteer_to_run_a_Pipeline
07:39 🔗 bacchusm Is that the same pipeline I am running now?
07:40 🔗 teej_ So you join the #archivebot channel and ask one of the Ops to help set things up and get things working.
07:40 🔗 bacchusm Okay cool. Thank You again. Will bounce over there now.
07:40 🔗 teej_ No, the pipeline you're running is for URLTeam's TerrorOfTinyTown.
07:40 🔗 bacchusm okay got Ya
07:41 🔗 bacchusm_ has joined #urlteam
07:42 🔗 qwebirc57 has joined #urlteam
07:43 🔗 qwebirc57 sorry, I clicked on your link and it exited me
07:43 🔗 bacchusm_ has quit IRC (Client Quit)
07:43 🔗 teej_ Whose link?
07:43 🔗 bacchusm_ has joined #urlteam
07:45 🔗 bacchusm has quit IRC (Ping timeout: 260 seconds)
07:45 🔗 bacchusm has joined #urlteam
07:47 🔗 qwebirc57 has quit IRC (Ping timeout: 260 seconds)
07:48 🔗 bacchusm Sorry teej, I clicked on your link and got transferred to another IRC channel. I thought another window would open
07:48 🔗 bacchusm_ has quit IRC (Ping timeout: 260 seconds)
07:56 🔗 teej_ bacchusm: Oh. That's fine. Don't worry.
07:57 🔗 teej_ For IRC stuff, I use IRCCloud. It makes things easier.
12:12 🔗 zerkalo has quit IRC (Ping timeout: 260 seconds)
12:23 🔗 zerkalo has joined #urlteam
13:01 🔗 Kagee has quit IRC (WeeChat 2.3)
13:03 🔗 Kagee has joined #urlteam
21:55 🔗 coldon2dr has quit IRC (Read error: Operation timed out)
22:43 🔗 bacchusm has quit IRC (Ping timeout: 260 seconds)
22:55 🔗 hook54321 has quit IRC (Quit: Connection closed for inactivity)
