Time |
Nickname |
Message |
00:00
🔗
|
|
Darkstar has joined #archiveteam-bs |
00:05
🔗
|
|
mr-b has joined #archiveteam-bs |
00:31
🔗
|
|
JesseW has joined #archiveteam-bs |
00:49
🔗
|
|
JesseW has quit IRC (Quit: Leaving.) |
01:10
🔗
|
|
JetBalsa has joined #archiveteam-bs |
01:12
🔗
|
|
MrIdea has quit IRC () |
01:20
🔗
|
|
JesseW has joined #archiveteam-bs |
01:23
🔗
|
acridAxid |
Following the AT wiki, I'm trying to use wget to scrape computerhope.com, but all I get is the index.html page. |
01:25
🔗
|
acridAxid |
Even if I use --mirror and --recursive and a host of other options, all I can pull down is index.html and maybe a bunch of other cross-linked trash. |
01:37
🔗
|
kyan |
acridAxid, it seems to be downloading fine using grab-site |
01:37
🔗
|
kyan |
have you checked your useragent? |
01:41
🔗
|
|
Start has joined #archiveteam-bs |
01:47
🔗
|
|
JesseW has quit IRC (Quit: Leaving.) |
01:58
🔗
|
|
JesseW has joined #archiveteam-bs |
01:59
🔗
|
|
JesseW has quit IRC (Client Quit) |
02:05
🔗
|
|
schbirid2 has joined #archiveteam-bs |
02:07
🔗
|
|
schbirid has quit IRC (Read error: Operation timed out) |
02:23
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
02:26
🔗
|
|
dashcloud has joined #archiveteam-bs |
02:58
🔗
|
|
vitzli has joined #archiveteam-bs |
03:13
🔗
|
acridAxid |
kyan: thanks, I'll look into grab-site. Is this tool preferred over wget today? |
03:14
🔗
|
kyan |
Not exactly. |
03:14
🔗
|
kyan |
It's based on a program called wpull, which *is* preferred over wget |
03:14
🔗
|
acridAxid |
kyan: yeah, i had a useragent set. without one the site simply refuses. |
03:14
🔗
|
kyan |
grab-site is basically a wrapper around wpull |
03:14
🔗
|
acridAxid |
Is wpull preferred over wget for archiving, or just in general? |
03:15
🔗
|
kyan |
Well, I think both |
03:15
🔗
|
kyan |
grab-site: https://github.com/ludios/grab-site/ |
03:15
🔗
|
kyan |
wpull: http://wpull.readthedocs.org/ |
03:15
🔗
|
acridAxid |
kyan: thanks! :) |
03:15
🔗
|
kyan |
np :) |
03:15
🔗
|
kyan |
good luck! |
03:17
🔗
|
acridAxid |
hmm, not packaged for Arch |
03:17
🔗
|
acridAxid |
project for this evening :) |
03:27
🔗
|
dxrt |
acridAxid: Easiest/quickest would just be using pip to install grab-site |
03:27
🔗
|
dxrt |
and it has a nice little webserver to view progress |
03:31
🔗
|
|
wyatt8740 has joined #archiveteam-bs |
03:52
🔗
|
acridAxid |
prefer having it managed by my package manager |
03:52
🔗
|
acridAxid |
i don't need more things to remember to update |
03:52
🔗
|
acridAxid |
also, if i package it, it helps others |
04:01
🔗
|
|
vitzli has quit IRC (Leaving) |
04:25
🔗
|
|
Mayonaise has quit IRC (Read error: Operation timed out) |
04:34
🔗
|
|
Mayonaise has joined #archiveteam-bs |
05:18
🔗
|
|
Infreq has quit IRC (Ping timeout: 258 seconds) |
05:19
🔗
|
|
Infreq has joined #archiveteam-bs |
05:27
🔗
|
|
logchfoo4 has quit IRC (Ping timeout: 360 seconds) |
05:29
🔗
|
|
logchfoo1 starts logging #archiveteam-bs at Sat Feb 06 05:29:46 2016 |
05:29
🔗
|
|
logchfoo1 has joined #archiveteam-bs |
05:47
🔗
|
|
Sk1d has quit IRC (Ping timeout: 250 seconds) |
05:54
🔗
|
|
Sk1d has joined #archiveteam-bs |
06:00
🔗
|
|
JetBalsa has quit IRC (Read error: Connection reset by peer) |
07:22
🔗
|
|
Kenshin has quit IRC (Ping timeout: 260 seconds) |
07:23
🔗
|
|
CHANFIX has joined #archiveteam-bs |
07:23
🔗
|
|
services.int sets mode: +o CHANFIX |
07:23
🔗
|
|
CHANFIX sets mode: -bbbb *!*67c68b35@103.198.139.* *!*JojoRecv@*.dyn.optonline.net *!*stereo197@*.iusacell.net *!*ae830ded@*.mibbit.com |
07:23
🔗
|
|
CHANFIX sets mode: -bbbb WOOHOO!*@* *!WOOHOO@* *!OOHOOW@* *!*@66.23.235.245 |
07:23
🔗
|
|
CHANFIX sets mode: -bbbb *!*archivist@*.static.wa.bigpond.net.au *!*@178.18.16.10 mxncqci79!*@* *!~egyptmosl@41.46.215.* |
07:23
🔗
|
|
CHANFIX sets mode: -bbbb *!*mig2970@190.235.150.* *!*@c-76-108-171-70.hsd1.fl.comcast.net dl-boy-!*@* *!*webchat@71.93.65.* |
07:23
🔗
|
|
CHANFIX sets mode: -bbbb EG!*@* *!*EricJess*@* *!*smuxi@*.hsd1.wa.comcast.net instence!*@* |
07:23
🔗
|
|
CHANFIX sets mode: -bbbb *!~DrShitsAB@50.7.30.34 dec31*!*@* *!*@c-76-22-62-23.hsd1.wa.comcast.net *!*4c163e17@*.mibbit.com |
07:23
🔗
|
|
CHANFIX sets mode: -bbbb *!~korkite@225.Red-79-153-104.dynamicIP.rima-tde.net Dec*!*@* *!*KaderCavd@213.74.159.* *!*@31-184-242-76.mrhost.biz |
07:23
🔗
|
|
CHANFIX sets mode: -bbbb *!*@213.143.61.87 *!*@114.76.21.95.dynamic.jazztel.es garette!*@* *!~garette@213.143.61.* |
07:23
🔗
|
|
CHANFIX sets mode: -bbbb *!*bkr@*.mindhackers.org *!*sponges@*.Red-2-138-166.dynamicIP.rima-tde.net *!~sponges@98.Red-83-39-251.dynamicIP.rima-tde.net *!*sponges@*.Red-83-55-30.dynamicIP.rima-tde.net |
07:23
🔗
|
|
CHANFIX sets mode: -bbbb *!*sponges@*.Red-81-38-80.dynamicIP.rima-tde.net sponges!*@* *!*tichels@*.Red-2-138-161.dynamicIP.rima-tde.net critics!*@* |
07:23
🔗
|
|
CHANFIX sets mode: -bb xocco!*@* *!*uid118096@*.tooting.irccloud.com |
07:23
🔗
|
CHANFIX |
I only joined to remove modes. |
07:23
🔗
|
|
CHANFIX has left |
07:27
🔗
|
dxrt |
why |
07:27
🔗
|
dxrt |
oh we lost all ops nice |
07:33
🔗
|
|
CHANFIX has joined #archiveteam-bs |
07:33
🔗
|
|
services.int sets mode: +o CHANFIX |
07:33
🔗
|
|
CHANFIX sets mode: +oo SadDM closure |
07:33
🔗
|
CHANFIX |
2 clients should have been opped. |
07:33
🔗
|
|
CHANFIX has left |
07:33
🔗
|
|
Kenshin has joined #archiveteam-bs |
07:43
🔗
|
|
CHANFIX has joined #archiveteam-bs |
07:43
🔗
|
|
services.int sets mode: +o CHANFIX |
07:43
🔗
|
|
CHANFIX sets mode: +oo Kenshin ivan` |
07:43
🔗
|
CHANFIX |
2 clients should have been opped. |
07:43
🔗
|
|
CHANFIX has left |
07:53
🔗
|
|
CHANFIX has joined #archiveteam-bs |
07:53
🔗
|
|
services.int sets mode: +o CHANFIX |
07:53
🔗
|
|
CHANFIX sets mode: +o alard |
07:53
🔗
|
CHANFIX |
1 client should have been opped. |
07:53
🔗
|
|
CHANFIX has left |
08:34
🔗
|
|
JesseW has joined #archiveteam-bs |
09:04
🔗
|
|
JesseW has quit IRC (Quit: Leaving.) |
09:26
🔗
|
|
vitzli has joined #archiveteam-bs |
10:12
🔗
|
|
arkiver3 has joined #archiveteam-bs |
10:38
🔗
|
|
arkiver3 has quit IRC (Ping timeout: 252 seconds) |
11:38
🔗
|
HCross2 |
Heh. First page of The Register http://www.theregister.co.uk/2016/02/05/malware_museum/ |
11:45
🔗
|
|
arkiver3 has joined #archiveteam-bs |
11:52
🔗
|
Sk1d |
is friendsreunited currently deactivated? |
12:11
🔗
|
|
arkiver3 has quit IRC (Ping timeout: 252 seconds) |
13:23
🔗
|
godane |
does anyone ever got free ppv around jan 1997? |
13:24
🔗
|
godane |
i got free ppv by going to tv channel 2 back then |
13:25
🔗
|
godane |
this hack happened like every 5 months that |
13:25
🔗
|
godane |
must have been feb 1997 then if it was 5 months |
13:25
🔗
|
godane |
it lasted like 4 days |
13:25
🔗
|
godane |
starting on the 4th thursday of the month |
13:47
🔗
|
joepie91 |
currently haphazardly recording AT5 stream of Pegida protest and counter-protestt |
13:47
🔗
|
joepie91 |
they fucked up their RTMP stream so I'm just dumping .ts segments now |
13:50
🔗
|
|
mismatch has quit IRC (Remote host closed the connection) |
13:50
🔗
|
|
mismatch has joined #archiveteam-bs |
14:42
🔗
|
|
Mayonaise has quit IRC (Read error: Operation timed out) |
14:58
🔗
|
|
username1 has joined #archiveteam-bs |
15:00
🔗
|
|
Mayonaise has joined #archiveteam-bs |
15:00
🔗
|
|
schbirid2 has quit IRC (Read error: Operation timed out) |
15:36
🔗
|
|
username1 is now known as schbirid |
17:12
🔗
|
|
JesseW has joined #archiveteam-bs |
17:15
🔗
|
|
JesseW has quit IRC (Client Quit) |
17:20
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
17:22
🔗
|
|
MrIdea has joined #archiveteam-bs |
17:29
🔗
|
MrIdea |
hey, so i was thinking, although you guys might already thought of this before, why don't we go out and find the user generated content sites(since those as far as i can tell are the ones we are doing mostly and are the most important) and have the management of that site "sign"/agree to "yes, i promise to preserve the content that i host, even in the event |
17:29
🔗
|
MrIdea |
of a merger/shutdown/bankruptcy ect., by either allowing users to export there content via an simple export function, provide a data base dump to the internet archive/archiveteam(with sensitive data removed), and/or collaborating with archiveteam as they scrape my site" |
17:30
🔗
|
MrIdea |
kinda like an contingency agreement |
17:32
🔗
|
MrIdea |
for lack of better words |
17:32
🔗
|
MrIdea |
SketchCow: what are you thoughts on this? |
17:38
🔗
|
SmileyG |
MrIdea: we could |
17:38
🔗
|
SmileyG |
or we could just scrape them |
17:39
🔗
|
MrIdea |
well in this case, scraping would likely be a last resort |
17:39
🔗
|
SmileyG |
nod |
17:39
🔗
|
SmileyG |
wel welcome them to offer us an easier way to get the data before they kill it |
17:40
🔗
|
SmileyG |
however most of the time they aren't interested |
17:42
🔗
|
MrIdea |
i think stuff like this might be better for "smaller" sites, like um... say an forum or such |
18:13
🔗
|
xmc |
nobody ever responds positively to that |
18:13
🔗
|
xmc |
never |
18:13
🔗
|
xmc |
we've tried |
18:13
🔗
|
xmc |
alard closure SadDM ivan` Kenshin: ops |
18:14
🔗
|
|
Kenshin sets mode: +o xmc |
18:14
🔗
|
xmc |
thx |
18:14
🔗
|
|
xmc sets mode: +o swebb |
18:14
🔗
|
|
swebb sets mode: +o DFJustin |
18:14
🔗
|
|
swebb sets mode: +o SketchCow |
18:14
🔗
|
|
swebb sets mode: +o antomatic |
18:14
🔗
|
|
swebb sets mode: +o balrog |
18:14
🔗
|
|
swebb sets mode: +o brayden |
18:14
🔗
|
|
swebb sets mode: +o ersi |
18:34
🔗
|
|
VADemon has joined #archiveteam-bs |
19:26
🔗
|
SketchCow |
MrIdea: They don't talk to us |
19:35
🔗
|
MrIdea |
besides the fact of "b-but i don't want to lose my userbase/money", i don't see why they would really disagree |
19:36
🔗
|
SketchCow |
Besides the unstoppable, irrefutable facts, there's no reason |
19:36
🔗
|
SketchCow |
See also: Lawyers |
19:36
🔗
|
SketchCow |
Every once in a while one of the folks in here negotiates. |
19:36
🔗
|
SketchCow |
I'd say that works once out of 80 large-scale projects. |
19:39
🔗
|
MrIdea |
well i'm sure doing this for a site thats soul porpouse of existence was profit, those guys would very much disagree |
19:40
🔗
|
MrIdea |
i'm thinking this would be much more effective for some of the smaller sites like fourms/geocity clones/ ect |
19:42
🔗
|
SketchCow |
We do that. |
19:42
🔗
|
SketchCow |
We do that all the time. |
19:42
🔗
|
SketchCow |
We've been here 7 years. |
19:47
🔗
|
MrIdea |
haha, might want to update that topic then SketchCow |
19:51
🔗
|
|
SketchCow changes topic to: Archive Team: Oh Yeah, Negotiation is the Answer | <SketchCow> Shut Up |
20:11
🔗
|
|
vitzli has quit IRC (Leaving) |
20:14
🔗
|
ivan` |
MrIdea: a few people, have, hilariously come into #archivebot to make a backup of their own site |
20:15
🔗
|
ivan` |
MrIdea: one of the problems with database dumps is that somebody has to kill a few days either setting up the site infrastructure or dumping it into something readable |
20:15
🔗
|
SimpBrain |
oh my no backup plan |
20:16
🔗
|
SimpBrain |
might aswell leave the server in a smokey state |
20:17
🔗
|
SimpBrain |
owner or web/database people have a smartass way of creating the database so that it's structure only suits the site and cant be used elsewhere |
21:17
🔗
|
godane |
so 2013 of The Morning Joe podcast is all uploaded |
21:41
🔗
|
|
Swizzle has quit IRC (Quit: Leaving) |
22:04
🔗
|
dashcloud |
MrIdea: You should try that- if it works, great, otherwise they are put on notice that rogue archivists are coming for them |
22:13
🔗
|
ersi |
MrIdea: Feel free to try to negotiate with sites. There's no ArchiveTeam management. That said, it usually does not work |
22:13
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
22:13
🔗
|
ersi |
if it would, more people would ask for a backup or "sign" people up |
22:14
🔗
|
|
dashcloud has joined #archiveteam-bs |
22:33
🔗
|
Sanqui |
anyway, I think GNU Social *has* the potential to become significant. |
22:33
🔗
|
Sanqui |
Imagine if Twitter did mess with users timelines, a few major users made their exodus, and media caught on it |
22:33
🔗
|
Sanqui |
...oh, icedice ain't here. |
22:34
🔗
|
ersi |
I dunno, decentralization isn't really a trend |
22:34
🔗
|
ersi |
and it's confusing as fuck |
22:35
🔗
|
|
icedice has joined #archiveteam-bs |
22:35
🔗
|
Sanqui |
<Sanqui> anyway, I think GNU Social *has* the potential to become significant. |
22:35
🔗
|
Sanqui |
<Sanqui> Imagine if Twitter did mess with users timelines, a few major users made their exodus, and media caught on it |
22:35
🔗
|
Sanqui |
<Sanqui> ...oh, icedice ain't here. |
22:35
🔗
|
Sanqui |
<@ersi> I dunno, decentralization isn't really a trend |
22:35
🔗
|
Sanqui |
<@ersi> and it's confusing as fuck |
22:35
🔗
|
ersi |
Techies looove decentralization and the idea/potential |
22:35
🔗
|
ersi |
it's confusing as fuck for non-aspies though |
22:36
🔗
|
Sanqui |
can it not be completely abstracted away? |
22:36
🔗
|
ersi |
In what way? You need an account *somewhere* |
22:36
🔗
|
Sanqui |
with quitter you don't even have to realize there are alternative instances |
22:36
🔗
|
ersi |
even if it's rolled into your OS |
22:36
🔗
|
ersi |
I think you're undervaluing a central brand |
22:37
🔗
|
Sanqui |
regardless, we probably define 'significant' in different ways. |
22:37
🔗
|
ersi |
Maybe so! |
22:37
🔗
|
Sanqui |
even if it became to Twitter what Duck Duck Go is to Google, it'd be significant in my eyes |
22:37
🔗
|
icedice |
They would need to market a site - Quitter - instead of the software - GNU Social |
22:37
🔗
|
ersi |
I mean, best case for GNU Social would be to herd all the techies over from G+ |
22:38
🔗
|
icedice |
Are techies using Google+? |
22:38
🔗
|
ersi |
Sanqui: that would be pretty significant. DDG isn't small |
22:38
🔗
|
Sanqui |
it isn't |
22:38
🔗
|
ersi |
There's plenty of FOSS people using Google+ - posting updates and conversating |
22:38
🔗
|
icedice |
I thought most people dumped it after hostile YouTube take-over |
22:39
🔗
|
icedice |
and I assumed that the techies wouldn't like Google's privacy approach any more than Facebook's |
22:40
🔗
|
ersi |
Seems like they/some do |
22:40
🔗
|
icedice |
since both approaches are "collect all data that we can and sell it to the advertisers" |
22:45
🔗
|
ersi |
I would guess some take Google for being the lesser evil or just don't care |
22:56
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
23:00
🔗
|
|
DFJustin has quit IRC (Read error: Connection reset by peer) |
23:00
🔗
|
|
dashcloud has joined #archiveteam-bs |
23:00
🔗
|
|
alard has quit IRC (Read error: No route to host) |
23:02
🔗
|
|
DFJustin has joined #archiveteam-bs |
23:02
🔗
|
|
swebb sets mode: +o DFJustin |
23:19
🔗
|
icedice |
the latter I would guess |
23:26
🔗
|
|
Swizzle has joined #archiveteam-bs |
23:35
🔗
|
arkiver |
I'm going to work on fotolog soon |
23:35
🔗
|
arkiver |
Website isn't slow, so I hope we can fully get fotolog |
23:36
🔗
|
|
RichardG has quit IRC (Read error: Connection reset by peer) |
23:37
🔗
|
godane |
i may need some help with this: http://www.oldgamemags.com/ |
23:40
🔗
|
|
RichardG has joined #archiveteam-bs |