Time |
Nickname |
Message |
00:02
π
|
|
tfgbd_znc has joined #archiveteam |
00:03
π
|
|
tfgbd_znc has quit IRC (Client Quit) |
00:05
π
|
|
tfgbd_znc has joined #archiveteam |
00:05
π
|
|
jmtd has joined #archiveteam |
00:07
π
|
|
tomwsmf-a has quit IRC (Ping timeout: 258 seconds) |
00:08
π
|
|
tomwsmf-a has joined #archiveteam |
00:14
π
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
00:15
π
|
|
tfgbd_znc has quit IRC (Ping timeout: 633 seconds) |
00:18
π
|
|
tomwsmf-a has quit IRC (Ping timeout: 258 seconds) |
00:19
π
|
|
tomwsmf-a has joined #archiveteam |
00:21
π
|
|
BartoCH has joined #archiveteam |
00:26
π
|
|
tomwsmf-a has quit IRC (Ping timeout: 258 seconds) |
00:47
π
|
|
BlueMaxim has joined #archiveteam |
00:50
π
|
|
nickname_ has joined #archiveteam |
00:50
π
|
nickname_ |
zippcast is shutting down |
01:00
π
|
|
nickname_ has quit IRC (Read error: Connection reset by peer) |
01:08
π
|
|
schbirid has quit IRC (Ping timeout: 258 seconds) |
01:12
π
|
r3c0d3x |
we know |
01:12
π
|
r3c0d3x |
(he left, oops, my bad) |
01:20
π
|
|
schbirid has joined #archiveteam |
01:25
π
|
|
Pudsey has joined #archiveteam |
01:27
π
|
Pudsey |
The wayback machine isn't letting me access the blip archive due to robots.txt. Is this temporary? |
01:35
π
|
|
tfgbd_znc has joined #archiveteam |
01:36
π
|
|
Pudsey has quit IRC (Remote host closed the connection) |
02:07
π
|
MrRadar |
I e-mailed both the IA (to let them know their robots.txt parser is broken in another way) and Maker Studios (to tell them to fix the blip.tv domain) so hopefully one of them will get that working again |
02:08
π
|
joepie91 |
lol |
02:09
π
|
MrRadar |
It's weird, I was definitely able to access blip.tv through the IA last week |
02:09
π
|
MrRadar |
And the robots.txt hasn't changed at all since then (same Cloudflare error page) |
02:35
π
|
MrRadar |
I'm getting a lot of 403s on Zippcast. Is anyone else seeing that? |
02:38
π
|
MrRadar |
Looks like it's just on .mp4 files |
03:11
π
|
|
Start has quit IRC (Ping timeout: 260 seconds) |
03:19
π
|
|
Start has joined #archiveteam |
03:38
π
|
|
tomwsmf-a has joined #archiveteam |
03:57
π
|
|
bwn has quit IRC (Read error: Operation timed out) |
04:05
π
|
|
tomwsmf-a has quit IRC (Read error: Operation timed out) |
04:07
π
|
|
tfgbd_znc has quit IRC (Ping timeout: 633 seconds) |
04:20
π
|
|
Sk1d has joined #archiveteam |
04:20
π
|
|
Sk1d has quit IRC (Connection closed) |
04:22
π
|
|
Stilett0 has quit IRC (Read error: Connection reset by peer) |
04:28
π
|
|
tfgbd_znc has joined #archiveteam |
04:51
π
|
|
tfgbd_znc has quit IRC (Read error: Connection reset by peer) |
05:14
π
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
05:17
π
|
|
dashcloud has joined #archiveteam |
05:18
π
|
|
Stiletto has joined #archiveteam |
05:19
π
|
|
Stiletto is now known as Stilett0 |
05:19
π
|
|
Stilett0 has quit IRC (Client Quit) |
05:21
π
|
|
Stiletto has joined #archiveteam |
05:50
π
|
|
JesseW has joined #archiveteam |
06:42
π
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
06:45
π
|
|
dashcloud has joined #archiveteam |
06:47
π
|
|
vitzli has joined #archiveteam |
06:52
π
|
|
ndiddy has quit IRC (Read error: Connection reset by peer) |
06:53
π
|
|
ndiddy has joined #archiveteam |
06:55
π
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
07:03
π
|
|
bwn has joined #archiveteam |
07:37
π
|
|
Fake-Name has quit IRC (Read error: Operation timed out) |
08:00
π
|
|
Fake-Name has joined #archiveteam |
08:12
π
|
|
WinterFox has joined #archiveteam |
08:21
π
|
|
dan- has quit IRC (Ping timeout: 260 seconds) |
08:22
π
|
|
purplebot has joined #archiveteam |
08:25
π
|
|
Sk1d has joined #archiveteam |
08:29
π
|
|
Sk1d has quit IRC (Ping timeout: 194 seconds) |
08:34
π
|
|
dan- has joined #archiveteam |
08:41
π
|
purplebot |
10,99PurpleSymphony edited 6,99University Web Hosting (3,99+384) just now: 7,99Add Bielefeld University -- http://www.archiveteam.org/?diff=25712&oldid=25546 |
08:46
π
|
|
dan- has quit IRC (Ping timeout: 260 seconds) |
08:47
π
|
xmc |
wow, that is un read a ble |
08:47
π
|
arkiver |
oh nie |
08:47
π
|
arkiver |
nice |
08:47
π
|
arkiver |
not sure though if it should be #archiveteam |
08:47
π
|
xmc |
it should not |
08:47
π
|
xmc |
-bs, maybe |
08:47
π
|
arkiver |
yeah |
08:48
π
|
PurpleSym |
Sure, I can move it. |
08:48
π
|
xmc |
ok |
08:48
π
|
schbirid |
nice |
08:48
π
|
xmc |
we'll see how it pans out |
08:48
π
|
xmc |
could be cool, could become obnoxious |
08:48
π
|
|
purplebot has left |
09:01
π
|
|
dan- has joined #archiveteam |
09:08
π
|
|
dan- has quit IRC (Ping timeout: 260 seconds) |
09:15
π
|
|
VADemon has joined #archiveteam |
09:16
π
|
HCross2 |
PurpleSym: could you at least remove the highlighting? |
09:17
π
|
PurpleSym |
HCross2: Highlighting as in color? Thatβs already gone. -> #-bs |
09:24
π
|
HCross |
PurpleSym, saw it sorry |
09:29
π
|
|
dan- has joined #archiveteam |
09:45
π
|
signius |
Hmnmm |
09:46
π
|
signius |
WARNING: 'autoheader' is missing on your system |
09:46
π
|
signius |
i have installed autoconf & i am still getting this error with the zippcast-grab |
09:48
π
|
HCross |
have you got flex installed? |
09:48
π
|
signius |
i just did an apt-cache search autoheader & got |
09:49
π
|
signius |
autoconf2.13 - automatic configure script builder (obsolete version) |
09:49
π
|
Meroje |
isn't it in build-utils ? |
09:51
π
|
signius |
I am seeing solution saying need autoreconf so trying that now to see if that resolves the errors |
09:51
π
|
signius |
this is happening on 3 different Debian boxes currently |
09:51
π
|
Meroje |
ah yes it may be bundled with autoconf |
09:52
π
|
signius |
ok the original error has gone but wget-lau still not building succesfully |
09:52
π
|
signius |
these 3 boxes have all been running other scripts previously for months btw |
09:53
π
|
signius |
https://paste.fedoraproject.org/374325/33981146/ |
09:54
π
|
signius |
2 of the boxes are running the script regardless of the lau error but 1 is refusing to execute |
09:56
π
|
signius |
getting this when i try and run the script on the box thats failing completely |
09:56
π
|
signius |
https://paste.fedoraproject.org/374326/14650341/ |
09:59
π
|
|
Tomcat_ has joined #archiveteam |
10:02
π
|
Meroje |
remove the zippcast-grab folder and try again |
10:03
π
|
signius |
Meroje, yeah i cannot do the "mv src/wget ../wget-lua" as src/wget doesnt seem to be getting created |
10:05
π
|
signius |
deleted & started again still the same :? |
10:05
π
|
signius |
https://paste.fedoraproject.org/374328/46503474/ |
10:08
π
|
signius |
ok sorted |
10:09
π
|
signius |
HCross, You are welcome to come & slap me with a wet fish :) you was on the money originally with "flex" :) |
10:11
π
|
Meroje |
signius also when debugging build issues of any software, the configure output is more interesting, so try to make a paste of everything |
10:13
π
|
signius |
Meroje, I will accepts the more detailed output of the complete config would have the exact issue in it, but 9 times out of 10 just the errors at the end point to the issue i find |
10:14
π
|
signius |
but the autoheader error really wasnt very helpful in this case :-/ |
10:35
π
|
|
pintoch has left WeeChat 1.0.1 |
10:43
π
|
|
HCross has quit IRC (Ping timeout: 370 seconds) |
10:46
π
|
|
hawc145 has joined #archiveteam |
10:53
π
|
|
hawc145 is now known as HCross |
10:57
π
|
|
schbirid has quit IRC (Quit: Leaving) |
11:30
π
|
|
schbirid has joined #archiveteam |
11:31
π
|
|
ris has joined #archiveteam |
11:32
π
|
ris |
http://www.bbc.co.uk/news/uk-36308976 |
12:19
π
|
|
Morbus has joined #archiveteam |
12:40
π
|
|
tfgbd_znc has joined #archiveteam |
12:49
π
|
|
ris has quit IRC () |
12:54
π
|
|
Tomcat_ has quit IRC (Remote host closed the connection) |
13:46
π
|
|
mismatch has joined #archiveteam |
14:00
π
|
|
rdtsc has joined #archiveteam |
14:00
π
|
rdtsc |
hi |
14:01
π
|
rdtsc |
a huge free french hoster is gonna close in the next days: https://www.olympe.in/end?lang=EN |
14:01
π
|
rdtsc |
if you're not aware |
14:02
π
|
rdtsc |
"our services will be disrupted as soon as this evening" https://twitter.com/OlympeNet/status/739083198089580544 |
14:07
π
|
zino |
Any idea on how to index that? |
14:07
π
|
arkiver |
Do you any idea of when they might shut down? |
14:08
π
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
14:09
π
|
|
ris has joined #archiveteam |
14:09
π
|
HCross |
Is this the week of things shutting down? |
14:09
π
|
arkiver |
or are they actually shutting down this evening? |
14:09
π
|
arkiver |
HCross: :P |
14:09
π
|
rdtsc |
they made a crowdfunding campaign since a few months ago, lately (from ~may 20) they prompted their users to save their data but weren't sure about their outcome and since june 1 they seem to be clear about it |
14:09
π
|
rdtsc |
and now they are talking about shutting down this evening |
14:10
π
|
rdtsc |
(it's 16:00 now in france) |
14:10
π
|
HCross |
Twitter says: "The services will be interrupted tonight" |
14:10
π
|
rdtsc |
but maybe we can talk to them on they're IRC |
14:10
π
|
zino |
Google translate says "CLOSING OLYMPE. The services will be interrupted tonight." |
14:11
π
|
rdtsc |
it's an associative hoster active since quite a long time, i think they are the largest after free.fr personal pages |
14:11
π
|
rdtsc |
s/they're IRC/their IRC/ |
14:12
π
|
zino |
According to their blog they should have ~90k accounts. |
14:13
π
|
zino |
Most of them are probably tiny, so if we can index it, big parts should be pretty quick to save if their servers can handle it. |
14:14
π
|
HCross |
There is https://www.olympe.in/directory but I cant get it to load |
14:15
π
|
zino |
arkiver: Can you reach out to their Twitter/IRC/email? |
14:15
π
|
rdtsc |
there seem to be only three idling people and all away on their Freenode IRC, but they're active on twitter |
14:15
π
|
HCross |
Which channel on Freenode? |
14:15
π
|
rdtsc |
#olympe |
14:16
π
|
HCross |
Just tempted to go in like "Hi, we want to archive your shit" |
14:16
π
|
Nemo_bis |
"web hosting, focusing on quality and availability" |
14:17
π
|
zino |
Damn, https://web.archive.org/web/20160329180825/https://www.olympe.in/directory tries to dynamically load the list. :-/ |
14:18
π
|
rdtsc |
just of curiosity, would you quality 90k as rather medium, large or small compared to what you are commonly seeing? |
14:18
π
|
rdtsc |
*out of |
14:20
π
|
zino |
I'd say medium to small, but I'm not involved in most projects, so don't take that as authorative. |
14:20
π
|
PurpleSym |
Directory works for me. |
14:20
π
|
zino |
Dump it. |
14:20
π
|
PurpleSym |
On it. |
14:26
π
|
zino |
On the positive side they have an export function in their user interface, even one for databases. On the negative side most of their users won't get the message in time to use it: https://www.olympe.in/doc/backups |
14:27
π
|
PurpleSym |
From directory/categories: https://6xq.net/paste/olympe.in-discovery.txt |
14:28
π
|
rdtsc |
I pinged them in french on twitter |
14:28
π
|
zino |
Great! 4.1k sites then. |
14:29
π
|
zino |
The manual says that it's an option to show up in the directory. It doesn't say if it's default on or off though. |
14:31
π
|
zino |
Need to run to do other things. I'll start some manual personal dumps of those for now. Hoping archiver will get a real script on it later. |
14:32
π
|
PurpleSym |
Discovered a few more through directory/search. Same URL. |
14:34
π
|
arkiver |
Thanks PurpleSym |
14:34
π
|
arkiver |
rdtsc: please let me know if you get a response from them |
14:36
π
|
Sanqui |
is it necessary to scrape google for domains? |
14:36
π
|
arkiver |
might be good to do |
14:36
π
|
arkiver |
not sure if PurpleSym is already doing that though |
14:37
π
|
Sanqui |
we may need to employ... something... like... crap, our wiki is having problems |
14:37
π
|
Sanqui |
got a 504 a few times |
14:38
π
|
PurpleSym |
I can do bing. |
14:38
π
|
rdtsc |
a 508 "Resource Limit Is Reached" too |
14:38
π
|
arkiver |
rdtsc: what exactly did you ask them and did you get any reply? |
14:38
π
|
Sanqui |
sorry, what I got was probably 508 |
14:38
π
|
arkiver |
in IRC |
14:40
π
|
rdtsc |
ariscop: "Hello, the archiveteam.org project may be interested in retrieving part of Olymbe before the shutdown. [1/2]" "[2/2] Would it be possible to come talking about it on IRC, #archiveteam @ irc.efnet.org? (webchat: chat.efnet.org)" |
14:40
π
|
rdtsc |
*arkiver: sorry |
14:41
π
|
rdtsc |
arkiver: asked 16:27 UTC+2, no reply yet |
14:41
π
|
arkiver |
thank you |
14:42
π
|
PurpleSym |
Added another 345 results from bing. |
14:54
π
|
arkiver |
awesome |
14:57
π
|
arkiver |
so. |
14:58
π
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
14:58
π
|
arkiver |
If we can't access the zippcast after the shutdown we'll only be able to archive a small percentage of all the videos. The website is simply not able to server everything before the shutdown. |
14:58
π
|
arkiver |
I sent a mail to the mailadresses of ZippCast. |
14:59
π
|
HCross |
yeah, im hitting it as hard as I can, and most of them idling |
15:00
π
|
HCross |
its upload that is also being an issue |
15:00
π
|
Rotab |
zippcast seems reaaaally slow |
15:01
π
|
|
dashcloud has joined #archiveteam |
15:01
π
|
arkiver |
that's because of us. I hope we'll be able to access the site after the shutdown |
15:01
π
|
arkiver |
Like with arto at the moment |
15:04
π
|
|
WinterFox has quit IRC (Remote host closed the connection) |
15:05
π
|
|
vitzli has quit IRC (Quit: Leaving) |
15:14
π
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
15:15
π
|
|
dashcloud has joined #archiveteam |
15:39
π
|
|
ris has quit IRC () |
16:21
π
|
|
ris has joined #archiveteam |
16:26
π
|
|
_ris has joined #archiveteam |
16:29
π
|
|
ris has quit IRC (Ping timeout: 250 seconds) |
16:44
π
|
|
JesseW has joined #archiveteam |
17:01
π
|
|
JesseW has quit IRC (Read error: Operation timed out) |
17:38
π
|
|
Fake-Name has quit IRC (Read error: Operation timed out) |
17:39
π
|
Medowar |
anything I can throw at my server? zippcast is running on rate-limit and I am banned from arto. Wiki currently doesnt compile. |
17:40
π
|
Medowar |
Also, I just recieved an Abuse complain from caltech.edu for scanning for ftp-servers |
17:48
π
|
HCross |
reply: "fuck you, stop thinking about deleting things" |
17:49
π
|
Medowar |
nah, the complaint was even more stupid: 37.120.165.49 was observed probing caltech.edu for security holes. It |
17:49
π
|
Medowar |
has been blocked at our border routers. It may be compromised. |
17:53
π
|
HCross |
lmao |
17:54
π
|
xmc |
ha |
17:58
π
|
Medowar |
How to Hack caltech: Step 1: zmap -p 21 -o - -v 5 | gzip > ftpsites.gz; Step 2: done |
17:58
π
|
xmc |
haaaack |
18:03
π
|
|
mutoso has joined #archiveteam |
18:06
π
|
|
JesseW has joined #archiveteam |
18:17
π
|
|
jason1089 has joined #archiveteam |
18:17
π
|
jason1089 |
hey guys Im having trouble connecting 8001 |
18:20
π
|
MrRadar |
Connecting to what? |
18:23
π
|
|
j08nY has joined #archiveteam |
18:24
π
|
j08nY |
Hey everybody, I'm wondering if there's an easy way of running archive-teams scripts outside of Warrior? |
18:25
π
|
j08nY |
Since I have a spare Raspberry Pi(no x86 Virtualbox or similiar) with lots of bandwith and an external drive, I would like to use. |
18:25
π
|
JesseW |
j08nY: The "easy" way is inside the Warrior -- but running them separately is widely done, yes. |
18:27
π
|
j08nY |
Well, is there any tutorial for running them outside of it? |
18:27
π
|
jason1089 |
localhost using the warrior |
18:28
π
|
Rotab |
are you sure you didnt bridge it? |
18:29
π
|
Rotab |
j08nY: https://github.com/ArchiveTeam |
18:29
π
|
jason1089 |
as of bridge it with my local network |
18:30
π
|
j08nY |
Rotab: Just found it. Thanks! |
18:30
π
|
|
j08nY has quit IRC (Quit: leaving) |
18:31
π
|
Rotab |
jason1089: if its bridged it'll get its own ip |
18:32
π
|
jason1089 |
ok im gonna fix that |
18:34
π
|
jason1089 |
quick question what is happening with rutracker. I didnt saw as a project lately |
18:39
π
|
|
jason1089 has quit IRC (Quit: http://chat.efnet.org ) |
18:48
π
|
|
philpem has quit IRC (Read error: Connection reset by peer) |
19:12
π
|
|
thefinn93 has quit IRC (Read error: Operation timed out) |
19:12
π
|
|
gibigian1 has quit IRC (Read error: Operation timed out) |
19:12
π
|
|
SirCmpwn has quit IRC (Read error: Operation timed out) |
19:12
π
|
|
balrog has quit IRC (Read error: Operation timed out) |
19:12
π
|
|
gibigiana has joined #archiveteam |
19:12
π
|
|
Mayonaise has quit IRC (Read error: Operation timed out) |
19:12
π
|
|
Gfy has quit IRC (Read error: Operation timed out) |
19:12
π
|
|
marvinw has quit IRC (Read error: Operation timed out) |
19:12
π
|
|
marvinw has joined #archiveteam |
19:12
π
|
|
sivoais_ has quit IRC (Read error: Operation timed out) |
19:12
π
|
|
sivoais has joined #archiveteam |
19:12
π
|
|
balrog has joined #archiveteam |
19:12
π
|
|
swebb sets mode: +o balrog |
19:13
π
|
|
mr-b has quit IRC (Read error: Operation timed out) |
19:13
π
|
|
aMunster has quit IRC (Read error: Operation timed out) |
19:13
π
|
|
Coderjoe has quit IRC (Read error: Operation timed out) |
19:13
π
|
|
phuzion has quit IRC (Read error: Operation timed out) |
19:13
π
|
|
MMovie has quit IRC (Read error: Operation timed out) |
19:13
π
|
|
mhazinsk has quit IRC (Read error: Operation timed out) |
19:13
π
|
|
beardicus has quit IRC (Read error: Operation timed out) |
19:13
π
|
|
Coderjoe has joined #archiveteam |
19:13
π
|
|
ats has quit IRC (Read error: Operation timed out) |
19:13
π
|
|
bwn has quit IRC (Read error: Operation timed out) |
19:14
π
|
|
skrp has quit IRC (Read error: Operation timed out) |
19:14
π
|
|
_ris has quit IRC (Read error: Operation timed out) |
19:14
π
|
|
remsen has quit IRC (Read error: Operation timed out) |
19:14
π
|
|
SirCmpwn has joined #archiveteam |
19:14
π
|
|
_ris has joined #archiveteam |
19:14
π
|
|
Gfy has joined #archiveteam |
19:14
π
|
|
RKenshin has joined #archiveteam |
19:15
π
|
|
Kenshin has quit IRC (Read error: Operation timed out) |
19:15
π
|
|
RKenshin is now known as Kenshin |
19:15
π
|
|
dxrt has quit IRC (Read error: Operation timed out) |
19:16
π
|
|
_acridAxd has joined #archiveteam |
19:16
π
|
|
acridAxid has quit IRC (Read error: Operation timed out) |
19:16
π
|
|
will has quit IRC (Ping timeout: 633 seconds) |
19:16
π
|
|
_acridAxd is now known as acridAxid |
19:17
π
|
|
mistym- has joined #archiveteam |
19:17
π
|
|
swebb sets mode: +o mistym- |
19:18
π
|
|
mistym has quit IRC (Ping timeout: 633 seconds) |
19:18
π
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
19:18
π
|
|
tfgbd_znc has quit IRC (Read error: Operation timed out) |
19:20
π
|
|
nwf_ has quit IRC (Read error: Operation timed out) |
19:20
π
|
|
remsen has joined #archiveteam |
19:20
π
|
|
mr-b has joined #archiveteam |
19:20
π
|
|
dxrt has joined #archiveteam |
19:21
π
|
|
Mayonaise has joined #archiveteam |
19:21
π
|
|
aMunster has joined #archiveteam |
19:22
π
|
|
skrp has joined #archiveteam |
19:22
π
|
|
mhazinsk has joined #archiveteam |
19:22
π
|
|
beardicus has joined #archiveteam |
19:22
π
|
|
swebb sets mode: +o beardicus |
19:22
π
|
|
bwn has joined #archiveteam |
19:22
π
|
|
nwf_ has joined #archiveteam |
19:22
π
|
|
thefinn93 has joined #archiveteam |
19:22
π
|
|
phuzion has joined #archiveteam |
19:24
π
|
|
tfgbd_znc has joined #archiveteam |
19:24
π
|
|
will has joined #archiveteam |
19:27
π
|
|
Mayonaise has quit IRC (Read error: Operation timed out) |
19:28
π
|
|
phuzion has quit IRC (Read error: Operation timed out) |
19:28
π
|
|
aMunster has quit IRC (Read error: Operation timed out) |
19:28
π
|
|
beardicus has quit IRC (Read error: Operation timed out) |
19:28
π
|
|
mhazinsk has quit IRC (Read error: Operation timed out) |
19:29
π
|
|
MMovie has joined #archiveteam |
19:29
π
|
|
phuzion has joined #archiveteam |
19:29
π
|
|
aMunster has joined #archiveteam |
19:29
π
|
|
mhazinsk has joined #archiveteam |
19:29
π
|
|
Mayonaise has joined #archiveteam |
19:30
π
|
|
beardicus has joined #archiveteam |
19:30
π
|
|
swebb sets mode: +o beardicus |
19:30
π
|
|
pluesch has joined #archiveteam |
19:36
π
|
|
JesseW has joined #archiveteam |
19:42
π
|
|
ats has joined #archiveteam |
19:55
π
|
|
rossdylan has joined #archiveteam |
19:56
π
|
|
tomwsmf-a has joined #archiveteam |
20:03
π
|
|
schbirid has quit IRC (Quit: Leaving) |
20:05
π
|
|
JesseW has quit IRC (Ping timeout: 272 seconds) |
20:21
π
|
ErkDog |
so this newest version of wget-lua won't compile on ANY of my systems |
20:21
π
|
ErkDog |
CentOS 6, 7, debian nothing |
20:21
π
|
|
JesseW has joined #archiveteam |
20:21
π
|
ErkDog |
I have older versions on some boxes that I've just cp over, but two boxes I don't have any wget-lua from the older version |
20:22
π
|
MrRadar |
ErkDog: You need to have GNU autoconf and flex installed |
20:22
π
|
MrRadar |
At least that's all I needed on my Debian boxes |
20:22
π
|
MrRadar |
Beyond what the old version needed to build |
20:23
π
|
ErkDog |
ahhh flex |
20:23
π
|
ErkDog |
yeah the error I'm getting is "css" file doesn't exist |
20:23
π
|
|
BartoCH has quit IRC (Read error: Connection reset by peer) |
20:24
π
|
MrRadar |
Yeah, it's annoying that the configure script doesn't throw an error if it's not available |
20:24
π
|
ErkDog |
well on my CentOS 6.8 boxes I have both and it won't compile |
20:24
π
|
ErkDog |
same error, lol |
20:24
π
|
|
VADemon has quit IRC (Quit: left4dead) |
20:24
π
|
ErkDog |
Package flex-2.5.35-9.el6.x86_64 already installed and latest version |
20:24
π
|
ErkDog |
Package autoconf-2.63-5.1.el6.noarch already installed and latest version |
20:25
π
|
MrRadar |
I have flex 2.5.39 and autoconf 2.69-8 |
20:26
π
|
MrRadar |
Could you pastebin the css.c error? |
20:26
π
|
ErkDog |
ohhhh wait |
20:26
π
|
ErkDog |
different errors on this box |
20:26
π
|
ErkDog |
now it says no LUA |
20:26
π
|
ErkDog |
http://puu.sh/pgNLr/31f64b2ce5.png |
20:27
π
|
|
VADemon has joined #archiveteam |
20:27
π
|
yipdw |
then you probably need to install the Lua development libraries |
20:27
π
|
ErkDog |
ohhhh duh, thanks, also flex fixed debian |
20:27
π
|
yipdw |
https://github.com/ArchiveTeam/standalone-readme-template |
20:28
π
|
yipdw |
we have instructions |
20:28
π
|
ErkDog |
yipdw I know |
20:28
π
|
ErkDog |
and I follow them |
20:28
π
|
ErkDog |
but for example |
20:28
π
|
ErkDog |
the debian instructions say nothing about installing flex |
20:28
π
|
ErkDog |
so how would I know to do that? |
20:28
π
|
MrRadar |
The autoconf and flex dependencies are new |
20:28
π
|
MrRadar |
With the new wget-lua version |
20:29
π
|
ErkDog |
if you update readme-template does it update the instructions on all the things? |
20:30
π
|
yipdw |
no |
20:30
π
|
yipdw |
and it covers just the warrior script anyway, not wget-lua |
20:30
π
|
yipdw |
get-wget-lua.sh may need to be updated |
20:31
π
|
ErkDog |
ARG, well now it's saying autoconf is too old |
20:31
π
|
ErkDog |
http://puu.sh/pgO2c/a5ea066051.png |
20:32
π
|
yipdw |
take this to #warrior |
20:33
π
|
|
BartoCH has joined #archiveteam |
20:39
π
|
|
Aranje has joined #archiveteam |
21:06
π
|
|
bwn has quit IRC (Ping timeout: 244 seconds) |
21:07
π
|
|
Tomcat_ has joined #archiveteam |
21:18
π
|
|
bwn has joined #archiveteam |
21:19
π
|
zino |
Back. Did we get something started for olympe.in? |
21:25
π
|
ndiddy |
that site's shutting down? |
21:25
π
|
|
Tomcat_ has quit IRC (Remote host closed the connection) |
21:25
π
|
zino |
Yea |
21:26
π
|
zino |
Well, free site hoster |
21:27
π
|
ndiddy |
i wonder how 000webhost has lasted this long |
21:28
π
|
ndiddy |
besides arbitrarily deleting popular sites |
21:35
π
|
|
Fake-Name has joined #archiveteam |
21:35
π
|
|
bwn has quit IRC (Read error: Connection reset by peer) |
21:48
π
|
|
BubuAnabe has joined #archiveteam |
21:49
π
|
|
BubuAnabe has quit IRC (Client Quit) |
22:12
π
|
|
bwn has joined #archiveteam |
22:33
π
|
|
ZiNC has joined #archiveteam |
22:33
π
|
ZiNC |
Hey. |
22:34
π
|
ZiNC |
I know "you're not archive.org", but because of the similarity in goals I'm thinking maybe someone knows... |
22:36
π
|
ZiNC |
Any idea how to search web.archive.org, or properly browse forum archives that seem to have a lot of dead/unarchived bits due to session ids in the URL (among other changing URL parameters), or another way to browse difficult-to-browse forum archives? |
22:36
π
|
ZiNC |
:) |
22:37
π
|
JesseW |
ZiNC: If it was grabbed by us, you may be able to download it in full, and search it locally. |
22:37
π
|
ZiNC |
Grabbed directly, or from archive.org? |
22:38
π
|
JesseW |
grabbed by us and stored on archive.org |
22:39
π
|
ZiNC |
I guess that's unlikely because it's not a special site. |
22:41
π
|
JesseW |
we grab a lot through archivebot |
22:41
π
|
MrRadar |
Put the domain name into the search box on http://archive.fart.website/archivebot/viewer/ |
22:42
π
|
MrRadar |
That will tell you if it was grabbed by us through Archivebot |
22:44
π
|
ZiNC |
No, not there. Though a related domain is, surprisingly. |
22:44
π
|
ZiNC |
Are these crawls initiated by random users? |
22:44
π
|
MrRadar |
Yes |
22:45
π
|
MrRadar |
Basically, if someone finds it interesting or valuable enough they can queue it for archival it in #archivebot |
22:46
π
|
ZiNC |
Is it possible/a good idea to try to crawl web.archive.org? |
22:46
π
|
ZiNC |
For a specific domain/subpages. |
22:46
π
|
MrRadar |
No, we don't archive the archive ;) |
22:46
π
|
ZiNC |
Plain URL structure is a problem there, |
22:47
π
|
ZiNC |
but maybe there are meta ways. |
22:48
π
|
MrRadar |
If you just want a listing of URLs you can query the Wayback CDX Server: https://github.com/internetarchive/wayback/tree/master/wayback-cdx-server |
22:48
π
|
ZiNC |
I guess the interwebz might implode if you plainly archive the archives recursively. |
22:48
π
|
MrRadar |
We do have a project to backup public files on the Internet Archive, though the source scrapes for the Wayback Machine are not included (since they are not public, for the most part) |
22:49
π
|
ZiNC |
Source as in the HTMLs linking to these files? |
22:49
π
|
MrRadar |
The "source" as in the raw web archive data |
22:50
π
|
ZiNC |
BTW, how do crawlers/archivers cleanup URLs, if at all, to ignore meaningless or reordered query parameters? |
22:51
π
|
MrRadar |
Archivebot has some logic to deal with stipping out the more common session ID parameters (and people can ignore URLs matching arbitrary regexes) and for dedicated projects we may use special-case logic if it's a problem but for the most part we just take the URLs as-is |
22:52
π
|
Frogging |
interwebs imploding sounds fun, but more likely it'd just waste a lot of bandwidth and disk space in an un-exciting manner |
22:52
π
|
Frogging |
:p |
22:54
π
|
ZiNC |
So... continue in #-bs? |
22:54
π
|
MrRadar |
Yes |
22:54
π
|
ZiNC |
Right. |
23:09
π
|
arkiver |
Nothing has started yet for olympe.in. Tomorrow I'll have a project started for it |
23:33
π
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
23:44
π
|
|
bwn has quit IRC (Read error: Operation timed out) |
23:46
π
|
|
aMunster has quit IRC (Read error: Operation timed out) |
23:46
π
|
|
rossdylan has quit IRC (Read error: Operation timed out) |
23:46
π
|
|
MMovie has quit IRC (Read error: Operation timed out) |
23:46
π
|
|
beardicus has quit IRC (Read error: Operation timed out) |
23:48
π
|
|
tfgbd_znc has quit IRC (Ping timeout: 633 seconds) |
23:48
π
|
|
mhazinsk has quit IRC (Write error: Broken pipe) |
23:49
π
|
|
phuzion has quit IRC (Read error: Connection reset by peer) |
23:49
π
|
|
mhazinsk has joined #archiveteam |
23:49
π
|
|
tfgbd_znc has joined #archiveteam |
23:50
π
|
|
phuzion has joined #archiveteam |
23:50
π
|
|
beardicus has joined #archiveteam |
23:50
π
|
|
swebb sets mode: +o beardicus |
23:51
π
|
|
aMunster has joined #archiveteam |
23:52
π
|
|
bwn has joined #archiveteam |
23:56
π
|
|
MMovie has joined #archiveteam |
23:57
π
|
|
WinterFox has joined #archiveteam |