Time |
Nickname |
Message |
00:17
🔗
|
|
RichardG has quit IRC (Ping timeout: 260 seconds) |
00:20
🔗
|
|
metalcamp has quit IRC (Ping timeout: 244 seconds) |
00:44
🔗
|
|
hawc145 has joined #archiveteam-bs |
00:45
🔗
|
|
HCross has quit IRC (Ping timeout: 246 seconds) |
01:10
🔗
|
|
JesseW has quit IRC (Quit: Leaving.) |
01:59
🔗
|
|
BnA-Rob1n has quit IRC (Ping timeout: 260 seconds) |
02:00
🔗
|
|
BnA-Rob1n has joined #archiveteam-bs |
02:32
🔗
|
|
Start has joined #archiveteam-bs |
03:39
🔗
|
|
RichardG has joined #archiveteam-bs |
03:55
🔗
|
|
JesseW has joined #archiveteam-bs |
04:05
🔗
|
|
bwn has quit IRC (Ping timeout: 492 seconds) |
04:25
🔗
|
|
bwn has joined #archiveteam-bs |
04:56
🔗
|
|
Coderjoe has quit IRC (Read error: Operation timed out) |
05:09
🔗
|
|
Coderjoe has joined #archiveteam-bs |
05:58
🔗
|
|
Sk1d has quit IRC (Ping timeout: 194 seconds) |
06:03
🔗
|
|
Sk1d has joined #archiveteam-bs |
06:18
🔗
|
|
DFJustin has quit IRC (Remote host closed the connection) |
06:29
🔗
|
|
DFJustin has joined #archiveteam-bs |
06:29
🔗
|
|
swebb sets mode: +o DFJustin |
06:56
🔗
|
|
wp494 has quit IRC (Read error: Connection reset by peer) |
07:05
🔗
|
|
wp494 has joined #archiveteam-bs |
07:08
🔗
|
|
JesseW has quit IRC (Quit: Leaving.) |
07:09
🔗
|
|
bwn has quit IRC (Read error: Operation timed out) |
07:51
🔗
|
|
ersi has quit IRC (Ping timeout: 258 seconds) |
07:53
🔗
|
|
wp494 has quit IRC (Read error: Connection reset by peer) |
07:58
🔗
|
|
wp494 has joined #archiveteam-bs |
08:11
🔗
|
|
bwn has joined #archiveteam-bs |
08:28
🔗
|
|
ersi has joined #archiveteam-bs |
08:28
🔗
|
|
swebb sets mode: +o ersi |
08:34
🔗
|
|
koon has joined #archiveteam-bs |
10:45
🔗
|
|
schbirid has joined #archiveteam-bs |
11:10
🔗
|
|
metalcamp has joined #archiveteam-bs |
12:08
🔗
|
|
hawc145 is now known as HCross |
13:35
🔗
|
|
vitzli has joined #archiveteam-bs |
15:45
🔗
|
|
Apathy has quit IRC (Quit: OOOOoooooooooo................) |
16:35
🔗
|
|
JesseW has joined #archiveteam-bs |
17:02
🔗
|
|
JesseW has quit IRC (Quit: Leaving.) |
17:14
🔗
|
|
MrRadar_ has joined #archiveteam-bs |
17:18
🔗
|
|
MrRadar has quit IRC (Ping timeout: 370 seconds) |
17:18
🔗
|
|
MrRadar_ is now known as MrRadar |
17:47
🔗
|
|
SN4T14_ has joined #archiveteam-bs |
17:48
🔗
|
|
SN4T14 has quit IRC (Read error: Operation timed out) |
18:15
🔗
|
|
JesseW has joined #archiveteam-bs |
18:32
🔗
|
|
bwn has quit IRC (Ping timeout: 246 seconds) |
18:32
🔗
|
|
SN4T14_ has quit IRC (Remote host closed the connection) |
18:34
🔗
|
|
JSharp___ has quit IRC (Ping timeout: 260 seconds) |
18:35
🔗
|
bsmith093 |
JesseW: how goes the repacking? |
18:38
🔗
|
bsmith093 |
if at all possible could you please include an inventory file, with the contents of each zip or whatever? |
18:41
🔗
|
|
HCross2 has quit IRC (Ping timeout: 260 seconds) |
18:47
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
18:48
🔗
|
JesseW |
bsmith093: what are your thoughts about dividing it up? Did you see my suggestions above? |
18:50
🔗
|
|
HCross2 has joined #archiveteam-bs |
18:51
🔗
|
bsmith093 |
if you're going to go with multiple files, is there a method that will produce standalone chunks? also I cannot stress this enough, please please make a list of what's where, people have been asking me for things, and i hate having to keep this huge monolithic tar file around. |
18:51
🔗
|
bsmith093 |
JesseW: i like your plan, multi chunk the biggest things, then just archive each category. |
18:52
🔗
|
JesseW |
I'll certainly make an index, of course. |
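A minimal sketch of what such an index could look like, assuming one inventory is written per directory tree before it is zipped. The function name `write_inventory` and the output file name in the usage note are hypothetical, not something either participant specified.

```sh
# write_inventory DIR: print every regular file under DIR, one per line,
# as paths relative to DIR, in a stable byte-wise sort order.
# (Hypothetical helper; the log does not say how the index was built.)
write_inventory() (
    cd "$1" || exit 1
    # find prints ./a/b.txt; strip the leading "./" so the list reads cleanly.
    find . -type f | sed 's|^\./||' | LC_ALL=C sort
)
```

It could be used as `write_inventory Fanfiction > Fanfiction.inventory.txt`, so that people asking for a specific story can be pointed at the right archive without unpacking anything.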
18:52
🔗
|
bsmith093 |
as you've noticed, the size drops off sharply. there's just so much of it! |
18:53
🔗
|
JesseW |
I wasn't actually thinking of making separate zip files for each category, but rather separate zip files for each *initial letter* (except for the giant top 3) |
18:53
🔗
|
bsmith093 |
that works too, i think, what would the sizes be like? |
18:54
🔗
|
|
BnA-Rob1n has quit IRC (Ping timeout: 260 seconds) |
18:54
🔗
|
|
Ctrl-S___ has quit IRC (Ping timeout: 260 seconds) |
18:54
🔗
|
|
vitzli has quit IRC (Leaving) |
18:54
🔗
|
|
johtso has quit IRC (Ping timeout: 260 seconds) |
18:54
🔗
|
JesseW |
well, the top 3 are 35, 18 and 16GB respectively. |
18:54
🔗
|
|
BnA-Rob1n has joined #archiveteam-bs |
18:55
🔗
|
JesseW |
I need to write up the script to calculate the other sizes. |
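One way such a size script might be sketched, assuming the per-category sizes come from `du -sk` and are then summed by initial letter. The function name `sum_by_initial` is hypothetical, and the log does not show the script that was actually used.

```sh
# sum_by_initial: read `du -sk`-style lines ("<KiB><TAB><name>") on stdin and
# print "<bucket> <total KiB>" per bucket. Names starting with a capital
# letter A-Z go under that letter; everything else goes under "misc".
# (Hypothetical helper, for illustration only.)
sum_by_initial() {
    awk -F '\t' '
        {
            c = substr($2, 1, 1)
            key = (c != "" && index("ABCDEFGHIJKLMNOPQRSTUVWXYZ", c) > 0) ? c : "misc"
            total[key] += $1
        }
        END { for (k in total) print k, total[k] }
    ' | LC_ALL=C sort
}
```

A possible invocation, with the directory name taken as an assumption: `(cd Fanfiction && du -sk -- *) | sum_by_initial`. The three largest categories would still be counted under their letters unless they are moved out first.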
18:55
🔗
|
|
deathy has quit IRC (Ping timeout: 260 seconds) |
18:55
🔗
|
|
TheKiwi has joined #archiveteam-bs |
18:55
🔗
|
|
HCross2 has quit IRC (Ping timeout: 260 seconds) |
18:55
🔗
|
|
Boltsie has quit IRC (Ping timeout: 260 seconds) |
18:55
🔗
|
|
_desu___ has quit IRC (Ping timeout: 260 seconds) |
18:57
🔗
|
bsmith093 |
i found this online. $ mkdir -p output/{A..Z}; for i in tstdir/*; do export FILE=$(basename "$i"); LTR=$(echo "${FILE:0:1}" | tr '[a-z]' '[A-Z]'); mv "$i" "output/$LTR/$FILE" ; done |
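A more defensive sketch of the same idea, written as a POSIX shell function so it does not depend on bash-only expansions. The name `sort_by_initial` and its two arguments are hypothetical. It creates each letter directory on demand instead of pre-creating A through Z.

```sh
# sort_by_initial SRC DST: move each entry of SRC into DST/<LETTER>/, where
# <LETTER> is the entry's first character, uppercased.
# (Hypothetical helper; runs in a subshell so its variables stay local.)
sort_by_initial() (
    src=$1
    dst=$2
    for path in "$src"/*; do
        [ -e "$path" ] || continue      # SRC was empty: skip the unexpanded pattern
        name=${path##*/}                # strip the leading directory part
        letter=$(printf '%s' "$name" | cut -c1 | tr '[:lower:]' '[:upper:]')
        mkdir -p -- "$dst/$letter"
        mv -- "$path" "$dst/$letter/$name"
    done
)
```

For example, `sort_by_initial tstdir output` mirrors the pasted command. Anything that should stay out of the letter folders, such as the three biggest categories, would need to be moved aside beforehand.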
18:57
🔗
|
bsmith093 |
just move the 3 biggest out first |
18:57
🔗
|
JesseW |
yeah, that should probably work |
18:57
🔗
|
JesseW |
I need to fix the Fanfiction/Fanfiction bit too |
18:57
🔗
|
bsmith093 |
can't code for crap, but i can tweak. |
18:57
🔗
|
bsmith093 |
just make that its own blob. |
18:58
🔗
|
|
HCross2 has joined #archiveteam-bs |
18:58
🔗
|
bsmith093 |
how big is that extra folder? |
18:58
🔗
|
|
bwn has joined #archiveteam-bs |
19:00
🔗
|
JesseW |
well, what I was planning to do was copy the 19 files with different versions over to the main one (with the older versions given a special extension, .bak or something), then delete the whole Fanfiction/Fanfiction hierarchy. |
19:00
🔗
|
bsmith093 |
how many dupes are there, is the bak thing really needed? |
19:00
🔗
|
bsmith093 |
also here http://unix.stackexchange.com/questions/111067/bash-script-to-sort-files-into-alphabetical-folders-on-readynas-duo-v1 |
19:00
🔗
|
JesseW |
The Fanfiction/Fanfiction directory is 2.4GB |
19:00
🔗
|
bsmith093 |
where i got the one-liner |
19:01
🔗
|
|
SN4T14 has joined #archiveteam-bs |
19:01
🔗
|
JesseW |
there are 19 files that differ from the older and the newer versions (all In-Progress ones that got re-written) |
19:01
🔗
|
JesseW |
I think it's worth keeping them. |
19:01
🔗
|
bsmith093 |
ok, that's fair. i like seeing old drafts of things :) |
19:01
🔗
|
|
johtso has joined #archiveteam-bs |
19:01
🔗
|
|
_desu___ has joined #archiveteam-bs |
19:01
🔗
|
|
JSharp___ has joined #archiveteam-bs |
19:02
🔗
|
|
Ctrl-S___ has joined #archiveteam-bs |
19:02
🔗
|
|
deathy has joined #archiveteam-bs |
19:02
🔗
|
JesseW |
I think this will work to move them: rsync --checksum -i -r -b --suffix=.bak Fanfiction/Fanfiction/ Fanfiction/ |
19:02
🔗
|
|
Boltsie has joined #archiveteam-bs |
19:06
🔗
|
|
JSharp___ has quit IRC (Ping timeout: 260 seconds) |
19:06
🔗
|
|
JSharp___ has joined #archiveteam-bs |
19:07
🔗
|
|
Boltsie has quit IRC (Ping timeout: 260 seconds) |
19:08
🔗
|
JesseW |
OK, running the rsync |
19:08
🔗
|
|
Boltsie has joined #archiveteam-bs |
19:09
🔗
|
|
deathy has quit IRC (Ping timeout: 260 seconds) |
19:11
🔗
|
|
TheKiwi has quit IRC (Ping timeout: 260 seconds) |
19:12
🔗
|
|
deathy has joined #archiveteam-bs |
19:12
🔗
|
|
HCross2 has quit IRC (Ping timeout: 260 seconds) |
19:13
🔗
|
|
JSharp___ has quit IRC (Ping timeout: 260 seconds) |
19:14
🔗
|
|
JSharp___ has joined #archiveteam-bs |
19:16
🔗
|
|
metalcamp has quit IRC (Ping timeout: 244 seconds) |
19:18
🔗
|
|
Ctrl-S___ has quit IRC (Ping timeout: 260 seconds) |
19:20
🔗
|
|
_desu___ has quit IRC (Ping timeout: 260 seconds) |
19:21
🔗
|
|
johtso has quit IRC (Ping timeout: 260 seconds) |
19:22
🔗
|
|
_desu___ has joined #archiveteam-bs |
19:24
🔗
|
|
Boltsie has quit IRC (Read error: Connection timed out) |
19:24
🔗
|
|
JSharp___ has quit IRC (Ping timeout: 260 seconds) |
19:24
🔗
|
|
johtso has joined #archiveteam-bs |
19:26
🔗
|
|
TheKiwi has joined #archiveteam-bs |
19:27
🔗
|
|
TheKiwi has quit IRC (Connection closed) |
19:27
🔗
|
|
deathy has quit IRC (Connection closed) |
19:28
🔗
|
|
TheKiwi has joined #archiveteam-bs |
19:29
🔗
|
|
johtso has quit IRC (Read error: Connection timed out) |
19:30
🔗
|
|
deathy has joined #archiveteam-bs |
19:31
🔗
|
|
HCross2 has joined #archiveteam-bs |
19:31
🔗
|
|
JSharp___ has joined #archiveteam-bs |
19:31
🔗
|
|
Boltsie has joined #archiveteam-bs |
19:32
🔗
|
|
Ctrl-S___ has joined #archiveteam-bs |
19:33
🔗
|
|
TheKiwi has quit IRC (Ping timeout: 260 seconds) |
19:33
🔗
|
JesseW |
OK, finished rsync |
19:36
🔗
|
|
HCross2 has quit IRC (Ping timeout: 260 seconds) |
19:37
🔗
|
|
HCross2 has joined #archiveteam-bs |
19:38
🔗
|
|
TheKiwi has joined #archiveteam-bs |
19:41
🔗
|
|
Ctrl-S___ has quit IRC (Ping timeout: 260 seconds) |
19:41
🔗
|
|
Boltsie has quit IRC (Ping timeout: 260 seconds) |
19:42
🔗
|
|
HCross2 has quit IRC (Ping timeout: 260 seconds) |
19:42
🔗
|
|
deathy has quit IRC (Ping timeout: 260 seconds) |
19:43
🔗
|
|
TheKiwi has quit IRC (Ping timeout: 260 seconds) |
19:43
🔗
|
|
Boltsie has joined #archiveteam-bs |
19:43
🔗
|
|
Ctrl-S___ has joined #archiveteam-bs |
19:44
🔗
|
|
deathy has joined #archiveteam-bs |
19:44
🔗
|
|
TheKiwi has joined #archiveteam-bs |
19:44
🔗
|
|
HCross2 has joined #archiveteam-bs |
19:45
🔗
|
|
johtso has joined #archiveteam-bs |
20:17
🔗
|
|
metalcamp has joined #archiveteam-bs |
20:18
🔗
|
ersi |
~derpnet~ |
20:19
🔗
|
joepie91 |
lol |
20:43
🔗
|
|
TheKiwi has quit IRC (Ping timeout: 260 seconds) |
20:43
🔗
|
|
deathy has quit IRC (Ping timeout: 260 seconds) |
20:44
🔗
|
|
deathy has joined #archiveteam-bs |
20:44
🔗
|
|
TheKiwi has joined #archiveteam-bs |
21:09
🔗
|
|
JetBalsa has joined #archiveteam-bs |
21:38
🔗
|
|
JetBalsa is now known as JRWR |
21:38
🔗
|
|
JRWR has quit IRC (Connection closed) |
21:38
🔗
|
|
JRWR has joined #archiveteam-bs |
21:48
🔗
|
|
HCross2 has quit IRC (Ping timeout: 260 seconds) |
21:48
🔗
|
|
HCross2 has joined #archiveteam-bs |
22:41
🔗
|
|
metalcamp has quit IRC (Ping timeout: 244 seconds) |
22:46
🔗
|
bsmith093 |
JesseW: updates? |
23:04
🔗
|
JesseW |
bsmith093: got them merged, and looked at the various initial letters |
23:04
🔗
|
JesseW |
There are a *few* lowercase letters, and punctuation -- I think I'll handle that by case-insensitivity and using the first alphabetical value |
23:04
🔗
|
JesseW |
Also, there are a bunch that start with a digit -- I think I'll combine all those. |
23:05
🔗
|
JesseW |
BTW, thanks for continuing to check on this. |
23:07
🔗
|
JesseW |
The digit ones come to 641MB |
23:09
🔗
|
bsmith093 |
fanficfare auto converted all unsafe chars to underscores, that's why there are so many folders that look weird |
23:11
🔗
|
JesseW |
Ah, that makes sense |
23:12
🔗
|
JesseW |
Actually, I think I'll put all the ones that don't start with capital letters in a misc.zip file at the end. |
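The bucketing rule described here could be sketched as follows. The function name `bucket_for` is hypothetical, and the `misc` label simply mirrors the misc.zip mentioned above.

```sh
# bucket_for NAME: print the archive bucket for a category folder name.
# Names starting with a capital letter A-Z map to that letter; names starting
# with a lowercase letter, digit, underscore or other punctuation map to misc.
# (Hypothetical helper. The letters are listed explicitly so the match does
# not depend on how the current locale interprets the range A-Z.)
bucket_for() {
    case $1 in
        [ABCDEFGHIJKLMNOPQRSTUVWXYZ]*) printf '%s' "$1" | cut -c1 ;;
        *) printf 'misc\n' ;;
    esac
}
```

Under this rule `bucket_for 'Harry Potter'` prints `H`, while a folder whose name starts with an underscore or a digit lands in `misc`.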
23:14
🔗
|
JesseW |
generating sizes now |
23:15
🔗
|
bsmith093 |
inside the files, the names are preserved, in all their utf8 glory. |
23:16
🔗
|
bsmith093 |
to clarify, are you preserving the folder structure? |
23:16
🔗
|
bsmith093 |
and just re shuffling it into less folders? |
23:17
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
23:17
🔗
|
bsmith093 |
"Harry Potter/Completed/Harry Potter - author - title.txt" or just "H/Harry Potter - author - title.txt" |
23:18
🔗
|
JesseW |
I was planning to preserve the existing folder structure |
23:18
🔗
|
bsmith093 |
ok, great |
23:18
🔗
|
bsmith093 |
thanks! |
23:18
🔗
|
JesseW |
hm, A comes to 11G |
23:20
🔗
|
BlueMaxim |
the FF.net archive? that damn thing's huge |
23:21
🔗
|
BlueMaxim |
also both those setups are quite confusing when it comes to crossovers |
23:21
🔗
|
bsmith093 |
BlueMaxim: i know, i scraped it. |
23:21
🔗
|
bsmith093 |
BlueMaxim: not really, a crossover is just stored in the category folder for it |
23:22
🔗
|
bsmith093 |
eg Harry potter x men crossover would be Harry Potter_X Men/Completed/etc |
23:31
🔗
|
JesseW |
So the sizes so far are 11GB, 15GB, 12GB, 17GB for A-D -- then 3GB for E. |
23:31
🔗
|
JesseW |
This is including the three big ones, but I'll just exclude them after the count |
23:50
🔗
|
BlueMaxim |
yeah bsmith093 but it seemed quite random to me how they were sorted by the franchises inside them |
23:50
🔗
|
BlueMaxim |
I may have missed something though |
23:51
🔗
|
|
BlueMaxim is now known as BlueMax |
23:52
🔗
|
JesseW |
So the sizes range from 31GB (for S) |
23:55
🔗
|
JesseW |
and Harry Potter is 35G |
23:55
🔗
|
JesseW |
Q, U and Z are the only ones less than a GB |
23:56
🔗
|
JesseW |
assuming that a 35G (uncompressed) zip file is OK, I think this plan should work fine. |
23:57
🔗
|
BlueMax |
What are you trying to do JesseW |