Dar Documentation


TUTORIAL


This page was realised from the official Dar Tutorial, as of version 2.3.4 of June 30th, 2007, with some semantic markup added, to help you quickly spot what you're looking for, or simply ease reading. You may want to adjust the stylesheet to your taste.

Beware that it may not match the version you are using, so you may want to check official website for lastest documentation and downloads.



Introduction

This little tutorial shows you how to backup your file system (partially or totally) on ZIP drives.

STEP 1: The FULL backup

We need first to make a full backup, let's go:

Slices size

The size of a zip drive is 100 MB,
*** here comes the option: -s 100M
This tells dar to not create a single backup file but to split it in several files with a size of at most 100 Megabytes.

The size of the first slice

On your first ZIP drive we want to copy, outside the backup:
the dar binary
to be able to restore it in case of hard disk failure, for example.
IMPORTANT: dar binary relies on several libraries which must also be available in the rescue system or copied with the dar binary. But, if you don't want to worry about needed libraries, there is a static version of dar which only difference is that it has all required library included in the binary (thus it is a slightly larger binary). Its name is "dar_static", and its main reason of existence is to be placed beside backups in case something goes wrong in your system. Note that dar_static is useless for windows, you will always need the Cygwin dll.
You could also add man pages,
if you are scared not to be able to remember all the many feature of dar ;-) and find the -h option too sparse). Note that man pages are available on the web. OK you need an Internet access to read them.
This make the free space on the first ZIP floppy a bit smaller: 95 MB.
*** Here comes the option: -S 95M
(Note that '-s' is lowercase for all the slices, and '-S' is UPPERCASE meaning the initial slice only).

Pause between slices

We need to pause between slices to change the ZIP floppy when it is full
*** here comes the option: -p -b
telling dar to pause before writing a new slice (-p) and to ring the terminal bell (-b) when user action is needed.

Data compression

We will compress data inside the backup
*** here comes the option: -z
-z uses gzip compression algorithm, but you could also use -y to use bzip2 compression. Optionally, if speed is more important than archive size, you can degrade compression specifying the compression level : -z1 for example. 

Directories to backup

Now, we want to backup the whole file system.
*** here comes the option: -R /
This tells dar that no files out of this directory tree will be saved (here, it means that no files will be excluded from the backup, if no filter is specified, see below)

Excluding files

There are some files you don't want to backup like backup files generated by emacs "*~" and .*~".
*** here comes the options: -X "*~" -X ".*~"
Note that you have to quote the mask for it not to be interpreted by the shell the -X options do not apply to directories, nor to path, they just apply to filenames. (see also -I option in man page for more information)

Excluding directories

Among these files are several sub-trees you must not save: the /proc file system for example, as well as the /dev/pts . Theses are virtual file systems, saving them would only make your backup bigger filled with useless stuff.
*** here come the options: -P dev/pts -P proc
Note that
  • path must be relative to -R option (thus no leading '/' must be used)
  • Unlike the -X/-I options, the -P can apply to a directory.
  • If a directory matches -P option, all its subdirectory will also be excluded.
  • note also that -P can receive wildcards, and they must be quoted not to be interpreted by the shell:
    -P "home/*/.mozilla/cache" for example.
  • Lastly, -P can also be used to exclude a plain file (if you don't want to exclude all files of a given name using -X option):
    -P home/joe/.bashrc for example would only exclude joe's .bashrc file not any other file, while
    -X .bashrc will exclude any file of that name including joe's file.
    (see also -g option in man page for more)
More importantly we must not save the backup itself:
*** here comes the option: -P mnt/zip
assuming that your ZIP is mounted under /mnt/zip .
We should also have excluded all files of extension "dar" which are backup generated by dar with -X "*.*.dar", but this would exclude other dar archive from the backup, which may not always fit your need.

Keep track of pruned directories

Now, as we don't save the /dev/pts /proc and /mnt/zip directories, we would have to create them by hand at recovery time. But we can better use the -D option, which does not totally ignore excluded directories but rather stores them as empty.
*** here comes the option -D
thus at recovery time they will be generated automatically

Name the backup

Lastly, we have to give a name to this full backup. Let's call it "linux_full" as it is supposed to take place on the ZIP drive, its path is /mnt/zip/linux_full
*** here comes the option: -c /mnt/zip/linux_full
Note that linux_full is not a complete filename, it is a "basename", on which dar will add a number and an extension ".dar", this way the first slice will be a file of name linux_full.1.dar located in /mnt/zip

The procedure

Now, as we will have to mount and umount the /mnt/zip file system, we must not have any process using it, in particular, dar current directory must no be /mnt/zip so we change to / for example.

All together we follow this procedure for our example:

Put an empty ZIP floppy in the device, and mount it according to your /etc/fstab file.

mount /mnt/zip

Copy the dar binary to the zip drive (to be able to restore in case of big problem, like a crash disk) and eventually man pages

cp `which dar_static` /mnt/zip

then, type the following
cd /
dar -c /mnt/zip/linux_full -s 100M -S 95M -p -b -z -R / -X "*~" -X ".*~" -P dev/pts -P proc -P mnt/zip -D

Note that option order has no importance. Some options may be used several times (-X, -I, -P) some others cannot (see man page for more).
When the first slice will be done, DAR will pause, ring and display a message. You will have to unmount the floppy

umount /mnt/zip

eject and replace the floppy by an empty new one and mount it

mount /mnt/zip

To be able to to that, you can swap to another virtual console pressing ALT+F? keys, or open another xterm if under X-Windows, and suspend dar by typing CTRL-Z and reactivating it after mounting/unmounting by typing `fg` (without the quotes).

Then proceed with dar, pressing the <enter> key.

Dar will label slices this way:
slice 1: linux_full.1.dar
slice 2: linux_full.2.dar
and so on.

That's it! We finished the first step, it may take a long time depending on the size of the data to backup, The following step (differential backup) however can be done often, and it will stay fast every time (OK, except if a big part of your system have changed, in that case you can consider making another full backup).

Checking the whole archive

Just a little check on the archive you've just made: suppose you want to read the content of the backup you made, you would have
to run:

dar -l /mnt/zip/linux_full

Optionally if you don't trust your zip floppies, you can either check the archive contents, or compare what's stored in it with the current file system.

dar -t /mnt/zip/linux_full

will check the whole archive.

It is recommended to first unmount it and remount it, this to flush the cache. Else you may read data from cache (in memory) and do not detect an error on you floppy. dar -t cannot check a single slice, it checks all the archive.

Checking a single slice

If you need to check a single slice (for example after burning it on CD) you can use the diff command : for example, you burn the last completed slices on CD-R, but have just enough free space to store one slice on disk. You can thus check the slice typing something like:

diff /mnt/cdrom/linux_full.132.dar /tmp/linux_full.132.dar

where 132 has to be replaced by the real slice number.

If all is ok (which is when diff does not complain), you can delete the slice from the hard disk (/tmp/slice.x.dar), and continue with dar. Else, you will have to burn again the slice on a new CD-R.

Compare to a given directory

dar -d /mnt/zip/linux_full -R /

will compare the archive with filesystem tree located at / . Same remark as previously, it is recommended to first unmount and mount the floppy to flush the system cache.

STEP 2: DIFFERENTIAL BACKUP

The only thing to add is the base name of the backup we take as reference
*** here comes the option: -A /mnt/zip/linux_full

Of course, we have to choose another name for that backup, let's call it linux_diff1
*** here comes the option: -c /mnt/zip/linux_diff1

Last point: if you want to put the new backup at the end of the full backup, you will have to change the -S option according to the remaining space on it. suppose the last slice of linux_full takes 34MB you have 76MB available for the first slice of the differential backup (and always 100MB for the following ones),
*** here comes the option: -S 76M
but if you want to put the backup on a new floppy, just forget the -S option.

All together we get:

dar -c /mnt/zip/linux_diff1 -A /mnt/zip/linux_full -s 100M -S 76M -p -b -z -R / -X "*~" -X ".*~" -P dev/pts -P proc -P mnt/zip -D

the same way we have done previously, just before effectively starting to backup, dar will ask for the first and then the last slice of the archive of reference, then dar will pause for you to change the floppy and put the one you want to put the backup on (thanks to the -p option).

STEP 3: ENDLESS DIFFERENTIAL BACKUP

You can make another differential backup, taking linux_diff1 as reference, in this case you would change only the following

-c /mnt/zip/linux_diff2 -A /mnt/zip/linux_diff1

You could also decide to change of device, taking a 1'44MB floppy or a CD-R,  this would not cause any problem at all. It arrives a time when you get many differential backup for a single full backup, it is then time to make a new full backup, depending on your available time for doing it, or on your patient if one day you have to recover the whole data after a disk crash: You would then have to restore the full backup, then all the following differential backup up to the most recent one. This requires more user intervention than restoring a single full backup, all is a matter of balance, between the time it takes to backup and the time it takes to restore.

Note, that starting release 1.2.0 a new command appeared that helps restoring a small set of file from a lot a differential backup. Its name is dar_manager. See at the end of this tutorial and man page for more.

Another solution, is when you have too much differential backup, is to make the next differential backup taking the last full_backup as reference, instead of the last differential backup done. This way, it will take less time than doing a full backup, and you will not have to restore all intermediate differential backup. Some people make difference between "incremental" backup and "differential" backup. Here they are the same, all depends on the nature of the reference backup you take.

Of course, a given backup can be used as reference for several differential backup, there is no limitation in number nor in nature (the reference can be a full of differential backup).


STEP 4: RECOVER AFTER A DISK CRASH

Sorry, it arrived, your old disk has crashed. OK, you are happy because you have now a good argument to buy the very fast and very enormous very last hard disk available. Usually, you also cry because you have lost data and you will have to reinstall all your system, that was working for so long!

If however the last backup you made is recent, then keep smiling! OK, you have installed your new hard disk and configured you BIOS to it (if necessary). You will need a bootable floppy, with a minimum Linux system on it, that allows you to access your zip drive and your new empty hard disk (in the case your backup resided on ZIP disk). For example use the Slackware floppy disks, they are nicely done. You don't need to install something on your brand-new disk, just make partitions and format as you want: We suppose your new disk is /dev/hda and /dev/sga is your ZIP drive.

1. Create the partition table as you wish, using
fdisk /dev/hda

2. Format the ext2 partition which will receive your data
mke2fs /dev/hda1

3. Additionally format the swap partition (if needed)
mkswap -c /dev/hda2

3bis. If you have a lot of file to restore, you can activate the swap on the partition of your new hard drive:
swapon /dev/hda2

4. Now we must mount the hard disk, somewhere.

cd /
mkdir disk
mount -t ext2 /dev/hda1 /disk

would do the trick

4bis. If you want to restore your system over several partitions like /usr /var /home and / you must create the partitions, format them. Then create the directories that will be used as mounting point an mount the partitions on theses directories

mkdir /disk/usr /disk/var /disk/home
mount -t ext2 /dev/hda2 /disk/usr
mount -t ext2 /dev/hda3 /disk/var
mount -t ext2 /dev/hda4 /disk/home

for example if you have / /usr /var and /home partitions.

5. We need to copy the dar binary from the ZIP to your disk insert the floppy ZIP where you have copied the dar binary

cd /

mkdir zip
mount -t ext2 /dev/sga /zip
cp /zip/dar_static /disk

where /dev/sga points to your zip drive

6. Now we restore the archive. The stuff has to go in /disk subdirectory
*** here comes the option: -R /disk

7. The process may be long, thus it might be useful to be noticed when a user action is required by dar.
*** here comes the option: -b

let's go restoring!
/disk/dar_static -x /zip/linux_full -R /disk -b

... and when the next zip floppy is needed,
umount /zip
change the floppy and mount it:
mount -t ext2 /dev/sga /zip
as previously, use an alter xterm / virtual console or suspend dar by CTRL-Z and awake it back by the 'fg' command. Then press <enter> to proceed with dar


7. Once finished with the restoration of linux_full, we have to do the same with any following backup, This way however will warn you any time it restores a more recent file (file overwriting) or any time a file that has been removed since the backup of reference, has to be removed from file system (suppression). If you don't want to press the <enter> key several thousand times:
*** here comes the option: -w
(don't warn). All file will be overwritten without warning, but you may also use the -r option, that will avoid trying to overwrite more recent files than those on filesystem. It might not be of a great use, if you restore a differential backup after its reference backup on an initially empty disk (file stored in the differential archive are more recent than those in the reference). But, it might be useful in some other situations.

All together it makes:
/disk/dar_static -x /zip/linux_diff1 -R /disk -b -w

Then any additional archive:
/disk/dar_static -x /zip/linux_diff2 -R /disk -b -w
...
/disk/dar_static -x /zip/linux...    -R /disk -b -w


8. Finally, remove the dar binary from the disk:
rm /disk/dar_static

9. And launch lilo for your Linux box to boot properly (if needed):
lilo -r /disk

10. You can reboot you machine and be happy with you brand-new hard disk with your old precious data on it:
shutdown -r now

OK, one day, I will make something like a bootable floppy image with dar inside, maybe with a simple script for user interaction... if you have already done it, you can tell or send  me (or give me the URL where to get it, for I add a hyperlink on DAR's Homepage). [Note: Knoppix seems to include dar]

STEP 4(bis): recover only some files

Gosh, you have remove a important file by error. Thus, you just need to restore it, not the rest of the full and differential backups.

a) First method:

We could as previously, try all archive starting from the full backup up to the most recent differential backup, and restore just the file if it is present in the archive:

dar -R / -x /zip/linux_full -g home/denis/my_precious_file

This would restore only the file /home/denis/my_precious_file from the full backup.

OK, now we would also have to restore from all differential backup the same way we did. Of course, this file may have changed since the full backup.

dar -R / -x /zip/linux_diff1 -g home/denis/my_precious_file

and so on, up to the last differential archive.

dar -R / -x /zip/linux_diff29 -g home/denis/my_precious_file


b) Second method (more efficient):

We will restore our lost file, starting from the most recent differential backup and *maybe* up to the full backup. Our file may or may not be present in the each differential archive as it may have changed or not since the previous version, thus we have to check if our file is restored, using the -v option (verbose):

dar -R / -x /zip/linux_diff29 -v -g home/denis/my_precious_file

If we can see a line like

restoring file: /home/denis/my_precious_file

The it stops here, because we got the most recent backup version of our lost file. We have to otherwise have to continue with the previous differential backup, up to the full backup if it is necessary. This method has an advantage over first one, which is not to have *in all case* the need to use all the backup done since the full backup.

If you are lazy on the other hand, have a look at dar_manager (at the end of the tutorial)

OK, now you have two files to restore. No problem, just do the second method but add -r option not to override any more recent file already restored in a previous step:

dar -x /zip/linux_diff29 -R / -r -v -g home/denis/my_precious_file -g etc/fstab

Check the output to see if one or both of your files got restored. If not, continue with the previous backup, up to the time you have seen for each file a line indicating it been restored. Note that the most recent version of each files may not be located in the same archive, thus you might get /etc/fstab restored from linux_diff28, and /home/denis/my_precious_file restored at linux_diff27. In the case /etc/fstab is also present in linux_diff27 it would not have been overwritten by an older version, thanks to the -r option.

This option is very important where restoring more than one file using the second method. Instead, in the first method (restoring first from the full backup, then from all the following differential backups), -r option is not so important as if overwriting occurs when you restore lost files, you would only overwrite an older version by a newer.

Same thing here, even if you are not lazy, dar_manager can help you lots here to automate the restoration of a set of file.

How much can you restore with dar_manager?  As much as you can give to dar_manager as argument on command line or through  included files (-B option).


DAR_MANAGER TUTORIAL

dar_manager builds a database of all your archive contents, to automatically restore the latest versions of a given set of files. First, we have to create a "database" file :

dar_manager -C my_base.dmd

This created a file "my_base.dmd" where dmd stands for Dar Manager Database, but you are free to use any other filename.

This database is created empty. Each time you make a backup, may it be full or differential, you will have to add its catalogue to this database using the  following command:

dar_manager -B my_base.dmd -A /mnt/zip/linux_full

This will add ("A" stands for "add") the archive contents to the base. In some cases you may not have the archive available but its extracted catalogue instead. Of course, you can use the extracted catalogue in place of the archive!

dar_manager -B my_base.dmd -A ~/Catalogues/CAT_linux_full

The problem however is that when dar_manager will need to recover a file located in this archive it will try to open the archive   ~/Catalogue/CAT_linux_full for restoration, which does not contain any data because it is just the catalogue of the archive.

No problem if you made this mistake, thanks to the -b option we can change the basename of the archive, and thanks to the -p option you can change the path at any time. But first we will list the database contents:

dar_manager -B my_base.dmd -l

It shows the following:


dar path    :
dar options :

archive #   |    path      |    basename
------------+--------------+---------------
        1       /home/denis/Catalogues      CAT_linux_full

We should change the path of archive number 1 for dar_manager looks on the zip drive:

dar_manager -B my_base.dmd -p 1 /mnt/zip

and also replace the name of the extracted catalogue by the real archive name

dar_manager -B my_base.dmd -b 1 linux_full

Now we have exactly the same database as if we had use the real archive instead of its catalogue:

dar_manager -B my_base.dmd -l


dar path    :
dar options :

archive #   |    path      |    basename
------------+--------------+---------------
        1       /mnt/zip     linux_full


In place of using -b and -p options, you can also tell the path and the name of the real archive to use at restoration time this way:

dar_manager -B my_base.dmd -A ~/Catalogues/CAT_linux_full /mnt/zip/linux_full

This is done adding an optional argument. The first ~/Catalogue... is the archive to add, and the second /mnt/zip... is the name to keep for it. No access is done to this archive, thus it may be not available at the time the command is typed.

You can add up to 65534 archives to a given database, and have as much base as you want.

Suppose after each backup you took the time to update your database, and now you just have removed an important file by error.

Firstly, we did not yet gave important options in the database to be passed to dar. For example, you will likely restore from the root of your filesystem, therefor dar when called from dar_manager must get the "-R /" option. This is done with:

dar_manager -B my_base.dmd -o -R /

All that follows -o is passed to dar as-is. You can see the options passed to dar when listing the database contents (-l option).

Now, we can restore our /home/denis/my/precious/file :

dar_manager -B my_base.dmd -r home/denis/my/precious/file

dar_manager will find the proper archive to use, and call dar with the following options:

dar -x <archive> -R / home/denis/my/precious/file

Which in turn will ask you the corresponding slices.

Once an archive become obsolete you can delete it from the database thanks to the -D option, you can also change archive order (-m option), get a list in which is located a given file (-f option), get the list of most recent files in a given archive (-u option), and get overall statistics per archive (-s option). Lastly you can specify which dar command to use given its path (-d option), by default, dar_manager uses the PATH shell variable to choose the dar command.

An a new feature for those that are really very lazy : dar_manager has an interactive mode, so you don't have to remeber all theses command-line switch except one:

dar_manager -B my_base.dmd -i

Interactive mode allow you to do all operation except restoration which can be done as previously explained.