Size: 16207
Comment: Tar page updated massively. The section has been cut to bare necessity.
|
Size: 16152
Comment: Nice is not a backup program
|
Deletions are marked like this. | Additions are marked like this. |
Line 177: | Line 177: |
* [[http://en.wikipedia.org/wiki/Nice_(Unix)|nice]]. |
Introduction
Backup is essential. Without it, data is entirely exposed to loss under a number of circumstances such as, drive failure or data corruption. Users often only learn this lesson after critical information they require is permanently lost. Attempting to recover from such data loss can be both time consuming and extremely troublesome. So learn from others mistakes, and ensure beforehand that you have a system in place that protects your data and suits your needs.
Backing up files can be accomplished either from a graphical user interface (GUI) or by issuing commands to the command line interface (CLI). It doesn't matter which method you employ, so long as it works for you and you do it on a regular basis. To make sure it's regular, it's best to ensure your method is scheduled so you don't need to remember to do it.
It is also important to note that there are two primary means of backup, folder synchronization and the creation of archives.
Folder synchronization is a means to ensure two different folders at two locations hold the same data. The synchronization is usually modified by certain rules, such as directing the program to only write new files. It is most useful for multimedia files that will not benefit from compression. The reason for this is that during encoding of such media near maximal compression is usually achieved with the codec.
Creating an archive is mainly used for long-term storage, primarily of configuration files and documents. These will benefit greatly from the compression of gzip or bzip2 because the documents aren't stored in compressed formats. Since the size required on disk is much smaller than storing copies of multimedia files, users often create numerous full backups allowing them to go back and restore files from certain dates.
A third alternative is called drive imaging. Drive imaging is similar to the creation of archives, except that instead of copying just folders and files, the entire partition that is selected is backed up to an archive byte for byte. Such archives are useful, in the case of a hard drive failure, entire partitions can be restored quickly to a new hard drive and work resumed.
WARNING: Backups such as drive imaging work at a low level and if improperly used can result in severe data loss, especially with a command line program like dd. Ensure you understand what is being done!
One final note, some file systems support versioning, which allows you to see all changes to a file. This is a local backup, but will do nothing in the case of drive failure. See the appropriate section for a discussion.
Backup From a Graphical User Interface
GUIs offer a graphical interface from which a user can choose from a limited selection of options to customize operation. There are also usually configured defaults and guides that help in selecting the right options. There are a few to choose from, below you'll find several.
Simple Backup Suite
Simple Backup Suite is a program designed for desktop users. It's purpose is succinctly summarized by it's title, to be a simple backup suite. It is easily installed from the repositories and comes with defaults already loaded and configured. From there you can customize what directories it backs up, what it excludes, schedule it to certain times and manage old archives. This suite creates archives, keep that in mind during use.
For a thorough guide to installation and general use, please see BackupYourSystem/SimpleBackupSuite.
grsync
The program grsync is simply a GUI front end for the CLI utility rsync. It presents a clean yet simple interface for controlling the synchronization between two folders. It also supports multiple sessions (otherwise called profiles) and many of the commonly used basic and advanced options for modifying the synchronization.
For more information see the rsync page.
Backup From the Command Line
CLI's primary advantage over a GUI is that it exposes directly to you many more options. With that however comes the obligation of knowing exactly before hand what you want to do. There are no defaults with commands it's entirely up to the user to set the desired options. To better acquaint yourself beforehand, it is advisable that you read UsingTheTerminal.
Creating an Archive
When backing up from the command line tar is often used. The tar command creates an archive file, which can then be further compressed by gzip or bzip2 to decrease the size it occupies on the disk.
A full explanation of this command, including how to backup, restore and how to operate over a network among other things can be found at BackupYourSystem/TAR.
rsync
Rsync updates and copies the files that have changed and even then only transfers the parts of those files that have changed. That is useful for saving bandwidth when backing up over the network. For safety, transfer between two machines is done via SSH. Rsync is especially good for backing up home directories.
The command for transferring to a remote machine is:
sudo rsync --delete -azvv -e ssh /home remoteuser@remotehost.remotedomain:./backupdirectory
-z compresses the data --delete deletes files that don't exist on the system being backed up. Maybe you want this, maybe not. -a preserves the date and times of the files (same as -t), descends recursively into all directories (same as -r), copies symlinks as symlinks (same as -l), preserves file permissions (same as -p), preserves groups (same as -g), preserves file ownership (same as -o), and preserves devices as devices (same as -D). -vv increases the verbosity of the reporting process
rsync has several graphical frontends: grsync, Backup Monitor, QSync, Zynk, rsyncbackup, TKsync
For more options, there is a separate page about rsync. As a companion, the article on SSH might also be worth reading.
Backup Destination on a Remote Machine
Simple Backup Suite
BackupYourSystem/SimpleBackupSuite
Duplicity
BackupPC
Drive Imaging
Drive imaging (also called disc imaging and partition imaging) is very useful, but only in specific cases. In general, it isn't an operation a user would run weekly or daily, rsync and archive creation are much better for such backups. Drive imaging is better used as a long term archive of a partition. Say for instance a user's home machine is set up to dual boot between XP and Ubuntu, any number of things could corrupt his XP or Ubuntu partitions and necessitate a re-installation of either. With a drive image, a user could simply format the partition and copy the data from the archive into the partition and be back up and running quickly.
Drive imaging therefore, like all the other means described is complementary to a user's backup strategy and should be considered on it's merits.
Graphical Programs for Drive Imaging
Two good graphic textual-graphic programs are partimage and clonezilla. They can be found at the below sites.
Creating Disc Images Using dd
WARNING: Read carefully and don't make typos!
Introduction
dd is an old command line program used for low level copying of files. It is a legacy program from UNIX, as such it does not behave or configure like traditional programs in the GNU suite. Caution is advised, using this command with the wrong operators can cause serious data loss.
In the following commands, if and of are short for input file and output file respectively. As the names imply, the input is the partition being backed up and output is the directory where the archive is created. Last important note, you need to know what partition you want to backup and what it's label is. To find out, open a terminal and use the following command:
sudo fdisk -l
The output of the command will list each hard drive installed, and all the partitions on it. It is usually possible to tell which is which by looking at the filesystem installed (ntfs for Windows, Linux for Linux) and the size it occupies on the disk. The user is responsible for knowing which partition is to be backed up. In the following examples replace /dev/hda1 with the name of the partition listed under device boot in the output. Those who prefer a graphical display of your partitions, use gparted. If you want to backup an entire drives data, rather than simply a partition on it, omit the number after the drive name, in our example, use hda.
One additional important note is that if you are restoring a partition to the hard drive, it is highly recommended that you work from a live CD session of Ubuntu. Restoring partitions that are in use (i.e. while booted in Ubuntu or Windows) will cause errors.
Backup with dd
The following are simply examples to understand use of dd, they are informational in nature. Do not simply copy them and use them in a terminal, ensure their use is understand and they have been modified to a user's specific situation and needs.
The following example will create a drive image of partition hda1, the image will be created in the home directory as the file hda1.bin .
dd if=/dev/hda1 of=/home/hda1.bin
This command creates the drive image and then passes it through gzip for compression. Bzip2 could be substituted instead of gzip for greater compression.
dd if=/dev/hda1 | gzip > /home/hda1.bin.gz
Back up the MBR of the hard drive labelled hda.
dd if=/dev/hda of=/home/hda.boot.mbr bs=512 count=1
The following command will overwrite drive hdb to have the same data as hda, this will wipe out all data previously on hdb. This is the same as synchronizing all data on the first drive with the second.
dd if=/dev/hda of=/dev/hdb conv=noerror,sync bs=4k
Restoring a Drive Image
To restore a drive image, as noted above always perform such an action from a live environment to avoid problems with the running operating system, unless your certain it won't conflict. The drive or partition you want to restore to must be exactly equal to or larger than the original size of the partition. Restoration is quite simple, and really just involves reversing the if and of values. This will tell dd to overwrite the drive or partition with the data that is stored in the file. One caveat, ensure the file dd is using as reference isn't stored on the partition or drive your restoring to. If you do this, eventually during the operation dd will overwrite the reference and then fail. Best to use an external/other drive or network storage to hold such drive image archives.
First go superuser and switch to runlevel 1 so that you can fumble around with the harddisk without other services interfering:
sudo su init 1
To restore the first partition on hda continuing our example, we would do the following.
dd if=/home/hda1.bin of=/dev/hda1
The following command would restore the mbr of the first drive with the backup we previously made.
dd if=/mnt/hda1/home/hda.boot.mbr of=/dev/hda bs=512 count=1
If you restored the whole drive (/dev/hda), the system will not automatically create the devices (/dev/hda1, /dev/hda2). To force automatic detection, reboot after completion.
If you restored the system to a new drive, and the device identifiers (called UUIDs, see UsingUUID for a more complete discussion) changed then you must adapt the bootloader and the mount points. While still on runlevel 1, edit these files with your preferred text editor:
/boot/grub/menu.lst /etc/fstab
To know what the new UUIDs for your drives are, use the following command:
sudo blkid
From this list, you can cross-reference the information with that of fdisk to know which drive/parition is which. Then simply update the UUIDs in both GRUB and fstab files.
Once all this is finished, you will be able to resume normal operation where you left off. At this time, if the drive in question is larger than the original, you can edit the partition structure to your liking or add new ones. For this, see Gparted and decide on what action is best for your case.
You can use the "loopback device" to mount this partition image, making it easy to read the old version of files and still keep your current version untouched. Mounting disks with Linux's loopback device by Jason Boxman
Versioning File Systems
These will be very useful when they mature. For example, ext3cow file system states it, "provides a time-shifting interface that allows a real-time and continuous view of the past. This allows users to access their file system as it appeared at any point in time." This feature is automatic -- nothing (meaning no backups, manual copies, etc.) need be done to have this benefit when using a copy-on-write file system.
They let you see every change to a file over it's life span, but offer no protection against a hard drive that fails. These projects are therefore complimentary to the backups described above, which offer snapshots or synchronization that can be stored on other hard drives or even other locations on a network.
Versioning tools such as Subversion could also be useful while we wait for versioning file systems to become mature and fully supported.
Related Links
You might also want to check out these backup programs which will help you to make automated backups of your system:
Other wiki discussing various Linux backup techniques:
See the full discussion in these threads on the Ubuntu forums:
http://www.ubuntuforums.org/showthread.php?t=35087 Howto: Backup and restore your system! (very long thread; but all the good bits have (hopefully) already been summarized on this page.)
"Backup and Restore Your Ubuntu System using Sbackup" has nice step-by-step instructions (with screenshots) of using sbackup.
A personal blog listing many open source backup solutions: List of Free Open Source Linux and Ubuntu Backup Software
Early Stage Ideas
In addition to that already discussed, there are projects still in development that you may wish to keep appraised of or even participate in.