Obnam NEWS
==========
This file summarizes changes between releases of Obnam.
Version 0.22, released 2011-08-25; a BETA release
-------------------------------------------------
USER VISIBLE CHANGES:
* Obnam now reports its current configuration in the log file at startup.
This will hopefully remove one round of "did you use the --foo option?"
questions between developers and bug reporters.
BUG FIXES:
* The repository is now unlocked on exit only if it is still locked.
* A wrongly caught `GeneratorExit` is now dealt with properly.
* Keyboard interrupts are logged, so they don't show up as anonymous errors.
CHANGES RELEVANT TO DEVELOPERS ONLY:
* `setup.py` has been enhanced to work more like the old `Makefile` did:
`clean` removes more artifacts. Instructions in `README` have been updated
to point at `setup.py`.
* Compiler warning about `_XOPEN_SOURCE` re-definition fixed.
* Tests are now again run during a Debian package build.
Version 0.21, released 2011-08-23; a BETA release
-------------------------------------------------
USER VISIBLE CHANGES:
* Obnam will now unlock the repository if there's an error during a backup.
For the most part, the `force-lock` operation should now be unnecessary,
but it's still there in case it's useful some day.
BUG FIXES:
* Negative timestamps for files now work. Thanks to Jamil Djadala
for reporting the bug.
* The documentation for --checkpoint units fixed. Thanks, user weinzwang
from IRC.
* The connections to the repository and live data filesystem are now
properly closed. This makes benchmark read/write statistics be correct.
Version 0.20.1, released 2011-08-11; a BETA release
-------------------------------------------------
BUG FIXES:
* More cases of Unicode strings versus plain strings in filenames
over SFTP fixed. Thanks to Tapani Tarvainen.
Version 0.20, released 2011-08-09; a BETA release
-------------------------------------------------
BUG FIXES:
* Non-ASCII filenames over SFTP root now work. (Thanks, Tapani Tarvainen,
for the reproducible bug report.)
* The count of files while making a backup now counts all files found,
not just those backed up. The old behavior was confusing people.
USER VISIBLE CHANGES:
* The output of `obnam ls` now formats the columns a little prettier,
so that wide values do not cause misalignment.
* The error message when trying to use an encrypted repository without
encryption is now better (and suggests missing encryption being the
reason). Thanks, chrysn.
* Obnam now supports backing up of Unix sockets.
Version 0.19, released 2011-08-03; a BETA release
-------------------------------------------------
INCOMPATIBILITY CHANGES:
* We now require version 0.21 of the `larch` library, and this requires
bumping the repository format. This means old backup repositories can't
be used with this version, and you need to back up everything again.
(Please tell me when this becomes a problem.)
BUG FIXES:
* Found one more place where a file going missing during a backup may
cause a crash.
* Typo in error message about on-disk formats fixed.
(Thanks, Tapani Tarvainen.)
* The `--trace` option works again.
* `fcntl.F_SETFL` does not seem to work on file descriptors for files
owned by root that are read-only to the user running obnam. Worked
around by ignoring any problems with setting the flags.
* The funnest bug in this release: if no log file was specified with `--log`,
the current working directory was excluded from the backup.
USER VISIBLE CHANGES:
* `obnam(1)` manual page now discusses how configuration files are used.
* The manual page describes problems using sftp to access live data.
* The documentation for `--no-act` was clarified to say it only works
for `forget. (Thanks, Daniel Silverstone.)
* `obnam-benchmark` now has a manual page.
* The backup plugin logs files it excludes, so the user can find out what's
going on. A confused user is an unhappy user.
INTERNAL STUFF:
* Tracing statements added to various parts of the code, to help debug
mysterious problems.
* All exceptions are derived from `obnamlib.AppException` or
`obnamlib.Error`, and those are derived from `cliapp.AppException`,
so that the user gets nicer error messages than Python stack traces.
* `blackboxtests` is no longer run under fakeroot, because Debian packages
are built under fakeroot, and fakeroot within fakeroot causes trouble.
However, the point of running tests under fakeroot was to make sure
certain kinds of bugs are caught, and since Debian package building runs
the tests anyway, the test coverage is not actually diminished.
* The `Makefile` has new targets `fast-check` and `network-tests`. The
latter runs tests over sftp to localhost.
Version 0.18, released 2011-07-20; a BETA release
-------------------------------------------------
* The repository format has again changed in an incompatible manner,
so you will need to re-backup everything again. (If this is a problem,
tell me, and I'll consider adding backwards compatibility before 1.0
is released.)
* New option `--exclude-caches` allows automatic exclusion of cache
directories that are marked as such.
* Obnam now makes files in the repository be read-only, so that they're
that much harder to delete by mistake.
* Error message about files that can't be backed up now mentions the
correct file.
* Bugfix: unreadable files and directories no longer cause the backup
to fail. The problems are reported, but the backup continues.
Thanks to Jeff Epler for reporting the bug.
* Speed improvement from Jeff Epler for excluding files from backups.
* Various other speed improvements.
* Bugfix: restoring symlinks now works even if the symlink is restored
before its target. Also, the permissions of the symlink (rather than its
target) are now restored correctly. Thanks to Jeff Epler for an
exemplary bug report.
* New option `--one-file-system`, from Jeff Epler.
* New benchmarking tool `obnam-benchmark`, which is more flexible than
the old `run-benchmark`.
* When encrypting/decrypting data with GnuPG, temporary files are no
longer used.
* When verifying, `.../foo` and `.../foo/` now work the same way.
* New option `--symmetric-key-bits`.
* The chunk directory uses more hierarchy levels, and the way chunks
are stored there is now user-configurable (but you'll get into trouble
if you don't always use the same configuration). This should speed
things up a bit once the number of chunks grows very large.
* New `--chunkids-per-group` option, for yet more knobs to tweak when
searching for optimal performance.
* Local files are now opened using `O_NOATIME` so they can be backed
up without affecting timestamps.
* Now uses the `cliapp` framework for writing command line applications.
The primary user-visible effect is that the manpage now has an
accurate list of options.
* Bugfix: Obnam now again reports VFS I/O statistics.
* Bugfix: Obnam can again back up live data that is accessed using sftp.
Thanks to Tapani Tarvainen for reporting the problem.
Version 0.17, released 2011-05-21; a BETA release
-------------------------------------------------
* This is the second BETA release.
* The `run-benchmark` script now works with the new version of `seivot`.
The only benchmark size is one gibibyte, for now, because Obnam's too
slow to do big ones in reasonable time. As an aside, the benchmark
script got rewritten in Python, so it can be made more flexible.
* Benchmarks are run using encrypted backups.
* The kernel buffer cache is dropped before each obnam run, so the
benchmark result is more realistic (read: slower).
* Obnam now rotates its logs. See `--log-max` and `--log-keep` options
in the manual page. The default location for the log file is now
`~/.cache/obnam/obnam.log` for people, and
`/var/log/obnam.log` for root.
* Obnam now restores sparse files correctly.
* There have been some speed improvements to Obnam.
* The `--repository` option now has the shorter alias `-r`, since it
gets used so often.
* `obnam force-lock` now merely gives an error message, instead of a
Python stack trace, if the repository does not exist.
* Obnam now does not crash if files go missing during a backup, or can't
be read, or there are other problems with them. It will report the
problem, but then continue as if it had never heard of the file.
* Obnam now supports FIFO files.
* Obnam now verifies checksums when it restores files.
* Obnam now stores the checksum for the whole file, not just the checksum
for each chunk of its contents.
* Obnam's own log file is automatically excluded from backups.
* Obnam now stores and restores file timestamps to full accuracy,
instead of truncating them to whole seconds.
* The format of the backup repository has changed in an incompatible way,
and Obnam will now refuse to use an old repository. This means you
will need to use an old version to restore from them, and need to
re-backup everything. Sorry.
Version 0.16, released 2011-07-17; a BETA release
-------------------------------------------------
* This is the first BETA release. Obnam should now be feature complete
for real use. Performance is lacking and there are many bugs remaining.
There are no known bugs that would corrupt backed up data, or prevent
its recovery.
* Add encryption support. See the manual page for how to use it.
Version 0.15.1, released 2011-03-21; an ALPHA release
----------------------------------------------------
* Fix `setup.py` to not import `obnamlib`, so it works when building under
pbuilder on Debian. Meh.
Version 0.15, released 2011-03-21; an ALPHA release
----------------------------------------------------
Bugs fixed:
* Manual page GPL copyright blurb is now properly marked up as a comment.
(Thanks, Joey Hess.)
* README now links to python-lru correctly. (Thanks, Erik Johansson.)
Improvements and other changes:
* Filenames and directories are backed up in sorted order. This should
make it easier to know how far obnam's gotten.
* The location where backups are stored is now called the repository,
instead of the store. Suggested by Joey Hess.
* The repository and the target directory for restored data are now
both created by Obnam, if they don't already exist. Suggested by
Joey Hess.
* Better control of logging, using the new `--trace` option.
* Manual page now explains making backups a little better.
* Default value for `--lru-size` reduced to 500, for great improvement
in memory used, without, it seems, much decrease in speed.
* `obnam verify` now reports success explicitly. Based on question
from Joey Hess.
* `obnam verify` now accepts both non-option arguments and the `--root`
option. Suggested by Joey Hess.
* `obnam forget` now accepts "generation specifiers", not just numeric
generation ids. This means that `obnam forget latest` works.
* I/O statistics are logged more systematically.
* `obnam force-lock` introduced, to allow breaking a lock left behind
if obnam crashes. But it never does, of course. (Well, except if there's
a bug, like when a file changes at the wrong moment.)
* `obnam genids` introduced, to list generation ids without any other data.
The old command `obnam generations` still works, and lists other info
about each generation as well, but that's sometimes bad for scripting.
* The `--dump-memory-profile` option now accepts the value `simple`, for
reporting basic memory use. It has such a small impact that it's the
default.
* Obnam now stores the version of the on-disk format in the repository.
This should allow it to handle repositories created by a different
version and act suitably (hopefully without wiping all your backups).
Version 0.14, released 2010-12-29; an ALPHA release
----------------------------------------------------
This version is capable of backing up my laptop's home directory.
It is, however, still an ALPHA release, and you should not rely on
it as your sole form of backup. It is also slow. But if you're
curious, now would be a good time to try it out a bit.
Bug fixes:
* `COPYING` now contains GPL version 3, instead of 2. The code was
licensed under version 3 already. (Thank you Greg Grossmeier.)
* The manual page now uses `-` and `\-` correctly.
* `obnam forget` now actually removes data that is no longer used by
any generation.
* When backing up a new generation, if any of the root directories for
the backup got dropped by the user, they are now also removed from
the backup generation. Old generations obviously still have them.
* Only the per-client B-tree forest should have multiple trees. Now this
actually happens, whereas previously sometimes a very large number of
new trees would be created in some forests. (What's good for rain
forests is not good for saving disk space.)
* When recursing through directory trees, obnam no longer follows
symlinks to directories.
* obnam no longer creates a missing backup store when backing up to
a local disk. It never did this when backing up via sftp. (This
saves me from figuring out which of `store`, `stor`, and `sorte`
is the real directory.)
New features and stuff:
* `blackboxtest` has been rewritten to use Python's `unittest`
framework, rather than a homegrown bad re-implementation of some of it.
* `obnam ls` interprets arguments as "genspecs" rather than generation
identifiers. This means `obnam ls latest` works, and now `latest` is
also the default if you don't give any spec.
* `run-benchmarks` now outputs results into a git checkout of
, an ikiwiki instance hosted by
. The script also puts the results into
a suitable sub-directory, adds a page for the RSS feed of benchmark
results, and updates the report page that summarizes all stored results.
* There is now a 100 GiB benchmark.
* Clients are now called clients, instead of hosts. This terminology should
be clearer.
* The list of clients now stores a random integer identifier for each client
(unique within the store). The identifier is used as the name of the
per-client B-tree directory, rather than the hostname of the client.
This should prevent a teeny tiny bit of information leakage. It also
makes debugging things much harder.
* Various refactorings and prettifications of the code has happened.
For example, several classes have been split off from the `store.py`
module. This has also resulted in much better test coverage for those
classes.
* The per-client trees (formerly GenerationStore, now ClientMetadataTree)
have a more complicated key now: 4 parts, not 3. This makes it easier
to keep separate data about files, and other data that needs to be
stored per-generation, such as what the generation id is.
* `find-duplicate-chunks`, a tool for finding duplicate chunks of data
in a files in a directory tree, was added to the tree. I have used it
to find out if is worthwhile to do duplicate chunk removal at all.
(It is, at least for my data.) Also, it can be used to find good
values for chunk sizes for duplicate detection.
* The whole way in which obnam does de-duplication got re-designed and
re-implemented. This is tricky stuff, when there is more than one client.
* `SftpFS` now uses a hack copied from bzrlib, to use openssh if it is
available, and paramiko only if it is not. This speeds up sftp data
transfers quite a bit. (Where bzrlib supports more than just openssh,
we don't, since I have no way to test the other stuff. Patches welcome.)
* The way lists of chunk ids are stored for files got changed. Now we store
several ids per list item, which is faster and also saves some space
in the B-tree nodes. Also, it is now possible to append to the list,
which means the caller does not need to first gather a list of all ids.
Such a list gets quite costly when the file is quite big (e.g., in the
terabyte size).
* New `--dump-memory-profile` option was added to help do memory profiling
with meliae or heapy have been added. (Obnam's memory consumption finally
got annoying enough that I did something about it.)
Removed stuff:
* The functional specification was badly outdated, and has been removed.
I decided to stop kidding myself that I would keep it up to date.
* The store design document has been removed from the store tree.
The online version at is the
canonical version, and is actually kept up to date.
* The benchmark specification has likewise been replaced with
.
Version 0.13, released 2010-07-13; an ALPHA release
----------------------------------------------------
* Bug fix: a mistake in 0.12 caused checkpoints to happen after each
file after the first checkpoint. Now they happen at the right intervals
again.
* Upload speed is now displayed during backups.
* Obnam now tells the kernel that it shouldn't cache data it reads or
writes. It is not likely that data being backed up is going to be
needed again any time soon, so there's no point in caching it.
(The posix_fadvise call is used for this.)
* New --lru-size option sets size of LRU cache for nodes in memory.
The obnam default is large enough to suit large backups. This uses more
memory, but is faster than btree's small default of 100.
Version 0.12, released 2010-07-11; an ALPHA release
----------------------------------------------------
* NOTE: This version makes incompatible changes to the way data is stored
on-disk. Backups made with older versions are NOT supported. Sorry.
* The run-benchmark script has dropped some smaller sizes (they're too
fast to be interesting), and adds a 10 GiB test size.
* Various speed optimizations. Most importantly, the way file metadata
(results of lstat(2)) are encoded has changed. This is the incompatible
change from above. It's much faster now, though.
* Preliminary support for using SFTP for the backup store added. Hasn't
been used much yet, so might well be very buggy.
Version 0.11, released 2010-07-05; an ALPHA release
----------------------------------------------------
* Speed optimizations:
- chunk identifiers are now sequential, except for the first one, or
when there's a collision
- chunks are now stored in a more sensible directory hierarchy (instead
of one per directory, on average)
- adding files to a directory in the backup store is now faster
- only store a file's metadata that if it is changed
* New --exclude=regexp option to exclude files based on pathnames
* Obnam now makes checkpoints during backups. If a backup is aborted
in the middle and then re-started, it will continue from the latest
checkpoint rather than from the beginning of the previous backup run.
- New option --checkpoint to set the interval between checkpoints.
Defaults to 1 GiB.
* Options for various B-tree settings. This is mostly useful for finding
the optimal set of defaults, but may be useful in other situations for
some people.
- New options --chunk-group-size, --chunk-size, --node-size,
--upload-queue-size.
* Somewhat better progress reporting during backups.
Version 0.10, released 2010-06-29; an ALPHA release
---------------------------------------------------
* Rewritten from scratch.
* Old NEWS file entries removed (see bzr if you're interested).