e2fsprogs

Commit Graph

Author	SHA1	Message	Date
Darrick J. Wong	4892bce3c4	e2undo: ditch tdb file, write everything to a flat file The existing undo file format (which is based on tdb) has many problems. First, its comparison of superblock fields is ineffective, since the last mount time is only written by the kernel, not the tools (which means that undo files can be applied out of order, thus corrupting the filesystem); block numbers are written in CPU byte order, which will cause silent failures if an undo file is moved from one type of system to another; using the tdb database costs us an enormous amount of CPU overhead to maintain the key data structure, and finally, the tdb database is unable to deal with databases larger than 2GB. (Upstream tdb 1.2.12 can handle 4GB, but upgrading a 2TB FS to 64bit,metadata_csum easily produces 2.9GB of undo files, so we might as well move off of tdb now.) The last problem is fatal if you want to use tune2fs to turn on metadata checksumming, since that rewrites every block on the filesystem, which can easily produce a many-gigabyte undo file, which of course is unreadable and therefore the operation cannot be undone. Therefore, rip all of that out in favor of writing to a flat file. Old blocks are appended to a file and the index is written to the end when we're done. This implementation is much faster than wasting a considerable amount of time trying to maintain a hash index, which drops the runtime overhead of tune2fs -O metadata_csum from ~45min to ~20 seconds on a 2TB filesystem. I have a few reasons that factored in my decision not to repurpose the jbd2 file format for undo files. First, undo files are limited to 2^32 blocks (16TB) which some day might not serve us well. Second, the journal block size is tied to the file system block size, but mke2fs wants to be able to back up big chunks of old device contents. This would require large changes to the e2fsck journal replay code, which itself is derived from the kernel jbd2 driver, which I'd rather not destabilize. Third, I want to require undo files to store the FS superblock at the end of undo file creation so that e2undo can be reasonably sure that an undo file is supposed to apply against the given block device, and doing so would require changes to the jbd2 format. Fourth, it didn't seem like a good idea that external journals should resemble undo files so closely. v2: Provide a state bit that is only set when the undo channel is closed correctly so we can warn the user about potentially incomplete undo files. Straighten out the superblock handling so that undo files won't be confused for real ext* FS images. Record multi-block runs in each block key to reduce overhead even further. Support reopening an undo file so that we can combine multiple FS operations into one (overall smaller) transaction file, which will be easier to manage. Flush the undo index data if the program should terminate unexpectedly. Update the ext4 superblock bits if errors or -f is found to encourage fsck to do a full run the next time it's invoked. Enable undoing the undo. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-05-05 10:40:16 -04:00
Darrick J. Wong	ec2019d109	e2undo: fix memory leaks and tweak the error messages somewhat Fix memory leaks and improve the error messages to make it easier to figure out why e2undo went wrong. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-05-05 10:39:51 -04:00
Darrick J. Wong	3a82e80c55	undo-io: use a bitmap to track what we've already written It's really inefficient to (ab)use the TDB key store as a bitmap to find out if we've already written a block to the undo file, because the tdb code is reads the database key btree disk blocks for every query. Changing that logic to a bitmap reduces overhead by a large margin -- the overhead of using undo_io while converting a 2TB FS to metadata_csum is reduced from 55 minutes to 45. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-05-05 10:39:33 -04:00
Darrick J. Wong	344cd5325b	undo-io: be more flexible about setting block size Most of the e2fsprogs utilities set the IO block size multiple times (once to 1k to read the superblock, then again to set the real block size if we find a real superblock). Unfortunately, the undo IO manager only lets the block size be set once. For the non-mke2fs utilities we'd rather catch the real block size and use that. mke2fs of course wants to use a really large block size since it's probably writing a lot of data. Therefore, if we haven't written any blocks to the undo file, it's perfectly fine to allow block size changes. For mke2fs, we'll modify the IO channel option that lets us set the huge size to lock that in place. This greatly reduces index overhead for undo files for e2fsck/tune2fs/resize2fs while continuing the practice of reducing it even more for mke2fs. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-05-05 10:39:13 -04:00
Darrick J. Wong	c866515f02	undo-io: add new calls to and speed up the undo io manager Implement pass-through calls for discard, zero-out, and readahead in the IO manager so that we can take advantage of any underlying support. Furthermore, improve tdb write-out speed by disabling locking and only fsyncing at the end -- we don't care about locking because having multiple writers to the undo file will produce an undo database full of garbage blocks; and we only need to fsync at the end because if we fail before the end, our undo file will lack the necessary superblock data that e2undo requires to do replay safely. Without this, we call fsync four times per tdb update(!) This reduces the overhead of using undo_io while converting a 2TB FS to metadata_csum from 3+ hours to 55 minutes. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-05-05 10:38:34 -04:00
Theodore Ts'o	c46b57bc9d	ext2fs: fix "make check" by allowing EXT2FS_SHA256_LENGTH to be defined Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-05-05 10:01:40 -04:00
Theodore Ts'o	437651ad23	Update ext4 encryption format to final v4.1 version The directory hash is now calculated using the on-disk encrypted filename, and we no longer use the digest encoding or the SHA-256 encoding, so remove them from the ext2fs library until there is some reason we need them. Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-05-03 17:01:59 -04:00
Darrick J. Wong	2f79cd18a9	tests: verify rebuilding of sparse extent trees & block map file conversion Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-04-21 16:23:09 -04:00
Darrick J. Wong	e228d700d5	e2fsck: rebuild sparse extent trees & convert non-extent ext3 files Teach e2fsck to (re)construct extent trees. This enables us to do either of the following: compress a highly sparse extent tree into fewer ETB blocks; or convert a ext3-style block mapped file to an extent file. The reconstruction is performed during pass 1E or 3A, as detailed below. For files that are already extent based, this algorithm will automatically run (pending user approval) if pass1 determines either (1) that a whole level of extent tree will fit into a higher level of the tree; (2) that the size of any level can be reduced by at least one ETB block; or (3) the extent tree is unnecessarily deep. It will not run at all if errors are found and the user declines to fix the errors. The option "-E bmap2extent" can be used to force e2fsck to convert all block map files to extent trees, and to rebuild all extent files' extent trees. After conversion, files larger than 12 blocks should be defragmented to eliminate empty holes where a block lives. The extent tree constructor is pretty dumb -- it creates a list of leaf extents (adjacent extents are collapsed), marks all indirect blocks / ETB blocks free, installs a new extent tree root in the inode, then loads the leaf extents into the tree. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-04-21 16:22:59 -04:00
Darrick J. Wong	a5abfe0382	e2fsck: read-ahead metadata during passes 1, 2, and 4 e2fsck pass1 is modified to use the block group data prefetch function to try to fetch the inode tables into the pagecache before it is needed. We iterate through the blockgroups until we have enough inode tables that need reading such that we can issue readahead; then we sit and wait until the last inode table block read of the last group to start fetching the next bunch. pass2 is modified to use the dirblock prefetching function to prefetch the list of directory blocks that are assembled in pass1. We use the "iterate a subset of a dblist" and avoid copying the dblist. Directory blocks are fetched incrementally as we walk through the directory block list. In previous iterations of this patch we would free the directory blocks after processing, but the performance hit to e2fsck itself wasn't worth it. Furthermore, it is anticipated that most users will then mount the FS and start using the directories, so they may as well remain in the page cache. pass4 is modified to prefetch the block and inode bitmaps in anticipation of pass 5, because pass4 is entirely CPU bound. In general, these mechanisms can decrease fsck time by 10-40%, if the host system has sufficient memory and the storage system can provide a lot of IOPs. Pretty much any storage system capable of handling multiple IOs in-flight at any time will see a fairly large performance boost. (Single-issue USB mass storage disks seem to suffer badly.) By default, the readahead buffer size will be set to the size of a block group's inode table (which is 2MiB for a regular ext4 FS). The -E readahead_kb= option can be given to specify the amount of memory to use for readahead or zero to disable it entirely; or an option can be given in e2fsck.conf. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-04-21 10:40:21 -04:00
Darrick J. Wong	79614b2709	libext2fs/e2fsck: provide routines to read-ahead metadata This patch adds to e2fsck the ability to pre-fetch metadata into the page cache in the hopes of speeding up fsck runs. There are two new functions -- the first allows a caller to readahead a list of blocks, and the second is a helper function that uses that first mechanism to load group data (bitmaps, inode tables). These new e2fsck routines require the addition of a dblist API to allow us to iterate a subset of a dblist. This will enable incremental directory block readahead in e2fsck pass 2. There's also a function to estimate the readahead given a FS. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-04-21 10:40:15 -04:00
Darrick J. Wong	76761ca221	e2fsck: turn inline data symlink into a fast symlink when possible When there's a problem accessing the EA part of an inline data symlink and we want to truncate the symlink back to 60 characters (hoping the user can re-establish the link later on, apparently) be sure to turn off the inline data flag to convert the symlink back to a regular fast symlink. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-04-20 21:48:02 -04:00
Darrick J. Wong	e0d5dd3602	e2fuzz: fuzz harder Once we've "fixed" the filesystem, try mounting and modifying it to see if we can break the kernel. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-04-20 21:47:18 -04:00
Theodore Ts'o	a6721909c2	Revert "libext2fs: encrypted symlinks are never fast" This reverts commit `ae73e88e82`. The latest kernel patches will now create fast encrypted symlinks Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-04-12 18:05:07 -04:00
Theodore Ts'o	fc898cb99b	Reserve superblock fields s_lpf_ino and s_encryption_level The s_lpf_ino field is intended to store the location of the lost and found directory if the root directory becomes encrypted (which is not yet supported). The s_encryption_level field is designed to allow support for future changes in the on-disk ext4 encryption format while this feature under development, without having to burn a large number of bits in the incompat feature flag. Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-04-12 08:51:53 -04:00
Theodore Ts'o	4a05268cf8	Remove compression support The compression patches were an out-of-kernel patch set that was (a) only available for ext2, (b) something that was never could be stablized due to file system corruption, and (c) the most recent patches were for 3.1, last updated in 2011. The history of the compression patches has been a bit checkered. There is a long history here at http://e2compr.sourceforge.net which lists the perspective of the people working on it from the e2compr side. From the ext2/3/4 mainline developers' perspective, initial compression support was added to e2fsprogs in 2000 (in the Linux 2.2 era), but due to stability concerns the kernel patches were never merged into the mainline kernel. While there were some sporadic efforts to try to get the ext2 compression patches working in the 2.4 and 2.6 era, by that time mainline work had moved on to ext4, and the e2compr approach could only work with 32-bit block numbers and indirect mapped files. Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-04-12 08:42:40 -04:00
Theodore Ts'o	8dbcedd702	Merge branch 'maint' into next	2015-04-05 20:44:39 -04:00
Theodore Ts'o	a0556bd8e1	e4crypt: add the get_policy command Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-04-05 20:43:24 -04:00
Theodore Ts'o	654531df2a	tune2fs: add ability to enable the encrypt feature Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-04-05 20:42:58 -04:00
Theodore Ts'o	f7257a93f9	Change filename encryption to use CTS mode Previously we were using a weird hybrid CBC/CTS. Switch things so we are using straight CTS; this corresponds to changes made in the latest ext4 encryption patches. Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-04-05 20:39:57 -04:00
Theodore Ts'o	8afaf3be33	libext2fs: fix bug in ext2fs_digest_encode() The ext2fs_digest_encode() function was broken for any input which was a multiple of 3. Previously we never hit that case, so we never noticed it was busted. Also fix up the unit test so future problems like this get noticed quickly. Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-04-05 20:35:50 -04:00
Theodore Ts'o	4fb758aa4b	Clean up and fix Android build files Add missing new lib/ext2fs source files that were added for encryption support. Also move configuration #define's from individual Android.mk to the android_config.h file, since we've moved away from specifying configuration #define's on the command-line upstream. Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-03-30 14:50:55 -04:00
Theodore Ts'o	8b5c6c78d5	Update version.h Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-03-30 02:13:09 -04:00
Theodore Ts'o	1e734e72e1	e4crypt: change the UI to use a subcommand style Also add a new subcommand "new_session", which works much like keyctl new_session does. Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-03-30 02:13:09 -04:00
Darrick J. Wong	ce93d0ea3d	libext2fs: zero hash in ibody extended attributes The kernel never updates the extended attribute hash value for attributes stored in the inode. However, fsck has always checked this value (if it's nonzero) and will complain if the hash doesn't match the xattr. Therefore, always zero the hash value when writing to in-ibody xattrs to avoid creating "corrupt" attribute errors downstream. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-03-29 00:12:53 -04:00
Darrick J. Wong	dbb328576d	e2fsck: actually fix inline_data flags problems when user says to do so fix_problem() returning 1 means to fix the fs error, so do that. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-03-29 00:04:46 -04:00
Darrick J. Wong	fae2467fb6	libext2fs: ext2fs_new_block2() should call alloc_block hook If ext2fs_new_block2() is called without a specific block map, we should call the alloc_block hook before checking fs->block_map. This helps us to avoid a bug in e2fsck where we need to allocate a block but instead of consulting block_found_map, we use the FS bitmaps, which (prior to pass 5) could be wrong. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-03-28 23:58:20 -04:00
Darrick J. Wong	3d28f54589	libext2fs: zero blocks via FALLOC_FL_ZERO_RANGE in ext2fs_zero_blocks Plumb a new call into the IO manager to support translating ext2fs_zero_blocks calls into the equivalent FALLOC_FL_ZERO_RANGE fallocate flag primitive when possible. This patch provides _only_ support for file-based images. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-03-28 23:08:25 -04:00
Theodore Ts'o	f096708126	e2fsck: use PROMPT_NONE for FUTURE_SB_LAST_*_FUDGED problems This allows us to print a message warning the user that there is something funny going on with their hardware clock (probably time zone issues caused by trying to be compatible with legacy OS's such as Windows), without triggering a full file system check. Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-03-28 21:39:54 -04:00
Theodore Ts'o	41f2210131	Add support for a password salt stored in the superblock Previously, e4crypt required the user to manually specify the salt used for their passphrase. This was user unfriendly to say the least. The e4crypt program can now request the salt using an ioctl, which will automatically generate the salt if necessary, and keep it in the ext4 superblock. Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-03-28 20:15:02 -04:00
Ildar Muslukhov	bfa4b350b1	misc: add e4crypt tool This patch adds new e4crypt tool for encryption management in the ext4 filesystem. Signed-off-by: Ildar Muslukhov <muslukhovi@gmail.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-03-26 09:30:03 -04:00
Theodore Ts'o	c4241cf50a	libext2fs: fix blocksize for SHA512 The blocksize of SHA512 is 128 bytes, not 512. Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-03-26 00:17:48 -04:00
Ildar Muslukhov	bbb859496a	misc: teach mke2fs to create encrypted file systems Also enable support for encryption in e2fsprogs. Signed-off-by: Ildar Muslukhov <muslukhovi@gmail.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-03-08 23:29:04 -04:00
Theodore Ts'o	62ad24802c	e2fsck: handle encrypted directories which are indexed using htree Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-03-08 19:09:52 -04:00
Theodore Ts'o	6a5bdaf73d	libext2fs: fix up ext2fs_sha256() and ext2fs_sha512() Add const annotation to the input pointers; also run the tst_sha256 and tst_sha512 unit tests on a "make check". Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-03-08 18:19:05 -04:00
Theodore Ts'o	bf34b4af70	libext2fs: add ext2fs_digest_encode() Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-03-08 18:15:47 -04:00
Theodore Ts'o	68a1de3df3	debugfs: pretty print encrypted filenames in the ls command Added the -r (raw) option to print the actual encrypted entry. Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-03-08 18:04:04 -04:00
Theodore Ts'o	baa14bd17f	e2fsck: fix spurious duplicate directory entries with encrypted filenames Use memcmp() instead of strncmp() since encrypted directory names can contain NUL characters. For non-encrypted directories, we've already checked for the case of NUL characters in file names, so it's safe to use memcmp() here in all cases. Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-03-02 11:40:18 -05:00
Theodore Ts'o	ae73e88e82	libext2fs: encrypted symlinks are never fast Teach ext2fs_inodes_has_valid_blocks2() that encrypted symlinks always use an external block (i.e., we never try to store the symlink in the i_blocks[] array if it is encrypted). Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-03-01 16:58:46 -05:00
Theodore Ts'o	321f3446f3	Add files to build on Android The Android.mk files were taken from the Android AOSP sources, and updated for the 1.43 next branch. The intention is that this will allow the repository which is currently located in external/e2fsprogs with one which is based off of the upstream e2fsprogs. Right now external/e2fsprogs was not created using "git clone", so it means that git merges don't work. After the external/e2fsprogs Android repository is replaced, with one based off the upstream repository, Android will be able to synchronize with the upstream repository by pulling and merging from upstream, and then running the script "./util/gen-android-files" to update any generated files. (This is necessary because in the Android build system, the Android.mk files are rather stylized and don't make it easy to run arbitrary shell scripts during the build phase.) Signed-off-by: "Theodore Ts'o" <tytso@mit.edu>	2015-03-01 15:45:11 -05:00
Theodore Ts'o	c379cdc334	e2fsck: clean up assertions in dict.c The C preprocessing symbol NDEBUG is also defined (differently) by Android's build files, and this was causing compilation failures. So change assert() to dict_assert() and manually define it instead of relying on the NDEBUG and <assert.h> semantics. Also make sure the necessary debugging functions are available is DICT_NODEBUG is not defined, so that dict.c will correctly build with and without DICT_NODEBUG. Signed-off-by: "Theodore Ts'o" <tytso@mit.edu>	2015-03-01 15:44:55 -05:00
Theodore Ts'o	52a06740ef	libext2fs: make sure dirent functions have prototypes if inline is disabled Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-02-23 23:00:17 -05:00
Theodore Ts'o	569ee9020d	libext2fs: add functions for sha256 and sha512 Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-02-23 22:38:46 -05:00
Theodore Ts'o	b59014c994	libe2p: add support for printing and parsing the encryption mode Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-02-23 18:05:21 -05:00
Theodore Ts'o	dbff534ec6	e2fsck: suppress bad name checks for encrypted directories Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-02-23 17:44:23 -05:00
Theodore Ts'o	8b39e4cf77	Add support for the read-only feature Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-02-23 13:04:47 -05:00
Darrick J. Wong	aeb933b3f2	e2fsck: add a 'yes to all' response in interactive mode Provide a mechanism for a user to switch fsck into '-y' mode if they start an interactive session and then get tired of pressing 'y' in response to numerous prompts. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-02-16 10:41:20 -05:00
Theodore Ts'o	ad5d05d645	Merge branch 'maint' into next	2015-02-16 10:17:21 -05:00
Theodore Ts'o	49d0fe2a14	libext2fs: fix potential buffer overflow in closefs() The bug fix in f66e6ce4446: "libext2fs: avoid buffer overflow if s_first_meta_bg is too big" had a typo in the fix for ext2fs_closefs(). In practice most of the security exposure was from the openfs path, since this meant if there was a carefully crafted file system, buffer overrun would be triggered when the file system was opened. However, if corrupted file system didn't trip over some corruption check, and then the file system was modified via tune2fs or debugfs, such that the superblock was marked dirty and then written out via the closefs() path, it's possible that the buffer overrun could be triggered when the file system is closed. Also clear up a signed vs unsigned warning while we're at it. Thanks to Nick Kralevich <nnk@google.com> for asking me to look at compiler warning in the code in question, which led me to notice the bug in `f66e6ce444`. Addresses: CVE-2015-1572 Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-02-11 15:06:18 -05:00
Darrick J. Wong	4a3dc1f0b6	e2fsck: salvage under-sized dirents by removing them If the directory processing code ends up pointing to a directory entry that's so close to the end of the block that there's not even space for a rec_len/name_len, just substitute dummy values that will force e2fsck to extend the previous entry to cover the remaining space. We can't use the helper methods to extract rec_len because that's reading off the end of the buffer. This isn't an issue with non-inline directories because the directory check buffer is zero-extended so that fsck won't blow up. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu>	2015-01-29 11:09:07 -05:00

... 2 3 4 5 6 ...

5335 Commits (dcb8e1fa0452a3d31b678351c2297ad0255d3e4b) All Branches Search

5335 Commits (dcb8e1fa0452a3d31b678351c2297ad0255d3e4b)

All Branches