linux

Commit Graph

Author	SHA1	Message	Date
Chris Mason	a74a4b97b6	Btrfs: Replace the transaction work queue with kthreads This creates one kthread for commits and one kthread for deleting old snapshots. All the work queues are removed. Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:04:03 -04:00
Chris Mason	89ce8a63d0	Add btrfs_end_transaction_throttle to force writers to wait for pending commits The existing throttle mechanism was often not sufficient to prevent new writers from coming in and making a given transaction run forever. This adds an explicit wait at the end of most operations so they will allow the current transaction to close. There is no wait inside file_write, inode updates, or cow filling, all which have different deadlock possibilities. This is a temporary measure until better asynchronous commit support is added. This code leads to stalls as it waits for data=ordered writeback, and it really needs to be fixed. Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:04:03 -04:00
Chris Mason	a213501153	Btrfs: Replace the big fs_mutex with a collection of other locks Extent alloctions are still protected by a large alloc_mutex. Objectid allocations are covered by a objectid mutex Other btree operations are protected by a lock on individual btree nodes Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:04:03 -04:00
Chris Mason	925baeddc5	Btrfs: Start btree concurrency work. The allocation trees and the chunk trees are serialized via their own dedicated mutexes. This means allocation location is still not very fine grained. The main FS btree is protected by locks on each block in the btree. Locks are taken top / down, and as processing finishes on a given level of the tree, the lock is released after locking the lower level. The end result of a search is now a path where only the lowest level is locked. Releasing or freeing the path drops any locks held. Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:04:03 -04:00
Sven Wegener	3b96362cc8	Btrfs: Invalidate dcache entry after creating snapshot and We need to invalidate an existing dcache entry after creating a new snapshot or subvolume, because a negative dache entry will stop us from accessing the new snapshot or subvolume. --- ctree.h \| 23 +++++++++++++++++++++++ inode.c \| 4 ++++ transaction.c \| 4 ++++ 3 files changed, 31 insertions(+) Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:04:03 -04:00
Chris Mason	48ec2cf873	Btrfs: Fix race in running_transaction checks When a new transaction was started, the code would incorrectly set the pointer in fs_info before all the data structures were setup. fsync heavy workloads hit races on the setup of the ordered inode spinlock Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:04:03 -04:00
Chris Mason	a061fc8da7	Btrfs: Add support for online device removal This required a few structural changes to the code that manages bdev pointers: The VFS super block now gets an anon-bdev instead of a pointer to the lowest bdev. This allows us to avoid swapping the super block bdev pointer around at run time. The code to read in the super block no longer goes through the extent buffer interface. Things got ugly keeping the mapping constant. Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:04:02 -04:00
Chris Mason	d6bfde8765	Btrfs: Fixes for 2.6.18 enterprise kernels 2.6.18 seems to get caught in an infinite loop when cancel_rearming_delayed_workqueue is called more than once, so this switches to cancel_delayed_work, which is arguably more correct. Also, balance_dirty_pages can run into problems with 2.6.18 based kernels because it doesn't have the per-bdi dirty limits. This avoids calling balance_dirty_pages on the btree inode unless there is actually something to balance, which is a good optimization in general. Finally there's a compile fix for ordered-data.h Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:04:02 -04:00
Chris Mason	81d7ed29ff	Btrfs: Throttle file_write when data=ordered is flushing the inode Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:04:02 -04:00
Chris Mason	ce9adaa5a7	Btrfs: Do metadata checksums for reads via a workqueue Before, metadata checksumming was done by the callers of read_tree_block, which would set EXTENT_CSUM bits in the extent tree to show that a given range of pages was already checksummed and didn't need to be verified again. But, those bits could go away via try_to_releasepage, and the end result was bogus checksum failures on pages that never left the cache. The new code validates checksums when the page is read. It is a little tricky because metadata blocks can span pages and a single read may end up going via multiple bios. Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:04:01 -04:00
Chris Mason	0b86a832a1	Btrfs: Add support for multiple devices per filesystem Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:04:00 -04:00
Chris Mason	80b6794d11	Btrfs: Lower stack usage in transaction.c Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:04:00 -04:00
Chris Mason	4529ba495c	Btrfs: Add data block hints to SSD mode too Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:04:00 -04:00
Chris Mason	d1310b2e0c	Btrfs: Split the extent_map code into two parts There is now extent_map for mapping offsets in the file to disk and extent_io for state tracking, IO submission and extent_bufers. The new extent_map code shifts from [start,end] pairs to [start,len], and pushes the locking out into the caller. This allows a few performance optimizations and is easier to use. A number of extent_map usage bugs were fixed, mostly with failing to remove extent_map entries when changing the file. Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:03:59 -04:00
Chris Mason	e18e4809b1	Btrfs: Add mount -o ssd, which includes optimizations for seek free storage Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:03:59 -04:00
Chris Mason	4d5e74bc0a	Btrfs: Fix data=ordered vs wait_on_inode deadlock on older kernels Using ilookup5 during data=ordered writeback could deadlock on I_LOCK. This saves a pointer to the inode instead. Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:03:59 -04:00
Chris Mason	2da98f003f	Btrfs: Run igrab on data=ordered inodes to prevent deadlocks during writeout Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:03:59 -04:00
Chris Mason	cee36a03e8	Rework btrfs_drop_inode to avoid scheduling Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:03:59 -04:00
Chris Mason	e2008b6140	Btrfs: Add some simple throttling to wait for data=ordered and snapshot deletion Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:03:59 -04:00
Chris Mason	3063d29f2a	Btrfs: Move snapshot creation to commit time It is very difficult to create a consistent snapshot of the btree when other writers may update the btree before the commit is done. This changes the snapshot creation to happen during the commit, while no other updates are possible. Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:03:59 -04:00
Chris Mason	dc17ff8f11	Btrfs: Add data=ordered support This forces file data extents down the disk along with the metadata that references them. The current implementation is fairly simple, and just writes out all of the dirty pages in an inode before the commit. Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:03:59 -04:00
Chris Mason	4313b3994d	Btrfs: Reduce stack usage in the resizer, fix 32 bit compiles Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:03:58 -04:00
Chris Mason	6da6abae02	Btrfs: Back port to 2.6.18-el kernels Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:03:58 -04:00
Christian Hesse	17636e03f4	Btrfs: section mismatch warnings --Boundary-00=_CcOWHFYK4T+JwSj Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Hello everybody, compiling btrfs into the kernel results in section mismatch warnings. __exit functions are called where they are not allowed to. The attached patch fixes this for me. Not sure if it is correct though. Signed-off-by: Christian Hesse <mail@earthworm.de> -- Regards, Chris --Boundary-00=_CcOWHFYK4T+JwSj Content-Type: text/x-diff; charset="iso-8859-1"; name="btrfs-section_mismatches.patch" Content-Transfer-Encoding: 7bit Content-Disposition: attachment; filename="btrfs-section_mismatches.patch" Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:03:58 -04:00
Chris Mason	35ebb934bd	Btrfs: Fix PAGE_CACHE_SHIFT shifts on 32 bit machines Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:03:57 -04:00
Chris Mason	a6b6e75e09	Btrfs: Defrag only leaves, and only when the parent node has a single objectid This allows us to defrag huge directories, but skip the expensive defrag case in more common usage, where it does not help as much. Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:03:57 -04:00
Chris Mason	4dc119046d	Btrfs: Add an extent buffer LRU to reduce radix tree hits Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:03:56 -04:00
Chris Mason	6b80053d02	Btrfs: Add back the online defragging code Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:03:56 -04:00
Chris Mason	db94535db7	Btrfs: Allow tree blocks larger than the page size Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:03:56 -04:00
Chris Mason	1a5bc167f6	Btrfs: Change the remaining radix trees used by extent-tree.c to extent_map trees Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:03:56 -04:00
Chris Mason	f510cfecfc	Btrfs: Fix extent_buffer and extent_state leaks Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:03:56 -04:00
Chris Mason	5f39d397df	Btrfs: Create extent_buffer interface for large blocksizes Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:03:56 -04:00
Chris Mason	d3c2fdcf7b	Btrfs: Use balance_dirty_pages_nr on btree blocks btrfs_btree_balance_dirty is changed to pass the number of pages dirtied for more accurate dirty throttling. This lets the VM make better decisions about when to force some writeback. Signed-off-by: Chris Mason <chris.mason@oracle.com>	2008-09-25 11:00:48 -04:00
Chris Mason	5ce14bbcdd	Btrfs: Find and remove dead roots the first time a root is loaded. Dead roots are trees left over after a crash, and they were either in the process of being removed or were waiting to be removed when the box crashed. Before, a search of the entire tree of root pointers was done on mount looking for dead roots. Now, the search is done the first time we load a root. This makes mount faster when there are a large number of snapshots, and it enables the block accounting code to properly update the block counts on the latest root as old versions of the root are reaped after a crash. Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-09-11 11:15:39 -04:00
Josef Bacik	58176a9604	Btrfs: Add per-root block accounting and sysfs entries Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-08-29 15:47:34 -04:00
Josef Bacik	15ee9bc7ed	Btrfs: delay commits during fsync to allow more writers Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-08-10 16:22:09 -04:00
Chris Mason	e9d0b13b5b	Btrfs: Btree defrag on the extent-mapping tree as well Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-08-10 14:06:19 -04:00
Chris Mason	409eb95d7f	Btrfs: Further reduce the concurrency penalty of defrag and drop_snapshot Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-08-08 20:17:12 -04:00
Chris Mason	26b8003f10	Btrfs: Replace extent tree preallocation code with some bit radix magic. Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-08-08 20:17:12 -04:00
Chris Mason	f4468e94c8	Btrfs: Let some locks go during defrag and snapshot dropping Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-08-08 10:08:58 -04:00
Chris Mason	6702ed490c	Btrfs: Add run time btree defrag, and an ioctl to force btree defrag This adds two types of btree defrag, a run time form that tries to defrag recently allocated blocks in the btree when they are still in ram, and an ioctl that forces defrag of all btree blocks. File data blocks are not defragged yet, but this can make a huge difference in sequential btree reads. Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-08-07 16:15:09 -04:00
Chris Mason	9f3a742736	Btrfs: Do snapshot deletion in smaller chunks. Before, snapshot deletion was a single atomic unit. This caused considerable lock contention and required an unbounded amount of space. Now, the drop_progress field in the root item is used to indicate how far along snapshot deletion is, and to resume where it left off. Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-08-07 15:52:19 -04:00
Zach Brown	ec6b910fb3	Btrfs: trivial include fixups Almost none of the files including module.h need to do so, remove them. Include sched.h in extent-tree.c to silence a warning about cond_resched() being undeclared. Signed-off-by: Zach Brown <zach.brown@oracle.com> Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-07-11 10:00:37 -04:00
Chris Mason	ccd467d60e	Btrfs: crash recovery fixes Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-06-28 15:57:36 -04:00
Chris Mason	4b52dff6d3	Btrfs: Fix super block updates during transaction commit The super block written during commit was not consistent with the state of the trees. This change adds an in-memory copy of the super so that we can make sure to write out consistent data during a commit. Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-06-26 10:06:50 -04:00
Chris Mason	22bb92f376	Btrfs: Documentation update Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-06-22 14:49:31 -04:00
Chris Mason	5eda7b5e9b	Btrfs: Add the ability to find and remove dead roots after a crash. Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-06-22 14:16:25 -04:00
Chris Mason	54aa1f4dfd	Btrfs: Audit callers and return codes to make sure -ENOSPC gets up the stack Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-06-22 14:16:25 -04:00
Chris Mason	8c2383c3dd	Subject: Rework btrfs_file_write to only allocate while page locks are held Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-06-18 09:57:58 -04:00
Chris Mason	340887809d	Btrfs: i386 fixes from axboe Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-06-12 11:36:58 -04:00
Chris Mason	6cbd557078	Btrfs: add GPLv2 Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-06-12 09:07:21 -04:00
Chris Mason	0cf6c62017	Btrfs: remove device tree Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-06-09 09:22:25 -04:00
Chris Mason	ad693af684	Btrfs: reap dead roots right after commit Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-06-09 08:19:57 -04:00
Chris Mason	facda1e787	Btrfs: get forced transaction commits via workqueue Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-06-08 18:11:48 -04:00
Chris Mason	08607c1b18	Btrfs: add compat ioctl Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-06-08 15:33:54 -04:00
Chris Mason	e37c9e6921	Btrfs: many allocator fixes, pretty solid Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-05-09 20:13:14 -04:00
Chris Mason	35b7e47610	Btrfs: fix page cache memory leak Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-05-02 15:53:43 -04:00
Chris Mason	31f3c99b73	Btrfs: allocator improvements, inode block groups Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-04-30 15:25:45 -04:00
Chris Mason	7c4452b9a6	Btrfs: smarter transaction writeback Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-04-28 09:29:35 -04:00
Chris Mason	9078a3e1e4	Btrfs: start of block group code Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-04-26 16:46:15 -04:00
Chris Mason	8fd17795b2	Btrfs: early fsync support Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-04-19 21:01:03 -04:00
Chris Mason	8352d8a473	Btrfs: add disk ioctl, mostly working Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-04-12 10:43:05 -04:00
Chris Mason	7eccb903a8	Btrfs: create a logical->phsyical block number mapping scheme Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-04-11 15:53:25 -04:00
Chris Mason	2619ba1f0f	Btrfs: subvolumes Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-04-10 16:58:11 -04:00
Chris Mason	d0dbc6245c	Btrfs: drop owner and parentid Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-04-10 12:36:36 -04:00
Chris Mason	1b05da2ee6	Btrfs: drop the inode map tree Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-04-10 12:13:09 -04:00
Chris Mason	0f7d52f443	Btrfs: groundwork for subvolume and snapshot roots Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-04-09 10:42:37 -04:00
Chris Mason	d6e4a428eb	Btrfs: start of support for many FS volumes Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-04-06 15:37:36 -04:00
Chris Mason	7cfcc17e7f	Btrfs: corruptions fixed Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-04-02 14:53:59 -04:00
Chris Mason	2c90e5d658	Btrfs: still corruption hunting Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-04-02 10:50:19 -04:00
Chris Mason	d602557953	Btrfs: corruption hunt continues Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-03-30 14:27:56 -04:00
Chris Mason	22b0ebda6c	Btrfs: hunting slab corruption Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-03-30 08:47:31 -04:00
Chris Mason	d98237b3ed	Btrfs: use a btree inode instead of sb_getblk Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-03-28 13:57:48 -04:00
Chris Mason	78fae27ebf	Btrfs: leak fixes, pinning fixes Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-03-25 11:35:08 -04:00
Chris Mason	d571976292	btrfs_create, btrfs_write_super, btrfs_sync_fs Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-03-23 10:01:08 -04:00
Chris Mason	79154b1b5b	Btrfs: transaction rework Signed-off-by: Chris Mason <chris.mason@oracle.com>	2007-03-22 15:59:16 -04:00

... 2 3 4 5 6

276 Commits