linux

Commit Graph

Author	SHA1	Message	Date
Sage Weil	dbad185d49	ceph: drop src address(es) from message header [new protocol feature] The CEPH_FEATURE_NOSRCADDR protocol feature avoids putting the full source address in each message header (twice). This patch switches the client to the new scheme, and _requires_ this feature on the server. The server will support both the old and new schemes. That means an old client will work with a new server, but a new client will not work with an old server. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:32 -07:00
Dan Carpenter	a5ee751c15	ceph: cleanup: remove unused assignement We don't ever use "dirty" so we can remove it. Signed-off-by: Dan Carpenter <error27@gmail.com> Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:32 -07:00
Sage Weil	0f8605f2bd	ceph: clean up cap release loop vs spinlock Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:31 -07:00
Sage Weil	31e0cf8f6a	ceph: name bdi ceph-%d instead of major:minor The bdi_setup_and_register() helper doesn't help us since we bdi_init() in create_client() and bdi_register() only when sget() succeeds. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:30 -07:00
Sage Weil	56b7cf9581	ceph: skip mds sync on forced unmount Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:30 -07:00
Sage Weil	b736b3d9d0	ceph: adjust masked struct_v variable names Reported-by: Bill Pemberton <wfp5p@virginia.edu> Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:29 -07:00
Sage Weil	6e19a16ef2	ceph: clean up mount options, ->show_options() Ensure all options are included in /proc/mounts. Some cleanup. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:29 -07:00
Sage Weil	1cd3935bed	ceph: set dn offset when spliced We want to assign an offset when the dentry goes from null to linked, which is always done by splice_dentry(). Notably, we should NOT assign an offset when a dentry is first created and is still null. BUG if we try to splice a non-null dentry (we shouldn't). Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:28 -07:00
Sage Weil	1b7facc41b	ceph: don't clobber i_max_offset on already complete dir This can screw up offsets assigned to new dentries and break dcache readdir results. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:27 -07:00
Sage Weil	e8a7498715	ceph: skip set_dentry_offset work if directory not I_COMPLETE Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:27 -07:00
Sage Weil	f1f2765fae	ceph: set next_offset on readdir finish Set next_offset to 2 (always 2!), not 0, on readdir finish. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:26 -07:00
Henry C Chang	bddfa3cc18	ceph: listxattr should compare version by >= If the version hasn't changed, don't rebuild the index. Signed-off-by: Henry C Chang <henry_c_chang@tcloudcomputing.com> Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:26 -07:00
Sage Weil	a6424e48c8	ceph: fix xattr dangling pointer / double free If we use the xattr_blob, clear the pointer so we don't release the memory at the bottom of the fuction. Reported-by: Henry C Chang <henry_c_chang@tcloudcomputing.com> Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:25 -07:00
Sage Weil	9dd4658db1	ceph: close messenger race Simplify messenger locking, and close race between ceph_con_close() setting the CLOSED bit and con_work() checking the bit, then taking the mutex. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:25 -07:00
Sage Weil	4f48280ee1	ceph: name msgpools; useful error messages Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:24 -07:00
Sage Weil	8c6efb58a5	ceph: fix memory leak due to possible dentry init race Free dentry_info in error path. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:23 -07:00
Sage Weil	559c1e0073	ceph: include auth method in error messages Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:23 -07:00
Sage Weil	f26e681d52	ceph: osdtimeout=0 for now timeout Allow the osd reset timeout to be disabled. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:22 -07:00
Dan Carpenter	0d509c949a	ceph: d_obtain_alias() returns ERR_PTR() d_obtain_alias() doesn't return NULL, it returns an ERR_PTR(). Signed-off-by: Dan Carpenter <error27@gmail.com> Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:22 -07:00
Yehuda Sadeh	c473ad927e	ceph: wake up mount thread when getting osdmap Now that the mount thread waits for the osdmap, it needs to be awaken. Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>	2010-05-17 15:25:21 -07:00
Huang Weiyi	1bb71637d0	ceph: remove unused #includes Remove unused #include's in fs/ceph/super.c Signed-off-by: Huang Weiyi <weiyi.huang@gmail.com> Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:21 -07:00
Sage Weil	6822d00b54	ceph: wait for both monmap and osdmap when opening session Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>	2010-05-17 15:25:20 -07:00
Sage Weil	6f2bc3ff4c	ceph: clean up connection reset Reset out_keepalive_pending and peer_global_seq, and drop unused var. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:20 -07:00
Sage Weil	bb257664f7	ceph: simplify ceph_msg_new We only need to pass in front_len. Callers can attach any other payload pieces (middle, data) as they see fit. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:19 -07:00
Sage Weil	a79832f26b	ceph: make ceph_msg_new return NULL on failure; clean up, fix callers Returning ERR_PTR(-ENOMEM) is useless extra work. Return NULL on failure instead, and fix up the callers (about half of which were wrong anyway). Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:18 -07:00
Sage Weil	d52f847a84	ceph: rewrite msgpool using mempool_t Since we don't need to maintain large pools of messages, we can just use the standard mempool_t. We maintain a msgpool 'wrapper' because we need the mempool_t* in the alloc function, and mempool gives us only pool_data. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:18 -07:00
Cheng Renquan	640ef79d27	ceph: use ceph_sb_to_client instead of ceph_client ceph_sb_to_client and ceph_client are really identical, we need to dump one; while function ceph_client is confusing with "struct ceph_client", ceph_sb_to_client's definition is more clear; so we'd better switch all call to ceph_sb_to_client. -static inline struct ceph_client ceph_client(struct super_block sb) -{ - return sb->s_fs_info; -} Signed-off-by: Cheng Renquan <crquan@gmail.com> Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:17 -07:00
Cheng Renquan	2d06eeb877	ceph: handle kzalloc() failure Signed-off-by: Cheng Renquan <crquan@gmail.com> Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:16 -07:00
Sage Weil	7c315c552c	ceph: drop unnecessary msgpool for mon_client subscribe_ack Preallocate a single message to reuse instead. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:16 -07:00
Sage Weil	6694d6b95c	ceph: drop unnecessary msgpool for mon_client auth_reply Preallocate a single reply message that we can reuse instead. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:15 -07:00
Sage Weil	3143edd3a1	ceph: clean up statfs Avoid unnecessary msgpool. Preallocate reply. Fix use-after-free race. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:15 -07:00
Sage Weil	6f46cb2935	ceph: fix theoretically possible double-put on connection This would only trigger if we bailed out before resetting r_con_filling_msg because the server reply was corrupt (oversized). Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:14 -07:00
Dan Carpenter	c7708075f1	ceph: cleanup: remove dead code "xattr" is never NULL here. We took care of that in the previous if statement block. Signed-off-by: Dan Carpenter <error27@gmail.com> Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:14 -07:00
Sage Weil	104648ad3f	ceph: reduce build_path debug output Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:13 -07:00
Yehuda Sadeh	31459fe4b2	ceph: use __page_cache_alloc and add_to_page_cache_lru Following Nick Piggin patches in btrfs, pagecache pages should be allocated with __page_cache_alloc, so they obey pagecache memory policies. Also, using add_to_page_cache_lru instead of using a private pagevec where applicable. Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:12 -07:00
Stephen Rothwell	f553069e5d	ceph: update for removal of kref_set Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au> Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:12 -07:00
Sage Weil	21b667f69b	ceph: simplify page setup for incoming data Drop largely useless helper __prepare_pages(), and simplify sanity checks. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 15:25:11 -07:00
Sage Weil	81a6cf2d30	ceph: invalidate affected dentry leases on aborted requests If we abort a request, we return to caller, but the request may still complete. And if we hold the dir FILE_EXCL bit, we may not release a lease when sending a request. A simple un-tar, control-c, un-tar again will reproduce the bug (manifested as a 'Cannot open: File exists'). Ensure we invalidate affected dentry leases (as well dir I_COMPLETE) so we don't have valid (but incorrect) leases. Do the same, consistently, at other sites where I_COMPLETE is similarly cleared. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 10:25:45 -07:00
Sage Weil	b4556396fa	ceph: fix race between aborted requests and fill_trace When we abort requests we need to prevent fill_trace et al from doing anything that relies on locks held by the VFS caller. This fixes a race between the reply handler and the abort code, ensuring that continue holding the dir mutex until the reply handler completes. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 10:25:45 -07:00
Sage Weil	e1518c7c0a	ceph: clean up mds reply, error handling We would occasionally BUG out in the reply handler because r_reply was nonzero, due to a race with ceph_mdsc_do_request temporarily setting r_reply to an ERR_PTR value. This is unnecessary, messy, and also wrong in the EIO case. Clean up by consistently using r_err for errors and r_reply for messages. Also fix the abort logic to trigger consistently for all errors that return to the caller early (e.g., EIO from timeout case). If an abort races with a reply, use the result from the reply. Also fix locking for r_err, r_reply update in the reply handler. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-17 10:25:44 -07:00
Sage Weil	e84346b726	ceph: preserve seq # on requeued messages after transient transport errors If the tcp connection drops and we reconnect to reestablish a stateful session (with the mds), we need to resend previously sent (and possibly received) messages with the _same_ seq # so that they can be dropped on the other end if needed. Only assign a new seq once after the message is queued. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-11 21:20:38 -07:00
Sage Weil	f818a73674	ceph: fix cap removal races The iterate_session_caps helper traverses the session caps list and tries to grab an inode reference. However, the __ceph_remove_cap was clearing the inode backpointer _before_ removing itself from the session list, causing a null pointer dereference. Clear cap->ci under protection of s_cap_lock to avoid the race, and to tightly couple the list and backpointer state. Use a local flag to indicate whether we are releasing the cap, as cap->session may be modified by a racing thread in iterate_session_caps. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-11 20:56:31 -07:00
Sage Weil	45c6ceb547	ceph: zero unused message header, footer fields We shouldn't leak any prior memory contents to other parties. And random data, particularly in the 'version' field, can cause problems down the line. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-11 15:17:40 -07:00
Sage Weil	9abf82b8bc	ceph: fix locking for waking session requests after reconnect The session->s_waiting list is protected by mdsc->mutex, not s_mutex. This was causing (rare) s_waiting list corruption. Fix errors paths too, while we're here. A more thorough cleanup of this function is coming soon. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-11 09:53:57 -07:00
Sage Weil	d85b705663	ceph: resubmit requests on pg mapping change (not just primary change) OSD requests need to be resubmitted on any pg mapping change, not just when the pg primary changes. Resending only when the primary changes results in occasional 'hung' requests during osd cluster recovery or rebalancing. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-11 09:53:56 -07:00
Sage Weil	04d000eb35	ceph: fix open file counting on snapped inodes when mds returns no caps It's possible the MDS will not issue caps on a snapped inode, in which case an open request may not __ceph_get_fmode(), botching the open file counting. (This is actually a server bug, but the client shouldn't BUG out in this case.) Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-11 09:53:55 -07:00
Sage Weil	0ceed5db32	ceph: unregister osd request on failure The osd request wasn't being unregistered when the osd returned a failure code, even though the result was returned to the caller. This would cause it to eventually time out, and then crash the kernel when it tried to resend the request using a stale page vector. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-11 09:53:18 -07:00
Sage Weil	54ad023ba8	ceph: don't use writeback_control in writepages completion The ->writepages writeback_control is not still valid in the writepages completion. We were touching it solely to adjust pages_skipped when there was a writeback error (EIO, ENOSPC, EPERM due to bad osd credentials), causing an oops in the writeback code shortly thereafter. Updating pages_skipped on error isn't correct anyway, so let's just rip out this (clearly broken) code to pass the wbc to the completion. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-05 21:31:40 -07:00
Sage Weil	5dfc589a84	ceph: unregister bdi before kill_anon_super releases device name Unregister and destroy the bdi in put_super, after mount is r/o, but before put_anon_super releases the device name. For symmetry, bdi_destroy in destroy_client (we bdi_init in create_client). Only set s_bdi if bdi_register succeeds, since we use it to decide whether to bdi_unregister. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-04 16:14:46 -07:00
Sage Weil	b0930f8d38	ceph: remove bad auth_x kmem_cache It's useless, since our allocations are already a power of 2. And it was allocated per-instance (not globally), which caused a name collision when we tried to mount a second file system with auth_x enabled. Signed-off-by: Sage Weil <sage@newdream.net>	2010-05-03 10:49:25 -07:00

1 2 3 4 5 ...

300 Commits