drtracing.org Git - deliverable/linux.git/commit

author	Lars Ellenberg <lars.ellenberg@linbit.com>
	Mon, 28 Apr 2014 16:43:18 +0000 (18:43 +0200)
committer	Jens Axboe <axboe@fb.com>
	Wed, 30 Apr 2014 19:46:54 +0000 (13:46 -0600)
commit	0e49d7b014c5d591a053d08888a455bd74a88646
tree	addb770d9de32c447e12d0dd51eb383fe40fdc90	tree \| snapshot
parent	6377b9235056452cd5d592c3739baa379a8735fe	commit \| diff

drbd: fix potential distributed deadlock during verify or resync

If max-buffers and socket buffer sizes are "too small" for the chosen
resync rate, this could lead potentially lead to a distributed deadlock,
which may or may not resolve itself via the "ko-count" and request
timeout mechanism, or could be resolved by forced disconnect.

One option to deal with this is proper configuration:
use larger max-buffer and socket buffers settings,
or reduce the resync rate.

But even with bad configuration we should not deadlock,
but "gracefully" recover.

The issue is avoided by using only up to max-buffers/2 for resync
requests, and by using max-buffers not as a hard limit for data buffer
allocations, but as a throttle threshold only.

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
Signed-off-by: Jens Axboe <axboe@fb.com>

drivers/block/drbd/drbd_receiver.c		diff \| blob \| blame \| history
drivers/block/drbd/drbd_worker.c		diff \| blob \| blame \| history