vitastor

antilles

vitastor

Author	SHA1	Message	Date
Vitaliy Filippov	1c10430ae1	Release 0.9.4 - Improve QEMU driver performance by integrating io_uring in it (up to 1.5x total iops improvement) - Fix QEMU driver deadlocks which started to reproduce in qemu-img after iothread fixes - Fix `vitastor-cli status` reporting more etcds than actually exists (fix etcd address duplication in config on reload) - Fix `vitastor-cli ls` crashing on inodes in non-existing pools - Delete old garbage /pool/stats/ keys for non-existing (deleted) pools - Reduce memory usage of etcds initialized by make-etcd script - Fix OSDs almost always crashing on etcd restart due to "revisions were compacted" (support reloading state from etcd) - Fix a crash and a stall possible mostly in HDD setups with small journal and big (512k, 900k) random writes - Add notes about HDDs to documentation. You are officially allowed to use HDD-only Vitastor with HGST/Toshiba/EXOS :)	2023-07-19 02:50:30 +03:00
Vitaliy Filippov	d0e257ee81	Fix non-existing pool handling in `vitastor-cli ls`	2023-07-18 23:52:02 +03:00
Vitaliy Filippov	9815d70ffc	It is impossible to use io_uring with older vitastor-client because it does not have vitastor_c_uring_has_work()	2023-07-18 23:37:53 +03:00
Vitaliy Filippov	4a4627dcab	Do not use bool in C library	2023-07-18 23:37:53 +03:00
Vitaliy Filippov	ba7427020e	Fix deadlocks possible in qemu-img after fixing iothread Deadlock was caused by switching QEMU coroutines directly inside vitastor_co_read_bitmap_cb() callback. The correct way is to schedule a BH /BH is a QEMU term for setImmediate() :)/, same as in read and write callbacks.	2023-07-18 23:32:16 +03:00
Vitaliy Filippov	ac7b834af3	Disable journal_no_same_sector_overwrites by default for HDD-only	2023-07-10 00:34:35 +03:00
Vitaliy Filippov	57ad4c3636	Add a note about HDD, enable throttling only for hybrid OSDs	2023-07-09 12:45:11 +03:00
Vitaliy Filippov	b7e4d0c9bf	Fix journal dirty_start position tracking and some debug prints Fixes two bugs found during HDD testing :-) 1) OSD crashed with "BUG: Attempt to overwrite used offset of the journal" during `fio -bs=900k -iodepth=128` test with 16 MB journal 2) OSD stalled during `fio -bs=512k -iodepth=128` test with 64 MB journal	2023-07-09 01:17:55 +03:00
Vitaliy Filippov	161a23c966	Support reloading state when etcd says "revisions were compacted" Before this change, OSDs almost always died when one of the etcds was restarted, even though the rest of them was still in quorum and the lease was still active	2023-07-07 01:33:48 +03:00
Vitaliy Filippov	45c0694853	Clear etcd_local addresses on reload and also skip duplicates	2023-07-06 00:39:39 +03:00
Vitaliy Filippov	30ac899074	Make QEMU driver compatible with older vitastor_client and with systems without io_uring	2023-07-04 15:51:43 +03:00
Vitaliy Filippov	2348d39cf4	Avoid repeated qemu_uring_handlers, add 2.0-2.7 compatibility	2023-07-04 00:28:23 +03:00
Vitaliy Filippov	3de7929fe5	Integrate v2 - direct epoll	2023-07-04 00:28:23 +03:00
Vitaliy Filippov	07b2196bc2	Integrate QEMU driver with io_uring	2023-07-04 00:28:23 +03:00
Vitaliy Filippov	a612cdca47	Release 0.9.3 - Add patch for libvirt 9.0 - Add support for Proxmox VE 8.0 - Fix compatibility of the QEMU driver with iothread (QEMU rebuilds are coming) - Fix vitastor-cli rm-data/rm/merge hanging when some OSDs are down. Allow deletions in unclean cluster at the cost of some data possibly "reappearing" when those OSDs start back. In that case you can just repeat the deletion request using rm-data. - A bunch of bug fixes for snapshots: - Fix snapshot reads often not working at all with snapshot chain size > 2 - Fix optimized snapshot data merge (children to parent) - Fix updating of image name index key during optimized merge - Fix auto-selection preventing the use of optimized merge with only 1 snapshot - Fix incorrect CAS retries during snapshot merge - Fix snapshot merge progress reporting - Fix primary_read bitmap buffers use-after-free which could lead to incorrect allocation map reads - Remove /usr/local/bin path from make-etcd - Some documentation fixes	2023-07-01 00:25:58 +03:00
Vitaliy Filippov	c8d61568b5	Fix primary_read bitmap buffers being freed too early (use-after-free)	2023-06-30 12:47:45 +03:00
Vitaliy Filippov	84ed3c6395	Fix CAS retries during snapshot merge	2023-06-30 02:30:23 +03:00
Vitaliy Filippov	a7b57386c0	Do not print last subcommand result twice during "inverse" snapshot merge	2023-06-30 02:07:10 +03:00
Vitaliy Filippov	9d4ea5f764	Fix inverse parent selection which prevented the use of optimized merge in case of only 1 snapshot	2023-06-30 01:39:11 +03:00
Vitaliy Filippov	000e4944ec	Remove "inverse parent" image name index key from etcd during snapshot merge	2023-06-30 01:23:30 +03:00
Vitaliy Filippov	8426616d89	Warn about unfinished deletions in rm-data	2023-06-30 01:18:25 +03:00
Vitaliy Filippov	1a841344ec	Print progress of all operations during snapshot merge	2023-06-30 01:13:47 +03:00
Vitaliy Filippov	8603b5cb1d	Do not hang on inactive OSDs during delete, report and skip them instead	2023-06-30 00:15:16 +03:00
Vitaliy Filippov	878ccbb6ea	Fix snapshot chain "down-merge" ("up-merge" worked well...)	2023-06-29 00:47:21 +03:00
Vitaliy Filippov	63c2b9832c	Fix chained (snapshot) reads often not working at all with chain size > 2	2023-06-28 18:54:03 +03:00
Vitaliy Filippov	a11ca56fb1	Fix compatibility of the QEMU driver with iothread	2023-06-21 02:11:28 +03:00
Vitaliy Filippov	b84927b340	Fix \n in nbd_proxy	2023-06-19 01:48:58 +03:00
Vitaliy Filippov	926be372fd	Release 0.9.2 - Measure and report scrub I/O statistics in vitastor-cli status - Make aggregated statistics in vitastor-cli status much smoother (first derive, then sum instead of first summing and then deriving) - Fix an old rare bug leading to journal corruption (try to use scrub if you think you're affected...) - Do not start EC PGs without at least <data chunks> OSDs in each old set (prevents spurious read errors with EC during reconnections/restarts) - Fix failed assert(!scrub_list_op) on OSD restart with pending scrubs - Fix future planned scrubs not starting because of incorrect time comparison - Build packages for Debian 12 (Bookworm)	2023-06-18 19:44:33 +03:00
Vitaliy Filippov	c74a424930	Report scrub I/O in vitastor-cli status	2023-06-17 21:11:21 +03:00
Vitaliy Filippov	32f2c4dd27	Measure scrub statistics	2023-06-17 20:56:26 +03:00
Vitaliy Filippov	3ad16b9a1a	Fix auto_scrubs not starting because of < vs <= =))	2023-06-17 17:32:21 +03:00
Vitaliy Filippov	1c2df841c2	Fix failed assert(!scrub_list_op) on OSD restart with pending scrubs	2023-06-17 17:02:54 +03:00
Vitaliy Filippov	aa5dacc7a9	Do not start EC PGs without at least pg_data_size connections to old OSDs from each set	2023-06-17 02:16:30 +03:00
Vitaliy Filippov	4fdc49bdc7	Add another assert-type check (it does not fire, just as a safety measure for the future)	2023-06-17 00:07:22 +03:00
Vitaliy Filippov	86b4682975	Put get_trim_pos into the "critical section". Fixes rare journal corruption issue The consequence of this issue was that in some very rare cases (only reproduced under load in CI when running 4+ tests in parallel) small write data written to journal could overwrite journal entries. Also add an assert-type safety check to be able to catch this issue in the future again in case of a regression.	2023-06-17 00:06:42 +03:00
Vitaliy Filippov	bdd48e4cf1	Release 0.9.1 - Fix "Client XX command out of sync" messages sometimes happening on OSD reconnections - Fix a bug where EC reads parallel with writes to the same object failed with -ERANGE error - Slightly reduce the amount of metadata writes during journal flushing - Correctly unmap NBD volumes when Proxmox forces map_volume use (with SWTPM and maybe some other cases)	2023-06-10 11:42:49 +03:00
Vitaliy Filippov	f9fbea25a4	Remove double write when old and new locations are in the same metadata block Also add another metadata entry fool-safety check which, ideally, will never fire %)	2023-06-03 00:47:10 +03:00
Vitaliy Filippov	2c9a10d081	Fix an idiotic bug leading to failed reads with -ERANGE with EC :D	2023-06-03 00:44:52 +03:00
Vitaliy Filippov	150968070f	Slightly improve some debug prints	2023-05-29 01:04:16 +03:00
Vitaliy Filippov	cdfc74665b	Close client FDs only when destroying the client, after handling all async reads/writes Fixes "Client XX command out of sync" sometimes happening on reconnections	2023-05-25 00:52:43 +03:00
Vitaliy Filippov	3b4cf29e65	Release 0.9.0 New features: - Scrubbing! Check documentation: [auto_scrub](src/branch/master/docs/config/osd.en.md#auto_scrub) - Document online-updatable configuration parameters Bug fixes: - Fix NaN during PG optimisation if there are nonexisting OSDs in node_placement - Fix monitor crash on pool deletion - Clear journal_device and meta_device before initialising the next OSD in automatic mode - Sync unsynced deletes before overwriting them with a lower version (reproducted mostly/only after scrubbing)	2023-05-21 15:07:14 +03:00
Vitaliy Filippov	ce02f47de6	Allow to disable scrub_find_best	2023-05-21 12:33:38 +03:00
Vitaliy Filippov	f1961157f0	Fix brute-force error locator for EC n+k with k > 2	2023-05-21 00:57:14 +03:00
Vitaliy Filippov	88c1ba0790	Fix compile errors with gcc 10	2023-05-20 23:20:09 +03:00
Vitaliy Filippov	fa90b5a4e7	Schedule automatic scrubs correctly (not just after previous scrub)	2023-05-20 23:20:09 +03:00
Vitaliy Filippov	8d40ad99a6	Add scrub documentation	2023-05-20 23:19:39 +03:00
Vitaliy Filippov	6ca20aa194	Allow scrub to fix corrupted object states	2023-05-20 23:19:39 +03:00
Vitaliy Filippov	4bfd994341	Sync unsynced deletes before overwriting them with a lower version	2023-05-20 23:19:39 +03:00
Vitaliy Filippov	59e959dcbb	Do not die when "different versions are returned from subops"	2023-05-20 23:19:39 +03:00
Vitaliy Filippov	a9581f0739	Handle dirty deletes during read correctly O_o	2023-05-20 23:19:39 +03:00

1 2 3 4 5 ...

576 Commits (1c10430ae1bd98eec56843639d2040797d81bcf1)