vitastor

Commit Graph

Author	SHA1	Message	Date
Vitaliy Filippov	59e959dcbb	Do not die when "different versions are returned from subops"	2023-05-20 23:19:39 +03:00
Vitaliy Filippov	a9581f0739	Handle dirty deletes during read correctly O_o	2023-05-20 23:19:39 +03:00
Vitaliy Filippov	105a405b0a	Implement vitastor-cli fix	2023-05-20 23:19:39 +03:00
Vitaliy Filippov	0e5d0e02a9	Add "vitastor-cli describe" command	2023-05-20 23:19:39 +03:00
Vitaliy Filippov	0439981a66	Implement "describe object(s)" operation Required to implement fixing inconsistent objects in vitastor-cli	2023-05-20 23:19:39 +03:00
Vitaliy Filippov	6648f6bb6e	Implement ambiguity detection during scrub	2023-05-20 23:19:39 +03:00
Vitaliy Filippov	281be547eb	Implement brute-force error locator for EC	2023-05-20 23:19:39 +03:00
Vitaliy Filippov	0c78dd7178	Add no_scrub flag	2023-05-20 23:19:39 +03:00
Vitaliy Filippov	3c924397e7	Store next scrub timestamp instead of last scrub timestamp	2023-05-20 23:19:39 +03:00
Vitaliy Filippov	c3bd26193d	Implement PG scrub runner	2023-05-20 23:19:39 +03:00
Vitaliy Filippov	43b77d7619	Implement scrubbing "data path" - OSD_OP_SCRUB	2023-05-20 23:19:39 +03:00
Vitaliy Filippov	a6d846863b	Add min/max stripe and limit to OP_LIST	2023-05-20 23:19:39 +03:00
Vitaliy Filippov	8dc427b43c	Retry failed reads (including chained and RMW) from other replicas	2023-05-20 23:19:39 +03:00
Vitaliy Filippov	bf2112653b	Refcount object_states	2023-05-20 23:19:39 +03:00
Vitaliy Filippov	0538a484b3	Add corrupted object state	2023-05-20 23:19:39 +03:00
Vitaliy Filippov	97720fa6b4	Remove unused capture	2023-05-20 22:58:51 +03:00
Vitaliy Filippov	e60e352df6	Improve vitastor-nbd documentation	2023-05-20 22:58:51 +03:00
Vitaliy Filippov	629999f789	Clear journal_device and meta_device before initialising the next OSD in automatic mode	2023-05-15 23:58:55 +03:00
Vitaliy Filippov	5a9e1ede52	Release 0.8.9 - The tests are now stable and run in a CI system based on Gitea CI - The release includes final bug fixes for EC: - Implement missing EC recovery of allocation bitmap when built with ISA-L - Fix broken snapshot export with EC (allocation bitmap reads were giving incorrect results previously) - Also fixed bugs manifesting under heavy load: - Fix monitor possibly applying incorrect PG history on retries - Fix monitor incorrectly changing PG count when last_clean_pgs contains less PGs than the new number - Allow writes to wait for free space again, but now correctly (previously dropped in 0.8.2) - Fix a rare segfault in client (handle client stop during incoming stream handling in 1 more place) - Make monitor correctly handle etcd connection errors - it could die instead of connecting to another etcd - Fix OSD rarely being unable to report PG states after a PG was taken over by another OSD - Fixed return code for incomplete EC objects (now EIO) and made cluster client retry this error - Made other small changes for tests: timeouts, nice/ionice for etcd, waiting conditions, NBD device checks and so on	2023-05-14 01:25:09 +03:00
Vitaliy Filippov	de3e609166	Add a FIXME about QEMU driver thread safety	2023-05-14 00:06:09 +03:00
Vitaliy Filippov	11481170f5	Add a FIXME about ENOSPC	2023-05-13 23:59:44 +03:00
Vitaliy Filippov	6442010f93	Skip offline PGs during state reporting when the state is already deleted or taken over by another OSD This fixes OSDs being unable to report PG states in rare conditions	2023-05-12 23:17:45 +03:00
Vitaliy Filippov	ce4a8067b5	Handle client stop during incoming stream handling in 1 more place	2023-05-11 01:53:41 +03:00
Vitaliy Filippov	8cac795445	Return EIO instead of EINVAL for incomplete EC objects	2023-05-11 01:15:23 +03:00
Vitaliy Filippov	a409598b16	Wait for free space again, but count on big_write flushes instead of just flusher activity	2023-05-10 01:51:02 +03:00
Vitaliy Filippov	f4c6765522	Ignore ENOENT in epoll_ctl	2023-05-08 20:39:20 +03:00
Vitaliy Filippov	5da1d8e1b5	Fix EC just-bitmap reads (len=0) (fixes SCHEME=ec test_snapshot.sh)	2023-05-07 14:00:08 +03:00
Vitaliy Filippov	44f86f1999	Add a basic EC 2+2 recovery test (not really required, but let it be there)	2023-05-07 11:26:27 +03:00
Vitaliy Filippov	2d9a80c6f6	Implement missing bitmap recovery with ISA-L \(°□°)/	2023-05-07 11:25:51 +03:00
Vitaliy Filippov	ab615849d6	Release 0.8.8 - Fix vitastor-cli rm/rm-data broken in 0.8.6 (missing messenger initialization) - Prepare OSD read handler for upcoming version with scrub - allow "secondary reads" to return errors - Fix OSDs re-peering PGs infinitely with a big number of PGs (reproduced in test_add_osd) - Fix another variant of flusher sync-waiting stall (reproduced in test_write) - Fix other tests in tests/ (will add them to Gitea CI soon) - Add patches for QEMU 6.2-8.0 - Fix QEMU driver compatibility with QEMU 8.0 - Build packages for RHEL 9 clones (based on AlmaLinux 9)	2023-04-28 11:22:00 +03:00
Vitaliy Filippov	b94587ef0e	Fix some build warnings	2023-04-28 00:44:27 +03:00
Vitaliy Filippov	c768a9015f	Fix QEMU driver compatibility with QEMU 8.0	2023-04-25 11:20:21 +03:00
Vitaliy Filippov	b74ccb613c	Fix another variant of flusher sync-waiting stall	2023-04-24 00:44:41 +03:00
Vitaliy Filippov	a04dab0840	Initialize messenger in cluster_client listings	2023-04-24 00:44:41 +03:00
Vitaliy Filippov	160863f707	Print op pointer values in slow log	2023-04-23 17:54:00 +03:00
Vitaliy Filippov	2877cd0adb	Allow OP_SEC_READ to return errors (do not hang the connection)	2023-04-23 17:54:00 +03:00
Vitaliy Filippov	480509f5b9	Fix pg_data_size > 1 for replicas (harmless bug)	2023-04-23 01:50:42 +03:00
Vitaliy Filippov	46462da45e	Preload own PG history updates to fix PG state loop possibly applying the old metadata version	2023-04-23 01:50:30 +03:00
Vitaliy Filippov	7e958afeda	Release 0.8.7 This release includes a bunch of important bugfixes for erasure-coded setups with disabled immediate_commit. After these fixes, "test_heal" OSD killing test now passes fine with EC: - Fix cluster write stalls with "Error while doing flush on OSD xx: -16 (Device or resource busy)" in OSD logs possible in EC setups with disabled immediate_commit by selectively syncing nonsynced objects on STABILIZE/ROLLBACK (https://github.com/vitalif/vitastor/issues/51) - Fix other EC + disabled immediate_commit problems: - Fix "opcode=5 retval=-2" errors happening on SYNC retries - Fix non-working "pagination" during PG dirty object flushing - Fix write operations not continued correctly after dirty object flushing - Fix incorrect parity read-modify-write calculation when writing into a lost chunk - Fix OSDs losing left_on_dead PG state of non-clean PGs and thus not removing junk data in the cluster - Fix a small memory leak caused by bad indexing of EC recovery matrices - Fix a rare use-after-free in cluster_client caused by a reenterability issue - Fix vitastor-cli create command syntax in the CSI driver - Allow to start OSDs without local store for tests - Fix memory allocation error in disk_tool_meta for non-standard metadata block sizes - Fix delete operations received before loading pool metadata crashing OSDs with "null pointer exception" - Improve "theoretical performance" Russian documentation New features: - Implement online configuration update for some parameters. Documentation is coming soon :)	2023-04-11 02:11:57 +03:00
Vitaliy Filippov	2f5e769a29	Fix a small memory leak caused by bad indexing of EC recovery matrices	2023-04-11 00:30:36 +03:00
Vitaliy Filippov	3237014608	Fix incorrect parity read-modify-write calculation when writing into a lost chunk	2023-04-09 02:06:10 +03:00
Vitaliy Filippov	baaf8f6f44	Fix write operations not continued correctly after flush	2023-04-09 02:06:10 +03:00
Vitaliy Filippov	1d83fdcd17	Add debug logs to osd_flush	2023-04-09 02:06:10 +03:00
Vitaliy Filippov	0ddd787c38	Fix non-working "pagination" during PG dirty object flushing	2023-04-08 02:44:02 +03:00
Vitaliy Filippov	6eff3a60a5	Do not lose left_on_dead PG state of non-clean PGs	2023-04-08 02:44:02 +03:00
Vitaliy Filippov	888a6975ab	Fix a rare use-after-free in cluster_client caused by a reenterability issue	2023-04-08 02:44:02 +03:00
Vitaliy Filippov	cd1e890bd4	Fix "opcode=5 retval=-2" errors sometimes possible with EC	2023-04-08 02:44:02 +03:00
Vitaliy Filippov	0fbf4c6a08	Selectively sync nonsynced objects on STABILIZE/ROLLBACK (fix for github issue #51 )	2023-04-08 02:44:02 +03:00
Vitaliy Filippov	d06ed2b0e7	Implement online config update	2023-03-26 19:21:50 +03:00
Vitaliy Filippov	2fb0c85618	Allow to start OSDs without local store (only for tests)	2023-03-15 01:13:59 +03:00
Vitaliy Filippov	d81a6c04fc	Update cmake min version so it does not complain about deprecation	2023-03-15 01:08:23 +03:00
Vitaliy Filippov	7b35801647	Fix possible bad realloc in disk_tool_meta for non-standard metadata block sizes	2023-03-15 01:08:23 +03:00
Vitaliy Filippov	f3228d5c07	Fix typo (did not affect execution though)	2023-03-15 01:08:23 +03:00
Vitaliy Filippov	18366f5055	Fix read/write return type in rw_blocking	2023-03-15 01:08:14 +03:00
Vitaliy Filippov	851507c147	Add missing close() in test stubs	2023-03-15 00:23:56 +03:00
Vitaliy Filippov	9aaad28488	Fix "null pointer exception" for unhandled OSD_OP_DELETEs (when pool is not loaded yet)	2023-03-02 11:16:39 +03:00
Vitaliy Filippov	8810eae8fb	Release 0.8.6 Important fixes: - Fix possibly incorrect EC parity chunk updates with EC n+k, k > 1 and when the first parity chunk is missing Minor fixes and improvements: - Fix incorrect EC free space statistics in vitastor-cli df output - Speedup vitastor-cli startup in clusters with RDMA - Remove unused PG "peered" state (previously used to update PG epoch) - Use sfdisk with just --json in vitastor-disk (--dump --json isn't needed) - Allow trailing comma in sfdisk output (fixes sfdisk 2.36 compatibility) - Slightly improve RDMA send/receive code - Reduce RDMA memory consumption by default (rdma_max_recv/send = 16/8) - Use vitastor-cli instead of direct etcd interaction in the CSI driver	2023-02-28 11:18:48 +03:00
Vitaliy Filippov	14d6acbcba	Set default rdma_max_recv/send to 16/8, fix documentation	2023-02-28 11:00:56 +03:00
Vitaliy Filippov	1e307069bc	Fix missing parity chunk calculation for EC n+k, k > 1 and first parity chunk missing	2023-02-28 02:40:19 +03:00
Vitaliy Filippov	c3e80abad7	Allow to send more than 1 operation at a time	2023-02-26 02:01:04 +03:00
Vitaliy Filippov	138ffe4032	Reuse incoming RDMA buffers	2023-02-26 00:55:01 +03:00
Vitaliy Filippov	4ab630b44d	Use just sfdisk --json, --dump is not needed	2023-02-23 00:55:47 +03:00
Vitaliy Filippov	2c8241b7db	Remove PG "peered" state	2023-02-21 01:30:42 +03:00
Vitaliy Filippov	36a7dd3671	Move tests to "make test"	2023-02-21 01:30:42 +03:00
Vitaliy Filippov	936122bbcf	Initialize msgr lazily in client to speedup vitastor-cli with RDMA enabled	2023-02-19 18:59:07 +03:00
Vitaliy Filippov	1a1ba0d1e7	Add set_immediate to ringloop and use it for bs/osd ops to prevent reenterability issues	2023-02-09 17:37:26 +03:00
Vitaliy Filippov	3d09c9cec7	Remove unused wait_sqe() from ringloop	2023-02-09 17:37:26 +03:00
Vitaliy Filippov	3d08a1ad6c	Fix cluster_client test after last reenterability fixes	2023-02-05 01:47:32 +03:00
Vitaliy Filippov	aba93b951b	Fix incorrect EC free space statistics in vitastor-cli df output	2023-01-26 02:04:29 +03:00
Vitaliy Filippov	d125fb1f30	Release 0.8.5 - Fix a possible "double free" bug in the client library happening on OSD restart - Fix a possible write hang on PG history update when only epoch is changed - Fix incorrect systemd target "local.target" in mon/make-etcd - Allow "content" option in PVE storage plugin to allow to enable containers - Build client library without tcmalloc which fixes "attempt to free invalid pointer" errors when, for example, trying to run QEMU with both Vitastor and Ceph RBD disks	2023-01-25 01:43:49 +03:00
Vitaliy Filippov	8b552a01f9	Do not retry successful operation parts in client (could lead to "double free" bugs)	2023-01-25 01:30:36 +03:00
Vitaliy Filippov	0385b2f9e8	Fix write hangs on PG epoch update - always set pg.history_changed to true	2023-01-25 01:30:15 +03:00
Vitaliy Filippov	9f4e34a8cc	Build client library without tcmalloc Fixes "[src/tcmalloc.cc:332] Attempt to free invalid pointer ..." when trying to run QEMU with both Vitastor and Ceph RBD disks and other possible allocator collisions.	2023-01-15 00:01:11 +03:00
Vitaliy Filippov	81fc8bb94c	Release 0.8.4 New features: - Implement QCOW2 image/snapshot export via qemu-img (bdrv_co_block_status in the driver) - Remove OSDs from PG history during `vitastor-cli rm-osd` to prevent `left_on_dead` PG states after deletion - Add a new recovery_pg_switch setting to mix all PGs during recovery, to almost fully reduce the probability of ENOSPC during rebalance - Introduce partial ENOSPC ("OSD is full") handling - now ENOSPC doesn't turn into cascades of crashes - Add migration support to Proxmox VE Vitastor driver - Track last_clean_pgs on a per-pool basis thus reducing data movement in a cluster with pools remaining unclean/degraded for a long time Bug fixes: - Fix a bug where monitor could generate degraded PGs if one of the hosts had no OSDs - Fix a bug where monitor could skip PG redistribution with a lot of OSDs in cluster - Report PG history synchronously on the first write, which improves PG consistency and availability at the same time, because history now gets reported correctly and doesn't get reported without the need for it - Fix possible write and recovery stalls which could happen in a cluster with both EC and replicated pools - Make OSD and monitors sanitize & deduplicate PG history items in etcd - Fix non-working OSD peer config safety check - Fix a rare journal flush stall where flushing wasn't activated with full journal, but with empty flush queue - Fix builds without ISA-L (jerasure-only) crashing with EC N+K, K>=2 due to the lack of 16-byte buffer alignment - Fix a possible crash for EC N+K, K>=2 when calculating a parity chunk with previous parity chunk missing - Fix a bug where vitastor-disk purge with suppressed warnings didn't work	2023-01-13 23:59:54 +03:00
Vitaliy Filippov	bc465c16de	Fix arithmetic on void* for clang	2023-01-13 23:58:42 +03:00
Vitaliy Filippov	8763e9211c	Fix qemu driver compilation warning/error	2023-01-13 23:44:39 +03:00
Vitaliy Filippov	fe87b4076b	Fix backwards compatibility in cluster_client	2023-01-12 02:37:31 +03:00
Vitaliy Filippov	137309cf29	Implement bdrv_co_block_status for snapshot export support	2023-01-07 17:06:58 +03:00
Vitaliy Filippov	373f9d0387	Try to re-peer PGs on history change	2023-01-06 12:46:44 +03:00
Vitaliy Filippov	c4516ea971	Also remove deleted OSD from PG configuration and last_clean_pgs	2023-01-06 12:46:44 +03:00
Vitaliy Filippov	91065c80fc	Try to prevent left_on_dead when deleting OSDs by removing them from PG history	2023-01-06 12:46:43 +03:00
Vitaliy Filippov	02e7be7dc9	Prevent reenterability side effects during PG history operation resume	2023-01-03 02:20:50 +03:00
Vitaliy Filippov	73940adf07	Prioritize EC (non-instantly-stable) operations under journal pressure This reduces the probability of hitting OSD stalls with EC due to "deadlocks" where two parallel write operations wait for each other to complete	2023-01-03 00:05:45 +03:00
Vitaliy Filippov	e950c024d3	Do not sync peer OSDs before listing Sync before listing was added to wait for all PG writes possibly left in queue from the previous master to finish before listing it But in fact it may block the cluster when EC is used and some unstable writes are left in the queue - they block journal flushing, rollback/stabilize is required to unblock them, but rollback/stabilize may only happen after PG is peered. But peering needs listings, listings are requested only after sync, and sync itself waits for currently blocked writes waiting in the queue	2023-01-03 00:05:45 +03:00
Vitaliy Filippov	71d6d9f868	Fix possible crash on ENOSPC during operation cancel in blockstore	2023-01-03 00:05:45 +03:00
Vitaliy Filippov	a4dfa519af	Report PG history synchronously during write This has 2 effects: 1) OSD sets aren't added into PG history until actual write attempts anymore which removes unneeded extra osd_sets in PG history 2) New OSD sets are reported synchronously and can't be lost on PG restarts happening at the same time with reconfiguration	2023-01-01 23:41:05 +03:00
Vitaliy Filippov	67019f5b02	Make OSD sort & sanitize PG history items	2023-01-01 23:17:42 +03:00
Vitaliy Filippov	0593e5c21c	Fix OSD peer config safety check	2022-12-31 02:24:42 +03:00
Vitaliy Filippov	998e24adf8	Add a new recovery_pg_switch setting to mix all PGs during recovery	2022-12-30 02:03:33 +03:00
Vitaliy Filippov	d7bd36dc32	Fix another rare journal flush stall	2022-12-30 02:03:33 +03:00
Vitaliy Filippov	cf5c562800	Log all object locations when peering PGs	2022-12-30 02:03:33 +03:00
Vitaliy Filippov	629200b0cc	Return ENOSPC as the primary OSD	2022-12-30 02:03:33 +03:00
Vitaliy Filippov	3589ccec22	Do not disconnect peer on ENOSPC during write	2022-12-30 01:54:25 +03:00
Vitaliy Filippov	8d55a1e780	Build osd_rmw_test both with and without ISA-L	2022-12-29 19:13:57 +03:00
Vitaliy Filippov	65f6b3a4eb	Fix jerasure crashing on bitmap calculation/restoration due to the lack of 16-byte alignment	2022-12-29 19:13:57 +03:00
Vitaliy Filippov	fd216eac77	Add a test for missing parity chunk calculation	2022-12-29 19:13:57 +03:00
Vitaliy Filippov	61fca7c426	Fix crash when calculating a parity chunk with previous parity chunk missing (test coming shortly)	2022-12-29 19:13:57 +03:00
Vitaliy Filippov	68f3fb795e	Suppress warnings in vitastor-disk purge correctly	2022-12-27 11:09:19 +03:00
Vitaliy Filippov	fa90f287da	Release 0.8.3 - Implement a new "vitastor-disk purge" command to remove OSDs with safety checks - Implement a new "vitastor-cli rm-osd" command to only remove OSD metadata from etcd - Fix a bug where the monitor could ignore OSD removal and other /osd/stats key changes - Fix a bug where garbage could be returned when reading objects being written at the same time - Fix a rare write stall where journal space could be not reclaimed where there were no new operations in the flush queue - Fix a rare peering stall caused by a previous long listing operations queues limiting attempt - Fix total object count statistic in OSD on object creation - Add missing offset&len into vitastor-disk dump-journal for big_writes, fix JSON format - Make vitastor-cli print help on missing command - Make vitastor-cli translate all '-' to '_' in CLI options	2022-12-27 02:40:55 +03:00
Vitaliy Filippov	795020674d	Loop journal flusher when the queue is empty but there is a trim request	2022-12-27 02:28:20 +03:00


578 Commits (1c316ef350cf0f02a38c9b121c5890c8867dd87f)