vitastor

Commit Graph

Author	SHA1	Message	Date
Vitaliy Filippov	6909807068	Allow to start the OSD just to flush the journal completely	2021-04-10 17:44:12 +03:00
Vitaliy Filippov	ec90fe6ec1	Release 0.5.13 Another followup to 0.5.11	2021-04-09 12:10:16 +03:00
Vitaliy Filippov	18c72f4835	Correct reenterability fix (now verified with a test) It's rather funny but 0.5.12 has to be re-published again	2021-04-09 12:10:16 +03:00
Vitaliy Filippov	59fbcef734	Release 0.5.12 Fix qemu driver broken in 0.5.11 :)	2021-04-08 15:47:18 +03:00
Vitaliy Filippov	40b7c21fb1	Followup to `307c1731c1` - fix mark_stable	2021-04-08 15:47:18 +03:00
Vitaliy Filippov	efb3678606	Fix qemu-img broken in 0.5.11 Caused by the lack of reenterability of the main cluster_client function	2021-04-08 14:59:20 +03:00
Vitaliy Filippov	462650134e	Release 0.5.11 Another bunch of fixes, including important ones. Now OSDs are stable in SSD+HDD configurations and everything is mostly ready for the merge of master branch. Features: - Add min_flusher_count configuration (good for HDDs) - Shuffle PGs for better data device utilisation - Make OSDs benefit from the immediate_commit=small setting if it's applicable Bug fixes: - Rework client code to fix write ordering during operation replay - Rework error handling code so OSDs don't crash in reaction to a crash of their peer OSDs - Fix several block layer problems related to the journal, some of which were leading to double allocations of the same block during journal replay - Fix monitors crashing during the removal of OSD keys from etcd - Fix data fsyncs being incorrectly disabled when only disable_journal_fsync was set - Always zero out unused part of request/reply headers - Fix some theoretically possible read/write ordering issues - Don't try to "recover" misplaced objects if it would make them degraded - Fix heartbeats sometimes preventing OSD to establish connections	2021-04-08 01:18:46 +03:00
Vitaliy Filippov	8d87e32175	Fix msgr_op.h includes	2021-04-08 01:18:46 +03:00
Vitaliy Filippov	b0b2e7df3c	Fix use-after-free in keepalive_timer and rework stop_client() The bug reproduced if fio was temporarily stopped with SIGSTOP during write test and then resumed after 10 seconds. In this case "pings" were failed for all clients and fio process crashed with 'use-after-free' in keepalive_timer. It happened because it called stop_client while having a live iterator to the map.	2021-04-07 11:06:31 +03:00
Vitaliy Filippov	97efb9e299	Do not crash on PG re-peering events when operations are in progress	2021-04-07 11:06:31 +03:00
Vitaliy Filippov	f6d705383a	Fix client connection recovery bugs, add dirty_ops limit	2021-04-07 11:06:31 +03:00
Vitaliy Filippov	68567c0e1f	Fix messenger possibly trying to connect to the same OSD twice	2021-04-07 01:30:38 +03:00
Vitaliy Filippov	04b00003e9	Log ping failures	2021-04-07 01:30:38 +03:00
Vitaliy Filippov	307c1731c1	Forget all dirty_entries before stable big_write or delete during initialisation This fixes a 'double_alloc' assertion in the following case: - big_write object #1 v1 to block #100 - big_write object #1 v2 to block #101 - big_write object #2 v1 to block #100	2021-04-07 01:30:38 +03:00
Vitaliy Filippov	75a6a556b5	Shuffle PGs for better data device utilisation	2021-04-07 01:30:38 +03:00
Vitaliy Filippov	a48e2bbf18	Fix write replay ordering when immediate_commit != all Previous implementation didn't respect write ordering and could lead to corrupted data when restarting writes after an OSD outage Also rework cluster_client queueing logic and add tests for it to verify the correct behaviour	2021-04-03 14:51:52 +03:00
Vitaliy Filippov	688821665a	Remove stoull_full() from etcd_state_client.cpp	2021-04-03 14:36:04 +03:00
Vitaliy Filippov	3e162d95a0	Remove http_client.h include from etcd_state_client.h	2021-04-03 14:36:04 +03:00
Vitaliy Filippov	829381b335	Extract some definitions to msgr_op.{cpp,h}	2021-04-03 14:36:04 +03:00
Vitaliy Filippov	54f2353f24	Use bitmap granularity for alignment checks	2021-04-03 14:36:04 +03:00
Vitaliy Filippov	e47f6fba60	Remove cluster_client_t::stop()	2021-04-03 14:35:42 +03:00
Vitaliy Filippov	883bf84a16	Fix build	2021-04-03 01:47:15 +03:00
Vitaliy Filippov	52097c4856	Stop flushing when less than min_flusher_count operations are available (unless a trim is forced)	2021-04-03 00:53:28 +03:00
Vitaliy Filippov	e1355cbc74	Report failed operation name in cluster_client	2021-04-03 00:53:28 +03:00
Vitaliy Filippov	8f8b90be7a	Add min_flusher_count configuration	2021-04-03 00:53:28 +03:00
Vitaliy Filippov	ad9f619370	Skip double allocs when reading journal	2021-04-03 00:53:28 +03:00
Vitaliy Filippov	f4769ba7c7	Collapse create+delete journal entry pairs if they're already flushed Old journal replay mechanism could lead to a double allocation of the same block and a "Fatal error: tried to overwrite non-zero metadata entry"	2021-04-03 00:53:28 +03:00
Vitaliy Filippov	843b7052d2	Add an assertion when clearing deleted metadata entries, add debug details when freeing blocks	2021-04-03 00:53:28 +03:00
Vitaliy Filippov	df99e232ee	Deduplicate osd_sets in pg history + raise request size limit for etcd	2021-04-03 00:53:28 +03:00
Vitaliy Filippov	3a40fa4127	Fix monitor errors in case of OSD removal	2021-03-27 01:15:18 +03:00
Vitaliy Filippov	4095bcc558	Do not ignore object deletion journal entries when they are preceded by a big write	2021-03-25 11:00:10 +03:00
Vitaliy Filippov	564d64e271	Add some details for debug prints	2021-03-25 11:00:10 +03:00
Vitaliy Filippov	cf54741c95	Followup to `05db1308aa` Don't do anything with the object state after errors because it's freed by PG re-peer in this case	2021-03-25 11:00:10 +03:00
Vitaliy Filippov	18a5fafa2a	Fix rollback	2021-03-25 02:41:58 +03:00
Vitaliy Filippov	06f4978085	Fix fsync check in blockstore_flush (data fsyncs were disabled instead of journal fsyncs)	2021-03-25 02:41:58 +03:00
Vitaliy Filippov	7ebf1588c5	Check for immediate_commit==small in the OSD code	2021-03-25 02:41:58 +03:00
Vitaliy Filippov	b0ad1e1e6d	Remember writes as "unsynced" only after completing them Previously BS_OP_SYNC could take unfinished writes and add them into the journal before they were actually completed. This was leading to crashes with the message "BUG: Unexpected dirty_entry 2000000000001:9f2a0000 v3 unstable state during flush: 338"	2021-03-25 02:41:58 +03:00
Vitaliy Filippov	0949f08407	Extract osd_primary write and sync code into separate files	2021-03-24 14:20:56 +03:00
Vitaliy Filippov	04a1f18fa5	Assign .req as a whole to always zero out the remaining part Also clear .reply before processing the operation	2021-03-24 14:20:56 +03:00
Vitaliy Filippov	cf9a641d66	Skip disconnected OSDs during sync	2021-03-24 14:20:56 +03:00
Vitaliy Filippov	05db1308aa	Fix two potential read/write ordering problems (even though not yet seen in tests) - Write operations could be 'stabilized' and previous versions could be purged from OSDs before the removal of version_override and following reads could potentially hit different version in EC pools - Object was marked clean after completing the delete during recovery, so reads could in theory hit a deleted version and return nothing	2021-03-24 14:20:56 +03:00
Vitaliy Filippov	98b54ca948	Don't try to "recover" misplaced objects if it would make them degraded	2021-03-21 01:37:23 +03:00
Vitaliy Filippov	23225c5e62	Do not run ping on clients that are not yet connected	2021-03-21 01:37:23 +03:00
Vitaliy Filippov	7e6e1a5a82	Release 0.5.10 The version seems to be stable after this bunch of fixes :) - Fix delete & write operation ordering during rebalance to not lose objects in the immediate_commit=off mode - Fix a possible crash caused by very high iodepths - Re-distribute PG primaries over OSDs that come up after a short downtime - Allow to specify etcd URLs for OSDs with http://, do not die with a strange error if -etcd option is missing for fio - Fix a journal flushing deadlock which sometimes occurred in the immediate_commit=off mode - Fix a bug where OSDs could hang if the data device filled up - Fix an allocator bug where it was unable to allocate up to last (n%64) data device blocks - Fix monitor crash that occurred on removal of some etcd keys - Fix a bug where PGs could remain incomplete due to incorrect PG history with just zeroes in osd_sets	2021-03-16 12:48:26 +03:00
Vitaliy Filippov	435045751d	Delete objects only after a SYNC during rebalance in the non-immediate_commit mode Previously OSDs could commit deletes before writes during recovery or rebalance in the "lazy fsync" (immediate_commit=off) mode which could result in lost objects	2021-03-16 12:48:26 +03:00
Vitaliy Filippov	c5fb1d5987	Do not duplicate blockstore operations when io_uring fills up This bug was leading to OSDs dying with "Assertion `fulfilled == read_op->len' failed" when testing fio -rw=randread -numjobs=8 -iodepth=128	2021-03-16 12:48:26 +03:00
Vitaliy Filippov	9f59381bea	Re-distribute PG primaries over OSDs that come up after a short downtime	2021-03-16 12:48:26 +03:00
Vitaliy Filippov	9ac7e75178	Allow to specify etcd URLs for OSDs with http://, do not die with a strange error if -etcd option is missing for fio	2021-03-16 12:48:26 +03:00
Vitaliy Filippov	88671cf745	Fix a bug causing all flushers to wait for an fsync without actually trying to do it This happened because flusher_count became dynamic and fsync_batch() was comparing the number of flushers currently ready to do an fsync with the maximum number of flushers. Also the number wasn't rechecked on every loop which was also incorrect. Now the interrupted_rebalance test passes even without IMMEDIATE_COMMIT=1.	2021-03-13 17:27:29 +03:00
Vitaliy Filippov	fe1749c427	Fix the multiple_interrupted_rebalance test	2021-03-13 17:19:45 +03:00
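
The `double_alloc` case described in commit `307c1731c1` can be reproduced with a toy model. The sketch below is not Vitastor's blockstore code: `replay`, `forget_superseded`, `DoubleAllocError` and `JOURNAL` are hypothetical names, and the model assumes that every big_write in the journal is stable and that an older version's block becomes free once a newer stable version of the same object is replayed.

```python
class DoubleAllocError(Exception):
    """Raised when a replayed write targets a block that is still marked used."""


def replay(entries, forget_superseded):
    """Replay (object_id, version, block) big_write entries, oldest first.

    Returns a dict mapping each used block to the (object_id, version) that owns it.
    With forget_superseded=True, blocks held by earlier versions of an object are
    released before the newer stable write is applied (an assumed simplification
    of "forget all dirty_entries before stable big_write").
    """
    used = {}       # block -> (object_id, version)
    by_object = {}  # object_id -> blocks held by previously replayed versions
    for obj, ver, block in entries:
        if forget_superseded:
            for old_block in by_object.pop(obj, []):
                used.pop(old_block, None)
        if block in used:
            raise DoubleAllocError(
                f"block {block} already held by object/version {used[block]}"
            )
        used[block] = (obj, ver)
        by_object.setdefault(obj, []).append(block)
    return used


# The sequence from the commit message:
# object #1 v1 -> block #100, object #1 v2 -> block #101, object #2 v1 -> block #100
JOURNAL = [(1, 1, 100), (1, 2, 101), (2, 1, 100)]
```

In this model, `replay(JOURNAL, forget_superseded=False)` raises `DoubleAllocError` on the third entry because block #100 is still attributed to object #1 v1, while `replay(JOURNAL, forget_superseded=True)` completes with block #100 owned by object #2.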


663 Commits (rel-0.5)
