vitastor

Commit Graph

Author	SHA1	Message	Date
Vitaliy Filippov	4c9bf6727b	Experimental: Handle degraded deletions by comparing object versions with epochs CAUTION! This version is not fool proof yet. If you purge data of an OSD by overwriting the disk with zeroes and restart it then the same data will also be removed from other replicas :-). I plan to add protection from this situation before merging it into master. The idea is to make each OSD store a random "cookie" on disk and remove itself from history automatically if the cookie doesn't match.	2023-04-29 00:21:22 +03:00
Vitaliy Filippov	629200b0cc	Return ENOSPC as the primary OSD	2022-12-30 02:03:33 +03:00
Vitaliy Filippov	a0cae4c180	Rename "jerasure" to "ec" in pool configuration, function names, fix documentation and Debian build scripts Old pool configurations with "jerasure" also remain supported as an alias for "ec"	2022-06-03 15:40:00 +03:00
Vitaliy Filippov	83146fa3e2	Fix the same HUGE bug for regular reads during rebalance	2022-04-08 11:50:09 +03:00
Vitaliy Filippov	7bdd92ca4f	Fix build under clang and some warnings Build problems fixed: - void* pointer arithmetic which is a GNU extension (works as byte*) - "variable size object may not be initialized" which is OK under GCC - nullptr_t related error in json11 (it lacks 'operator <' in clang) Warnings fixed: - empty nested struct initializer { 0 } replaced by {} - removed several unused lambda captures	2022-01-16 00:02:54 +03:00
Vitaliy Filippov	5cf1157f16	Return real version on CAS failure	2021-08-01 20:05:19 +03:00
Vitaliy Filippov	acf637950c	Implement layer merge A new command merges multiple snapshot/clone layers into one of them, so merged layers can be deleted after this procedure	2021-07-31 00:23:30 +03:00
Vitaliy Filippov	aad7792d3f	Check for loops in parent inode chains	2021-06-20 00:23:03 +03:00
Vitaliy Filippov	891250d355	Implement CAS writes From now on, reads will return the server-side object version numbers and writes and deletes will have an additional "version" parameter which, if set to a non-zero value, will be atomically compared with the current version of the object plus 1 and the modification will fail if it doesn't match. This feature opens the road to correct online flattening of snapshot layers and other interesting things.	2021-06-15 00:12:35 +03:00
Vitaliy Filippov	38a3df4a0e	Implement chained (optimized) read in the primary OSD code	2021-04-10 17:44:12 +03:00
Vitaliy Filippov	d6524670e1	Introduce data distribution locality	2021-04-10 17:44:12 +03:00
Vitaliy Filippov	ab39ce2bbb	Use clean_entry_bitmap_size instead of entry_attr_size back because of changed bitmap handling	2021-04-10 17:44:12 +03:00
Vitaliy Filippov	d0c2e31312	Add a test for snapshots, fix bugs. Now the test passes	2021-04-10 17:44:12 +03:00
Vitaliy Filippov	9038d42327	Fix several snapshot I/O bugs	2021-04-10 17:44:12 +03:00
Vitaliy Filippov	0aa2dd2890	Send bitmaps with primary-reads, actually read bitmaps for READ ops	2021-04-10 17:44:12 +03:00
Vitaliy Filippov	6bf88883ac	Allocate bitmaps along with stripes to avoid memory fragmentation	2021-04-10 17:44:12 +03:00
Vitaliy Filippov	004f265393	Remove cryptic bitmap inlining from bs_op_t and osd_op_t, use bitmap in primary OSD code	2021-04-10 17:44:12 +03:00
Vitaliy Filippov	95c29b9dc3	Add "external" bitmap support to osd_rmw	2021-04-10 17:44:12 +03:00
Vitaliy Filippov	97efb9e299	Do not crash on PG re-peering events when operations are in progress	2021-04-07 11:06:31 +03:00
Vitaliy Filippov	54f2353f24	Use bitmap granularity for alignment checks	2021-04-03 14:36:04 +03:00
Vitaliy Filippov	883bf84a16	Fix build	2021-04-03 01:47:15 +03:00
Vitaliy Filippov	0949f08407	Extract osd_primary write and sync code into separate files	2021-03-24 14:20:56 +03:00
Vitaliy Filippov	cf9a641d66	Skip disconnected OSDs during sync	2021-03-24 14:20:56 +03:00
Vitaliy Filippov	05db1308aa	Fix two potential read/write ordering problems (even though not yet seen in tests) - Write operations could be 'stabilized' and previous versions could be purged from OSDs before the removal of version_override and following reads could potentially hit different version in EC pools - Object was marked clean after completing the delete during recovery, so reads could in theory hit a deleted version and return nothing	2021-03-24 14:20:56 +03:00
Vitaliy Filippov	435045751d	Delete objects only after a SYNC during rebalance in the non-immediate_commit mode Previously OSDs could commit deletes before writes during recovery or rebalance in the "lazy fsync" (immediate_commit=off) mode which could result in lost objects	2021-03-16 12:48:26 +03:00
Vitaliy Filippov	1be94da437	Check & remove extra chunks for degraded / incomplete objects, too	2021-03-08 17:04:10 +03:00
Vitaliy Filippov	21e7686037	Fix possible "assertion failed: pg.inflight >= 0" error during PG stop	2021-03-08 17:04:10 +03:00
Vitaliy Filippov	ab21a1908b	Check for the dirty PG flag when trying to continue to stop it after sync	2021-03-08 17:04:10 +03:00
Vitaliy Filippov	6155b23a7e	Replace pgs[id] with pgs.at(id) to prevent accidental auto-vivification	2021-02-28 19:36:59 +03:00
Vitaliy Filippov	bf9a175efc	Move C/C++ sources to src subdirectory	2021-02-25 23:59:03 +03:00

30 Commits (4c9bf6727b3f9c81906569764336c0b4d83d8766)