Vitaliy Filippov
089f138e0c
Allow situations where the journal contains a big_write(v1) after delete(v2) and v1 < v2
...
Fixes a crash in the following scenario:
- client issues a delete request (object version is at least 2)
- OSD has time to flush it to the metadata, but doesn't have time to move the journal start pointer on disk
- client overwrites the same object and it gets the version number 1 again
- OSD is restarted and sees delete(v=2), big_write(v=1) in the journal
- dirty_db sequence gets broken and OSD crashes with assert("Writes and deletes shouldn't happen at the same time")
2020-12-04 11:47:27 +03:00
Vitaliy Filippov
bcc8e697f9
Delete PGs when deleting pools
...
(All OSD crash with "Online PG count change not allowed" if you try to delete an active pool though)
2020-12-04 11:47:27 +03:00
Vitaliy Filippov
a4c46ba745
Add jerasure EC support (reed_sol_van, others are slower) (not tested yet)
2020-12-04 11:47:27 +03:00
Vitaliy Filippov
5596ad8997
Use custom QEMU build for CentOS 7
2020-12-04 11:47:05 +03:00
Vitaliy Filippov
59c29b0cee
Fix RPATH for CentOS builds, add additional repos into the CentOS installation instructions
2020-12-04 11:47:04 +03:00
Vitaliy Filippov
959089b919
Enable progress_notify=true for etcd watches
2020-11-17 16:29:42 +03:00
Vitaliy Filippov
d3e7749616
Final fixes for packaging
2020-11-10 23:33:07 +03:00
Vitaliy Filippov
b56f8820ec
Container packaging for Debian 11 Bullseye, CentOS 7 and CentOS 8
2020-11-10 00:02:53 +03:00
Vitaliy Filippov
4bd2bd48eb
Build Vitastor packages, too
2020-11-09 14:41:39 +03:00
Vitaliy Filippov
a3fc9f8d7d
Add a Dockerfile to build patched QEMU for Debian (Buster)
2020-11-09 02:30:41 +03:00
Vitaliy Filippov
530975aed7
Make it also build with GCC 8 and on Debian Buster
2020-11-09 00:07:07 +03:00
Vitaliy Filippov
1446aad107
Simple patch for qemu-kvm .spec
2020-11-08 02:14:53 +03:00
Vitaliy Filippov
46479e2456
Add RPM build scripts for CentOS 8
2020-11-08 01:55:17 +03:00
Vitaliy Filippov
e41bee72a5
Lower node.js requirement to 10.x
2020-11-08 01:54:12 +03:00
Vitaliy Filippov
2e0f223ddb
Add RPM build scripts for CentOS 7
2020-11-07 01:52:10 +03:00
Vitaliy Filippov
3be7bc29d8
Make it build with QEMU 2.0, too
...
Also begin to work on rpms
2020-11-06 20:05:00 +03:00
Vitaliy Filippov
0c43ff9daf
Add scripts to copy fio and qemu includes to the source package
2020-11-06 18:40:42 +03:00
Vitaliy Filippov
64d471cf53
Add simple Debian packaging
2020-11-06 18:40:42 +03:00
Vitaliy Filippov
809b2ad8cd
Add install target
2020-11-06 01:12:22 +03:00
Vitaliy Filippov
550d4af151
Rename test.cpp to test_shit.cpp (random shit)
2020-11-06 01:12:22 +03:00
Vitaliy Filippov
cf0f23ab8e
Add patches for QEMU QAPI IDL
2020-11-04 23:30:51 +03:00
Vitaliy Filippov
a516fefa8c
Add qemu_module_dummy and qemu_stamp_xxx to qemu_driver.c
2020-11-04 23:10:29 +03:00
Vitaliy Filippov
3b7279b376
Add Ceph EC 2+1 test results
2020-11-01 14:13:35 +03:00
Vitaliy Filippov
824ea507d0
Do not try to push more segments than IOV_MAX at once as it leads to EMSGSIZE
2020-10-30 01:25:43 +03:00
Vitaliy Filippov
23ea409081
Fix "can't get SQE, will fall out of sync with EPOLLET" when overflowing the ring
...
OSDs shouldn't crash or hang with long iodepths anymore
2020-10-30 01:06:36 +03:00
Vitaliy Filippov
2ccb75974b
Fix a rare crash caused by a stopped client still being in write_ready_clients
2020-10-30 01:04:58 +03:00
Vitaliy Filippov
6561d4e040
Validate pool ID before executing the operation
...
Reply -EPIPE for non-existing pools because we assume that it means
that pool config isn't loaded yet. Previously OSD crashed on such operations
2020-10-30 01:02:46 +03:00
Vitaliy Filippov
1eda7f529d
Note about Linux 5.8+
2020-10-28 19:17:22 +03:00
Vitaliy Filippov
0a174bb313
Return success for already finished rollback operations
...
There was a FIXME and I actually hit it during tests :)
2020-10-24 18:46:19 +03:00
Vitaliy Filippov
720985e4c7
Fix NULL rmw buffer after the latest changes and add a testcase for it
2020-10-24 18:29:19 +03:00
Vitaliy Filippov
4872f617a4
Clear connect timeout in stop_client() to stop races during disconnections
2020-10-24 10:37:16 +03:00
Vitaliy Filippov
e8ac08be14
Allow to overwrite incomplete objects or parts of objects to recover them
2020-10-24 02:14:41 +03:00
Vitaliy Filippov
660c2412fb
Improve debugging output for incomplete/degraded
2020-10-24 01:28:47 +03:00
Vitaliy Filippov
faa5e1436f
Attempt journal trim even without new flushes
...
This is the new guaranteed unblocking method which replaces old trims
in init and rollback, and also fixes a possible stall when just several
writes in the beginning of the journal are flushed without triggering
a subsequent trim.
2020-10-24 01:28:47 +03:00
Vitaliy Filippov
5fbe36198a
Fix journal trimming
...
1) Update journal's used_start in memory only after updating journal superblock.
Doing the opposite is incorrect because part of the journal will be lost if writers
overwrite its old beginning.
2) Sync journal device after updating the superblock.
3) Do not trim in rollback and init because trimming there would also require
updating the superblock. And the only reason to trim in both those places was
to unblock writers. And a guaranteed unblocking method will follow in the next
commit :)
2020-10-24 01:08:33 +03:00
Vitaliy Filippov
99c45bb5ed
Fix debugging output during journal loading
2020-10-24 01:08:33 +03:00
Vitaliy Filippov
701eb79422
Stabilize writes before deleting extra chunks to not stall peer journals
2020-10-23 22:45:05 +03:00
Vitaliy Filippov
220bda0667
Fix possible buffer over(under)flow when handling LIST
2020-10-23 02:17:44 +03:00
Vitaliy Filippov
1e8f0328e0
Cancel outbound operations after re-peering PGs
2020-10-22 22:54:38 +00:00
Vitaliy Filippov
f011e0c675
Do not block stabilize by list and list by write
2020-10-22 22:13:40 +00:00
Vitaliy Filippov
1a694c387e
Print slow ops in log
2020-10-20 23:41:23 +00:00
Vitaliy Filippov
738ad5af79
Fix infinite looping in continue_recovery_op() when pg_cancel_write_queue() is called
2020-10-20 22:23:15 +00:00
Vitaliy Filippov
9abf3c17c9
Correct fix for "Pool %u PG %u configuration is invalid" during startup
...
Establish watcher connection after loading PGs
2020-10-20 21:09:14 +00:00
Vitaliy Filippov
d2b901aa09
Fix default auto-created failure domains
2020-10-20 21:07:40 +00:00
Vitaliy Filippov
befff09370
Fix possible crash due to uninitialized ring_data_t in ringloop
2020-10-20 10:44:38 +03:00
Vitaliy Filippov
d1645551d4
Implement write batching
...
Also fix possible race condition which could in theory lead to "command out of sync"
and a buffer overflow that could happen on incorrect server response.
2020-10-20 03:29:17 +03:00
Vitaliy Filippov
7cb561f95a
Add etcd to the example service generator
2020-10-20 01:50:56 +03:00
Vitaliy Filippov
ae480196e2
Add a note about etcd bug, fix simple-offsets.js cmdline
2020-10-19 17:05:45 +03:00
Vitaliy Filippov
398c86f943
Improve PG-related log messages
2020-10-18 12:17:22 +00:00
Vitaliy Filippov
bec5f921a6
Fix buffer overflows in the no_same_sector_overwrites mode
2020-10-17 23:30:16 +00:00