pve-qemu

Commit Graph

Author	SHA1	Message	Date
Fiona Ebner	ffda59f626	add patches to fix regression with LSI SCSI controller The patch 0008-memory-prevent-dma-reentracy-issues.patch introduced a regression for the LSI SCSI controller leading to boot failures [0], because, in its current form, it relies on reentrancy for a particular ram_io region. [0]: https://forum.proxmox.com/threads/123843 Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-03-13 17:36:22 +01:00
Fiona Ebner	3c4f941ac7	add more stable fixes The patches were selected from the recent "Patch Round-up for stable 7.2.1" [0]. Those that should be relevant for our supported use-cases (and the upcoming nvme use-case) were picked. Most of the patches added now have not been submitted to qemu-stable before. The follow-up for the virtio-rng-pci migration fix will break migration between versions with the fix and without the fix when a virtio-pci-rng(-non)-transitional device is used. Luckily Proxmox VE only uses the virtio-pci-rng device, and this was fixed by 0006-virtio-rng-pci-fix-migration-compat-for-vectors.patch which was applied before any public version of Proxmox VE's QEMU 7.2 package was released. [0]: https://lists.nongnu.org/archive/html/qemu-stable/2023-03/msg00010.html [1]: https://bugzilla.redhat.com/show_bug.cgi?id=2162569 Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-03-13 17:36:19 +01:00
Fiona Ebner	3a94e1a186	fixup patch "ide: avoid potential deadlock when draining during trim" The patch was incomplete and (re-)introduced an issue with a potential failing assertion upon cancelation of the DMA request. There is a patch on qemu-devel now[0], and it's the same as this one code-wise (except for comments). But the discussion is still ongoing. While there shouldn't be a real issue with the patch, there might be better approaches. The plan is to use this as a stop-gap for now and pick up the proper solution once it's ready. [0]: https://lists.nongnu.org/archive/html/qemu-devel/2023-03/msg03325.html Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-03-13 17:36:19 +01:00
Fiona Ebner	58659169de	add patch to avoid potential deadlock with trim for IDE/SATA and draining In particular, the deadlock can occur, together with unlucky timing between the QEMU threads, when the guest is issuing trim requests during the start of a backup operation. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com> [ T: resolve trivial merge conflict in series file ] Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2023-03-08 14:22:36 +01:00
Fiona Ebner	10691e04e9	add patch fixing Linux boot failures with megasas SCSI A regression in 7.2 and easily reproduced. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-03-07 19:50:12 +01:00
Fiona Ebner	00e2507aac	add fix for iscsi double free issue leading to crashes Reported here[0] and here[1]. [0]: https://gitlab.com/qemu-project/qemu/-/issues/1378 [1]: https://forum.proxmox.com/threads/122776/ Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-02-21 13:49:19 +01:00
Fiona Ebner	e7e5f63573	add patch fixing DMA reentrancy issues that could lead to use-after-frees and stack overflows with a malicious (or buggy) guest. See [0] for a good summary: [0]: https://lore.kernel.org/qemu-devel/CAFEAcA_23vc7hE3iaM-JVA6W38LK4hJoWae5KcknhPRD5fPBZA@mail.gmail.com Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-02-21 10:18:35 +01:00
Fiona Ebner	1688b43738	QMP backup: use correct errno when getting blockdrive length fails di->size would only be set later. The errno is minus the return value from the function. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-02-21 09:19:16 +01:00
Fiona Ebner	eee064d954	savevm-async: keep more free space when entering final stage In qemu-server, we already allocate 2 * $mem_size + 500 MiB for driver state (which was 32 MiB long ago according to git history). It seems likely that the 30 MiB cutoff in the savevm-async implementation was chosen based on that. In bug #4476 [0], another issue caused the iteration to not make any progress and the state file filled up all the way to the 30 MiB + pending_size cutoff. Since the guest is not stopped immediately after the check, it can still dirty some RAM and the current cutoff is not enough for a reproducer VM (was done while bug #4476 still was not fixed), dirtying memory with > stress-ng -B 2 --bigheap-growth 64.0M After entering the final stage, savevm actually filled up the state file completely, leading to an I/O error. It's probably the same scenario as reported in the bug report, the error message was fixed in commit `a020815` ("savevm-async: fix function name in error message") after the bug report. If not for the bug, the cutoff will only be reached by a VM that's dirtying RAM faster than can be written to the storage, so increase the cutoff to 100 MiB to have a bigger chance to finish successfully, while still trying to not increase downtime too much for non-hibernation snapshots. [0]: https://bugzilla.proxmox.com/show_bug.cgi?id=4476 Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-02-21 08:39:08 +01:00
Fiona Ebner	8051a24b5f	fix #4476 : savevm-async: avoid looping without progress when pend_postcopy is large. By definition, pend_postcopy won't decrease when iterating, so a value larger than the cutoff of 400000 would lead to essentially empty iterations, filling up the state file until only 30 MiB + pending_size remain and the second half of the check would trigger. Avoid this, by not considering pend_postcopy for the cutoff to enter the final phase. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-02-21 08:39:08 +01:00
Fiona Ebner	d5f6ef56f0	add patch to fix issue with VirtIO disk using detect-zeroes=unmap Affects Proxmox VE, when the discard disk setting is used for a VirtIO disk. Upstream bug report: https://gitlab.com/qemu-project/qemu/-/issues/1404 Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-01-27 09:36:41 +01:00
Fiona Ebner	a02081501a	savevm-async: fix function name in error message which also makes it distinguishable from the other "qemu_savevm_state_iterate error" message. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-01-24 17:08:54 +01:00
Fiona Ebner	48c307550a	add regression fix for migration with virtio-rng device between QEMU less than 7.2 and QEMU 7.2 without the fix (both directions are affected). As mentioned in the patch message, this fix itself will break migration between QEMU 7.2 and QEMU 7.2 with the fix (in both directions, if a virtio-rng device is attached), but this is fine, because no pve-qemu-kvm package with QEMU 7.2 has been publicly released yet. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-01-12 13:10:19 +01:00
Fiona Ebner	f64132208a	cherry-pick stable fixes for 7.2 Two for virtio-mem and one for vIOMMU. Both features are not yet exposed in PVE's qemu-server, but planned to be added. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-01-10 15:42:28 +01:00
Fiona Ebner	271ac0a8a7	add QAPI naming exceptions in patches introducing them Avoids a patch and is required to compile when not all patches are applied. No functional change is intended. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-01-10 15:42:16 +01:00
Fiona Ebner	d03e1b3ce3	update submodule and patches to 7.2.0 User-facing breaking change: The slirp submodule for user networking got removed. It would be necessary to add the --enable-slirp option to the build and/or install the appropriate library to continue building it. Since PVE is not explicitly supporting it, it would require additionally installing the libslirp0 package on all installations and there is very little mention on the community forum when searching for "slirp" or "netdev user", the plan is to only enable it again if there is some real demand for it. Notable changes: * The big change for this release is the rework of job locking, using a job mutex and introducing _locked() variants of job API functions moving away from call-side AioContext locking. See (in the qemu submodule) commit 6f592e5aca ("job.c: enable job lock/unlock and remove Aiocontext locks") and previous commits for context. Changes required for the backup patches: * Use WITH_JOB_LOCK_GUARD() and call the _locked() variant of job API functions where appropriate (many are only available as a _locked() variant). * Remove acquiring/releasing AioContext around functions taking the job mutex lock internally. The patch introducing sequential transaction support for jobs needs to temporarily unlock the job mutex to call job_start() when starting the next job in the transaction. * The zeroinit block driver now marks its child as primary. The documentation in include/block/block-common.h states: > Filter node has exactly one FILTERED|PRIMARY child, and may have > other children which must not have these bits Without this, an assert will trigger when copying to a zeroinit target with qemu-img convert, because bdrv_child_cb_attach() expects any non-PRIMARY child to be not FILTERED: > qemu-img convert -n -p -f raw -O raw input.raw zeroinit:output.raw > qemu-img: ../block.c:1476: bdrv_child_cb_attach: Assertion > `!(child->role & BDRV_CHILD_FILTERED)' failed. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2022-12-16 11:47:20 +01:00
Thomas Lamprecht	8a38e1da9e	cherry-pick "block/block-backend: blk_set_enable_write_cache is IO_CODE" albeit I was close to disarming that GLOBAL_STATE_CODE assert completely, as it's just bogus to assert that at runtime for a lot of call sites, rather it should be verified on compilation (function coloring with attributes and maybe a compiler plugin). But, as this is already solved upstream, let's take in that patch. Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-11-22 09:19:00 +01:00
Thomas Lamprecht	509409fb64	init: daemonize: defuse PID file resolve error to warning fixes file restore, where we actively unlink the PID file of the transient VM ourselves after opening it - while we use it only for tracking when the QEMU process itself has finished start up, it's easier and cleaner to fix this regression now, than to rework that to something that doesn't depend on the PID file at all. Applying Fiona's patch as patch-patch tracked under extra, as I expect that something similar to this gets accepted upstream. Link: https://lists.proxmox.com/pipermail/pve-devel/2022-October/054448.html Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-10-28 10:22:26 +02:00
Fiona Ebner	0af826b448	savevm async IO channel: channel writev: fix return value in error case The documentation in include/io/channel.h states that -1 or QIO_CHANNEL_ERR_BLOCK should be returned upon error. Simply passing along the return value from the blk-functions has the potential to confuse the call sides. Non-blocking mode is not implemented currently, so -1 it is. The "return ret" was mistakenly left over from the previous QEMUFileOps based implementation. Also, use error_setg_errno(), since the blk(_co)_p{readv,writev} functions return errno codes. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2022-10-18 15:32:13 +02:00
Fiona Ebner	4e1935c2c9	{alloc track, pbs} block driver: bdrv_co_preadv: adapt return values to be in-line with what other implementations in QEMU do. Commit 1d39c7098bbfa6862cb96066c4f8f6735ea397c5 mentions the EIO bit and the function is expected to return 0 upon success (see other implementations). Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2022-10-14 14:52:36 +02:00
Fiona Ebner	a262e9642b	savevm async: cleaner initialization of target_close_wait member Suggested-by: Wolfgang Bumiller <w.bumiller@proxmox.com> Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2022-10-14 14:52:34 +02:00
Fiona Ebner	73912aee39	cherry-pick upstream fixes for 7.1.0 Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2022-10-14 14:52:32 +02:00
Fiona Ebner	5b15e2ecaf	update submodule and patches to 7.1.0 Notable changes: * The only big change is the switch to using a custom QIOChannel for savevm-async, because the previously used QEMUFileOps was dropped. Changes to the current implementation: * Switch to vector based methods as required for an IO channel. For short reads the passed-in IO vector is stuffed with zeroes at the end, just to be sure. * For reading: The documentation in include/io/channel.h states that at least one byte should be read, so also error out when we are at the very end instead of returning 0. * For reading: Fix off-by-one error when request goes beyond end. The wrong code piece was: if ((pos + size) > maxlen) { size = maxlen - pos - 1; } Previously, the last byte would not be read. It's actually possible to get a snapshot .raw file that has content all the way up the final 512 byte (= BDRV_SECTOR_SIZE) boundary without any trailing zero bytes (I wrote a script to do it). Luckily, it didn't cause a real issue, because qemu_loadvm_state() is not interested in the final (i.e. QEMU_VM_VMDESCRIPTION) section. The buffer for reading it is simply freed up afterwards and the function will assume that it read the whole section, even if that's not the case. * For writing: Make use of the generated blk_pwritev() wrapper instead of manually wrapping the coroutine to simplify and save a few lines. * Adapt to changed interfaces for blk_{pread,pwrite}: * a9262f551e ("block: Change blk_{pread,pwrite}() param order") * 3b35d4542c ("block: Add a 'flags' param to blk_pread()") * bf5b16fa40 ("block: Make blk_{pread,pwrite}() return 0 on success") Those changes especially affected the qemu-img dd patches, because the context also changed, but also some of our block drivers used the functions. * Drop qemu-common.h include: it got renamed after essentially everything was moved to other headers. The only remaining user I could find for things dropped from the header between 7.0 and 7.1 was qemu_get_vm_name() in the iscsi-initiatorname patch, but it already includes the header to which the function was moved. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2022-10-14 14:52:29 +02:00
Wolfgang Bumiller	ed01236593	add patch: PVE Backup: allow passing max-workers performance setting Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2022-10-10 11:55:15 +02:00
Fiona Ebner	1976ca4607	savevm-async: set SAVE_STATE_DONE when closing state file was successful Without this change, it's necessary to send a second savevm-end QMP command after aborting a snapshot, before a new savevm-start QMP command can succeed. In process_savevm_finalize(), no longer set an error in the abort scenario. If there already is another error, there's no need to override it. If canceling was done intentionally, qmp_savevm_end() is responsible for setting the state now. Reported-by: Mira Limbeck <m.limbeck@proxmox.com> Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2022-08-19 09:44:16 +02:00
Fiona Ebner	563c592898	savevm-async: avoid segfault when aborting snapshot Reported in the community forum[0]. For 6.1.0, there were a few changes to the coroutine-sleep API, but the adaptations in `f376b2b` ("update and rebase to QEMU v6.1.0") made a mistake. Currently, target_close_wait is NULL when passed to qemu_co_sleep_ns_wakeable(), which further passes it to qemu_co_sleep(), but there, it is dereferenced when trying to access the 'to_wake' member: > Thread 1 "kvm" received signal SIGSEGV, Segmentation fault. > qemu_co_sleep (w=0x0) at ../util/qemu-coroutine-sleep.c:57 To fix it, create a proper struct and pass its address instead. Also call qemu_co_sleep_wake unconditionally, because the NULL check (for the 'to_wake' member) is done inside the function itself. This patch is based on what the QEMU commits introducing the changes to the coroutine-sleep API did to the callers in QEMU: eaee072085 ("coroutine-sleep: allow qemu_co_sleep_wake that wakes nothing") 29a6ea24eb ("coroutine-sleep: replace QemuCoSleepState pointer with struct in the API") [0]: https://forum.proxmox.com/threads/112130/ Tested-by: Mira Limbeck <m.limbeck@proxmox.com> Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2022-08-19 09:44:14 +02:00
Fabian Ebner	0e88ec19db	add two more stable patches For the io_uring patch, it's not very clear which configurations can trigger it, but it should be rather uncommon. See qemu commit be6a166fde652589761cf70471bcde623e9bd72a for a bit more information. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2022-07-19 17:22:10 +02:00
Fabian Ebner	14ed554660	cherry-pick upstream fixes for 7.0.0 coming in via qemu-stable (except for the vmdk fix, which was tagged for-7.0 on the qemu-devel list, but didn't make it into the release). Also took the chance to switch the gluster fix to the version that made it into upstream. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com> Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2022-06-29 12:29:30 +02:00
Fabian Ebner	dc9827a6a4	update submodule and patches to 7.0.0 Only very minor changes needed: * Most patches in extra (or some version of them) are part of 7.0.0. * aio_set_fd_handler got an extra parameter, but can just pass NULL like we did for the related 'poll' parameter. See QEMU commit 826cc32423db2a99d184dbf4f507c737d7e7a4ae for more. * Add include for qemu/memalign.h in vma.c and vma-writer.c. * Add reverts for fixups of already reverted 0347a8fd4c ("block/rbd: implement bdrv_co_block_status") that came in with 7.0.0. Those fixups are not enough, see Proxmox bugzilla #4047. * Two trivial context changes for bitmap-mirror patches. * block_int.h got split up into multiple headers. * Some context changes in configure and meson.build. * Used the opportunity to squash fixup of bdrv_backuo_dump_create typo in a later patch into the patch introducing the function (had to move code to new header during rebase). Signed-off-by: Fabian Ebner <f.ebner@proxmox.com> Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2022-06-29 12:29:21 +02:00
Thomas Lamprecht	39e84ba82d	vma/alloc-track improvements Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-06-22 15:52:16 +02:00
Thomas Lamprecht	4fd0fa7fb3	re-export patches in normalized form iow. using: git format-patch --zero-commit --no-signature --no-numbered --diff-algorithm=myers ... Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-06-22 15:49:53 +02:00
Dominik Csapak	539e333eaa	add 'namespace' to BlockdevOptionsPbs so that we can use it for the -blockdev options (used for live-restore) Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>	2022-06-22 15:10:49 +02:00
Fabian Ebner	7bd4d8645a	fix #4101 : acquire job's aio context before calling job_unref Otherwise, we might run into an abort via bdrv_co_yield_to_drain() (can at least happen when a disk with iothread is used): > #0 0x00007fef4f5dece1 __GI_raise (libc.so.6 + 0x3bce1) > #1 0x00007fef4f5c8537 __GI_abort (libc.so.6 + 0x25537) > #2 0x00005641bce3c71f error_exit (qemu-system-x86_64 + 0x80371f) > #3 0x00005641bce3d02b qemu_mutex_unlock_impl (qemu-system-x86_64 + 0x80402b) > #4 0x00005641bcd51655 bdrv_co_yield_to_drain (qemu-system-x86_64 + 0x718655) > #5 0x00005641bcd52de8 bdrv_do_drained_begin (qemu-system-x86_64 + 0x719de8) > #6 0x00005641bcd47e07 blk_drain (qemu-system-x86_64 + 0x70ee07) > #7 0x00005641bcd498cd blk_unref (qemu-system-x86_64 + 0x7108cd) > #8 0x00005641bcd31e6f block_job_free (qemu-system-x86_64 + 0x6f8e6f) > #9 0x00005641bcd32d65 job_unref (qemu-system-x86_64 + 0x6f9d65) > #10 0x00005641bcd93b3d pvebackup_co_complete_stream (qemu-system-x86_64 + 0x75ab3d) > #11 0x00005641bce4e353 coroutine_trampoline (qemu-system-x86_64 + 0x815353) Signed-off-by: Fabian Ebner <f.ebner@proxmox.com> Acked-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2022-06-09 14:57:28 +02:00
Wolfgang Bumiller	7f4326d1dc	pbs cleanup fixes Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2022-06-08 13:10:51 +02:00
Wolfgang Bumiller	53bff441c5	delete patches which were dropped from the series file Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2022-06-08 13:07:04 +02:00
Fabian Ebner	dc265df350	add revert to work around performance regression when backing up large RBD disk resulting in QMP timeouts and very slow backups. The plan is to figure out (ideally together with upstream) a way to make the implementation of bdrv_co_block_status for RBD more efficient. But for now, revert the problematic change as a stop-gap measure. Upstream bug report: https://gitlab.com/qemu-project/qemu/-/issues/1026 Forum threads: https://forum.proxmox.com/threads/109272/ https://forum.proxmox.com/threads/109448/ https://forum.proxmox.com/threads/101334/ (partially) Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2022-05-19 09:23:38 +02:00
Wolfgang Bumiller	58a5492e9c	namespace support Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2022-05-12 13:49:35 +02:00
Thomas Lamprecht	309b5c1694	backport various fixes for gluster, qxl and vnc Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-05-11 10:40:14 +02:00
Thomas Lamprecht	f87d0523df	vma: allow partial restore Introduce a new map line for skipping a certain drive, of the form skip=drive-scsi0 Since in PVE, most archives are compressed and piped to vma for restore, it's not easily possible to skip reads. For the reader, a new skip flag for VmaRestoreState is added and the target is allowed to be NULL if skip is specified when registering. If the skip flag is set, no writes will be made as well as no check for duplicate clusters. Therefore, the flag is not set for verify. Originally-by: Fabian Ebner <f.ebner@proxmox.com> Signed-off-by: Fabian Ebner <f.ebner@proxmox.com> Acked-by: Wolfgang Bumiller <w.bumiller@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-04-25 10:07:37 +02:00
Thomas Lamprecht	2fd4ea2813	patches: update context Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-04-25 10:07:01 +02:00
Thomas Lamprecht	2653a5f029	vma: restore: call blk_unref for all opened block devices Originally-by: Fabian Ebner <f.ebner@proxmox.com> Link: https://lists.proxmox.com/pipermail/pve-devel/2022-April/052642.html Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-04-25 10:05:29 +02:00
Thomas Lamprecht	4de9440f87	various stable backports Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-04-22 10:22:39 +02:00
Thomas Lamprecht	c8ba14bed0	cherry-pick fix for passing some acpi slic tables Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-04-15 08:07:34 +02:00
Fabian Ebner	27199bd753	backup: add patch to initialize bcs bitmap early enough for PBS This is necessary for multi-disk backups where not all jobs are immediately started after they are created. QEMU commit 06e0a9c16405c0a4c1eca33cf286cc04c42066a2 did already part of the work, ensuring that new writes after job creation don't pass through to the backup, but not yet for the MIRROR_SYNC_MODE_BITMAP case which is used for PBS. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com> Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2022-03-03 11:37:17 +01:00
Fabian Ebner	f6d40bfdf4	add patch for loading a snapshot with qemu-img dd Will be used when cloning from a qcow2 efidisk. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2022-02-15 14:03:07 +01:00
Fabian Ebner	107132becc	fix getopt-string when introducing -n option for qemu-img dd The colon after U is wrong, because it doesn't take an argument. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2022-02-15 14:03:07 +01:00
Fabian Ebner	4567474e95	update submodule and patches to 6.2.0 Notable changes: * bdrv_co_p{discard,readv,writev,write_zeroes} function signatures changed, to using int64_t for offsets/bytes and some still had int rather than BdrvRequestFlags for the flags. * job_cancel_sync now has a force parameter. Commit messages in 73895f3838cd7fdaf185cf1dbc47be58844a966f 4cfb3f05627ad82af473e7f7ae113c3884cd04e3 sound like using force=true makes more sense. * Added 3 patches coming in via qemu-stable tag, most important one is to work around a librbd issue. * Added another 3 patches from qemu-devel to fix issue leading to crash when live migrating with iothread. * cluster_size calculation helper changed (see patch pve/0026). * QAPI's if conditionals now use 'CONFIG_FOO' rather than 'defined(CONFIG_FOO)' Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2022-02-15 14:03:07 +01:00
Fabian Ebner	2bf61c3eb6	vma: create: register all streams before entering coroutines Otherwise, the header might already get written by a coroutine and registering further streams will fail after that. Also adds a missing g_list_free call for the other GList that's used. Reported in the community forum: https://forum.proxmox.com/threads/104744/ Reproducer script (increase beyond 30 if the issue isn't triggered yet): > #!/usr/bin/perl > > my $dir = "./vma-create-bug"; > mkdir $dir; > > my $archive_path = "$dir/vzdump-qemu-104-2202_02_02-00_00_00.vma"; > unlink $archive_path; > > my $cmd = "vma create $archive_path -v"; > for (my $i = 0; $i < 30; $i++) { > system("truncate -s 1M $dir/drive-virtio$i.img"); > $cmd .= " drive-virtio$i=$dir/drive-virtio$i.img"; > } > system($cmd); Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2022-02-14 15:38:58 +01:00
Thomas Lamprecht	ddbf7a872d	update submodule and patches to 6.1.1 Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-01-13 10:56:39 +01:00
Fabian Ebner	570d4ad51d	fix #3738 : cherry-pick "block: introduce max_hw_iov for use in scsi-generic" which fixes the bad commit 18473467d55a20d643b6c9b3a52de42f705b4d35 that was tracked down via bisecting, and has a Cc for qemu-stable as well. Issue was easy enough to reproduce with a single virtio-block disk using a few runs of dd if=/dev/urandom of=file bs=1M count=1000 Commit cc071629539dc1f303175a7e2d4ab854c0a8b20f upstream. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2021-12-01 15:34:27 +01:00
Dominik Csapak	c5e8e7c998	buildsys: fix build-dependencies on headers for 'vma' and 'pbs_restore' both of them depend on generated header files, so we have to specify them as sources. Otherwise, it happens (at least on some machines) that they will be compiled before the headers are generated, aborting the build. Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>	2021-11-18 08:11:57 +01:00
Fabian Grünbichler	7cf6b60926	fix #3728 : handle machine without type libguestfs starts their helper VMs with `-machine accel=..` without a machine type, and our pve version suffix handling would segfault in that case. there might be other scripted use cases that are affected as well. this regression was introduced with the rebase of our patch set on top of 6.1.0 Fixes: `f376b2b9e2` Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2021-11-17 17:20:26 +01:00
Fabian Grünbichler	edbcc10a69	cherry-pick segfault fix this was reported multiple times in our forums[1 with backtraces, 2 & 3 with same log messages], fix is taken from upstream master. 1: https://forum.proxmox.com/threads/pve-7-0-14-1-vm-not-running-live-migration-kills-vm-post-ssd-move-pre-ram-move.99704/ 2: https://forum.proxmox.com/threads/proxmox-7-0-14-1-crashes-vm-during-migrate-to-other-host.99678 3: https://forum.proxmox.com/threads/cannot-migrate-between-zfs-and-ceph.99685/#post-430152 Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2021-11-16 09:23:43 +01:00
Stefan Reiter	af64ed13eb	add fixup patch for qxl migration logic Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2021-10-13 17:58:18 +02:00
Stefan Reiter	f376b2b9e2	update and rebase to QEMU v6.1.0 Very clean rebase, only the +pve version handling needed manual fixing. Drops two applied patches from extra/ and adds one new from upstream (extra/0001*, fixes VNC over unix sockets) as well as 3 of my own for allowing password changes on custom VNC displays again (as seen and reviewed upstream, but not yet applied). Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2021-10-11 15:13:26 +02:00
Stefan Reiter	26eee146bc	add temporary QMP race fix same as the initial version sent to qemu-devel, it won't be the final fix we plan to upstream but it should be enough of a band-aid to work around how PVE uses the QMP. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com> [ Thomas: add a bit of reasoning to commit message body ] Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2021-09-06 07:28:07 +02:00
Wolfgang Bumiller	277d33454f	drop patch force-disabling smm This drops debian/patches/pve/0005-PVE-Config-smm_available-false.patch (and renumbers the remaining patches) From what I could gather, this patch was originally added due to issues with old kernels. Now we have users which seem to run into issues with the patch. All this does is toggle an option, and it's available via a qemu CLI option anyway, so if dropping this patch causes issues for some people we can just add an option to qemu-server & UI control smm explicitly. Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com> Cc: Alexandre Derumier <aderumier@odiso.com> Tested-by: Stefan Reiter <s.reiter@proxmox.com>	2021-08-24 11:19:05 +02:00
Fabian Ebner	0114d3cd02	io_uring: resubmit when result is -EAGAIN Linux SCSI can throw spurious -EAGAIN in some corner cases in its completion path, which will end up being the result in the completed io_uring request. Resubmitting such requests should allow block jobs to complete, even if such spurious errors are encountered. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2021-07-29 11:51:57 +02:00
Stefan Reiter	8dca018b68	update and rebase to QEMU v6.0.0 Mostly minor changes, bigger ones summarized: * QEMU's internal backup code now uses a new async system, which allows parallel requests - the default max_workers setting is 64, I chose less, since 64 put enough stress on QEMU that the guest became practically unusable during the backup, and 16 still shows quite a nice measurable performance improvement. Little code changes for us though. * 'malformed' QAPI parameters/functions are now a build error (i.e. using '_' vs '-'), I chose to just whitelist our calls in the name of backwards compatibility. * monitor OOB race fix now uses the upstream variant, cherry-picked from origin/master since it's not in 6.0 by default * last patch fixes a bug with snapshot rollback related to the new yank system Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2021-05-28 11:29:44 +02:00
Thomas Lamprecht	0a88214b72	alloc track: use coroutine version of bdrv_pwrite_zeroes as we're in a coroutine here too Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2021-04-06 16:31:53 +02:00
Thomas Lamprecht	76e464784e	pbs block driver: run read in the AIO context of the bs Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2021-04-06 16:31:53 +02:00
Thomas Lamprecht	b36e8acc31	alloc track: acquire BS AIO context during dropping ran into this when live-restoring a backup configured for IO-threads, got the good ol': > qemu: qemu_mutex_unlock_impl: Operation not permitted error. Checking out the history of the related bdrv_backup_top_drop(*bs) method, we can see that it used to do the AIO context acquiring too, but in the backup path this was problematic and was changed to be higher up in the call path in an upstream series from Stefan[0]. That said, this is a completely different code path and it is safe to do so here. We always run from the main thread's AIO context here and we call it only indirectly once, guarded by checking for `s->drop_state == DropNone` and set `s->drop_state = DropRequested` shortly before we schedule the track_drop() in a bh. [0]: https://lists.gnu.org/archive/html/qemu-devel/2020-03/msg09139.html Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2021-04-06 16:27:48 +02:00
Thomas Lamprecht	aa42ea267e	alloc track: keep track_drop() closer to similar block drivers Reads just nicer with a drain begin and end call. Also clearing the backing link of the alloc track BDS makes it closer to bdrv_backup_top_drop() with which this driver has a bit in common. Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2021-04-06 16:27:37 +02:00
Stefan Reiter	e79be6c6c4	add upstream fixes for qmp_block_resize cherry-picked cleanly from 6.0 development tree, fixes an issue with resizing RBD drives (and reportedly also on krbd or potentially other storage backends) with iothreads. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2021-03-30 18:14:37 +02:00
Stefan Reiter	bb751cab32	Add tentative fix for QMP hang Not exactly as sent upstream[0] since we're missing a change in our v5.2.0 branch (irrelevant for us), but functionally works the same. [0] https://lists.gnu.org/archive/html/qemu-devel/2021-03/msg07590.html	2021-03-22 16:52:40 +01:00
Stefan Reiter	677d0d169f	add alloc-track block driver patch See added patches for more info, overview: 0044: slightly increase PBS performance by reducing allocations 0045: slightly increase block-stream performance for Ceph 0046: don't crash with block-stream on RBD 0047: add alloc-track driver for live restore Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2021-03-16 20:53:18 +01:00
Stefan Reiter	e9b36665c7	fix saving and loading dirty bitmaps in snapshots Saving dirty bitmaps from our savevm-async code didn't work, since we use a coroutine which holds the iothread mutex already (upstream savevm is sync, migration uses a thread). Release the mutex before calling the one function that (according to its documentation) requires the lock to not be held: qemu_savevm_state_pending. Additionally, loading dirty bitmaps requires a call to dirty_bitmap_mig_before_vm_start after "loadvm", which the upstream savevm does explicitly afterwards - do that too. This is exposed via the query-proxmox-support property "pbs-dirty-bitmap-savevm". Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2021-03-16 20:44:06 +01:00
Stefan Reiter	40e6b6e5a5	add ACPI compat patch for 5.1 and older machine types Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2021-03-05 15:20:14 +01:00
Stefan Reiter	2413972b46	move bitmap-mirror patches to separate folder ...instead of having them in the middle of the backup related patches. These might (hopefully) become upstream at some point as well. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2021-03-03 14:29:05 +01:00
Stefan Reiter	0c893fd820	clean up pve/ patches by squashing patches of patches No functional change intended. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2021-03-03 14:29:05 +01:00
Stefan Reiter	4194124719	pbs-restore: unref/close target block backend Use blk_unref to drop the last reference, which will close the block backend and flush all caches and outstanding writes. This is especially important for restoring to Ceph, as the userspace librbd caches will not be flushed if the application exits immediately, leading to potentially incomplete restores. Reported-by: Eneko Lacunza <elacunza@binovo.es> Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2021-02-24 19:02:07 +01:00
Thomas Lamprecht	42a90c4e1c	d/patches: backport virtiofsd security fix Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2021-02-24 19:02:07 +01:00
Stefan Reiter	0b8da68824	add PBS master key support Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2021-02-12 10:47:14 +01:00
Stefan Reiter	817b7667e8	Update to QEMU 5.2 Lots of patches touched and some slight changes to the build process since QEMU switched to meson as their build system. Functionality-wise very little rebasing required. New patches introduced: * pve/0058: to fix VMA backups and clean up some code in general with new 5.2 features now available to us (namely coroutine-enabled QMP). * extra/0002: don't build man pages for guest agent when disabled * extra/0003: fix live-migration with hugepages * 0017 and 0018 are adjusted to fix snapshot abort and improve snap performance a bit Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2021-02-12 10:20:01 +01:00
Fabian Ebner	a16eaaffd3	fix #3084: fall back to open-iscsi initiatorname Fixes vma restore when the target is an iSCSI storage which expects that initiatorname. Also avoids the need to always explicitly set the initiatorname in PVE code, thus fixing moving efidisks from and to such iSCSI storages. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2021-02-06 15:09:15 +01:00
Wolfgang Bumiller	b515d45e6b	fix #3225: properly cancel jobs in 'created' state Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2021-01-07 10:26:37 +01:00
Stefan Reiter	cfe02b3b4e	update patches with some pbs-state migration cleanups ...and literal cleanup, as in, call save_cleanup after success or error. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-11-25 11:49:06 +01:00
Stefan Reiter	32ee41155b	update patches with squashed in 'include library version' Signed-off-by: Stefan Reiter <s.reiter@proxmox.com> bump build-dependency on libproxmox-backup-qemu0-dev with version query support Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2020-11-25 11:49:06 +01:00
Thomas Lamprecht	66eae0ae75	fix dirty-bitmap state migration freeze The idea in general is to migrate all the state, which is small for us, in a single step once. But, QEMU only calls save state if we return active true. Hardcoding is-active to return true, like done initially, makes the migration freeze, as QEMU thinks this is never done, and only stops calling us and finishes after a few seconds. So, add a state with an "active" boolean, set to true when initializing a migration, and set it to false when the state was saved. Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-11-05 18:58:15 +01:00
Thomas Lamprecht	f36fa39113	migration/block-dirty-bitmap: migrate other bitmaps even if one fails Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-11-04 18:35:50 +01:00
Thomas Lamprecht	d95ad93eed	apply dirty-bitmap state migration + fix Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-10-29 18:05:43 +01:00
Stefan Reiter	72ae34ecce	Several fixes for backup abort and error reporting Also add my Signed-off-by to some patches where it was missing. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-10-29 17:57:47 +01:00
Stefan Reiter	d333327a1b	Add transaction patches and fix for blocking finish With the transaction patches, patch 0026-PVE-Backup-modify-job-api.patch is no longer necessary, so drop it and rebase all following patches on top. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-09-29 09:21:15 +02:00
Thomas Lamprecht	4b7a18845c	cherry-pick: "usb: fix setup_len init (CVE-2020-14364)" Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-09-14 19:38:34 +02:00
Thomas Lamprecht	7895b0d523	work around #3002: revert "qemu-img convert: Don't pre-zero images" Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-09-14 19:37:45 +02:00
Stefan Reiter	437d68473c	Add systemd journal logging patch Prints QEMU errors that occur after the "-daemonize" fork to the systemd journal, instead of pushing them into /dev/null like before. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-09-08 17:13:29 +02:00
Fabian Grünbichler	a5feeebc51	allow backup of read-only block drives this is needed for template backups with PBS until we have the backup equivalent of 'pbs-restore'. Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2020-08-20 14:11:39 +02:00
Stefan Reiter	60ae3775bf	update to QEMU 5.1 No major semantic changes, mostly just deprecations and changed function signatures. Drop the extra/ patches, as they have been applied upstream. The added extra/ patch was accepted upstream[0] but has not been picked up for 5.1. It is required for non-4M aligned backups to work with PBS. [0] https://lists.gnu.org/archive/html/qemu-devel/2020-08/msg01671.html Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-08-20 13:40:36 +02:00
Thomas Lamprecht	f00a720d7e	PVE: add query-pbs-bitmap-info QMP call Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-08-19 18:11:23 +02:00
Thomas Lamprecht	c5f7dc1d72	PVE: add zero block handling to PBS dump callback Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-08-19 13:56:03 +02:00
Fabian Grünbichler	2821f02d70	fix PBS write callback with big blocks Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2020-08-11 11:14:36 +02:00
Oguz Bektas	95fd47ecb9	patch for possible DoS in qemu network packet processing fixes an assertion failure in qemu network packet processing, which can lead to DoS'ing the qemu process on the host. this affects 'e1000e' and 'vmxnet3' network devices. patch is cherry-picked from the commit mentioned in the oss-security email. more info on oss-security [0] [0]: https://www.openwall.com/lists/oss-security/2020/08/10/1 Signed-off-by: Oguz Bektas <o.bektas@proxmox.com>	2020-08-11 11:08:39 +02:00
Stefan Reiter	f257cc05f4	Fix dirty-bitmap PBS backup with multiple drives "PVE backup: rename incremental to use-dirty-bitmap" merged two variables (use_dirty_bitmap and incremental) into one, but they served two different purposes. Rename the original use_dirty_bitmap to "expect_only_dirty" so the new one doesn't conflict, and rework "PVE: use proxmox_backup_check_incremental" around that semantic. In practice, this had the effect that only one disk at a time would have a bitmap added, as after the first "use_dirty_bitmap" would be set to one and the rest would behave as if the QMP parameter of the same name was unset. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-07-14 10:46:48 +02:00
Wolfgang Bumiller	6d46b2ff4c	fix backup qmp parameters to pass along encryption info Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2020-07-10 13:31:52 +02:00
Thomas Lamprecht	3499c5b45a	PBS patches: block driver, adapt encrypt/compress param, add query-proxmox-support QMP cmd Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-07-09 13:15:49 +02:00
Thomas Lamprecht	4c17eebee4	fixup: proxmox_backup_check_incremental is negated Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-07-06 23:05:42 +02:00
Thomas Lamprecht	3ab149ccdd	update/add PBS integration patches * rename "incremental" param to "use-dirty-bitmap", avoids confusion as the backup can also be incremental with that param set to false. * use new proxmox_backup_check_incremental * fix setting dirty counter and adapt to new connect API semantic Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-07-06 22:13:12 +02:00
Thomas Lamprecht	15b9c76e1f	pbs: query-backup: set reused field also for dirty-bitmap Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-07-03 19:26:09 +02:00
Thomas Lamprecht	d7f4e01a34	debian/patches: squash some followup patches and regroup a bit more together Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-07-02 13:33:16 +02:00
Thomas Lamprecht	20be7fa0a0	backup: improve QAPI info and remove all dirty-bitmaps on failed drive-job effectively two commits merged as one: https://pve.proxmox.com/pipermail/pve-devel/2020-July/044185.html https://pve.proxmox.com/pipermail/pve-devel/2020-July/044194.html Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-07-02 13:03:49 +02:00

228 Commits (master)