pve-qemu

Commit Graph

Author	SHA1	Message	Date
Fiona Ebner	51232e2e40	fix #5409 : backup: fix copy-before-write timeout The type for the copy-before-write timeout in nanoseconds was wrong. By being just uint32_t, a maximum of slightly over 4 seconds was possible. Larger values would overflow and thus the 45 seconds set by Proxmox's backup with fleecing, resulted in effectively 2 seconds timeout for copy-before-write operations. Reported-by: Friedrich Weber <f.weber@proxmox.com> Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2024-04-29 17:25:20 +02:00
Fiona Ebner	4fbd50e2f9	update submodule and patches to QEMU 9.0.0 Biggest change is that AioContext locking got removed, but no changes required other than dropping the calls to acquire and release it. As a consequence, the single parameter for the bdrv_graph_wrlock() call got removed which also required adaptation. QAPI docs became stricter requiring to document all members. Other minor changes: - Single parameter from migration_is_running() was dropped. - qemu_mutex_(un)lock_iothread() got renamed to bql_(un)lock(). Signed-off-by: Fiona Ebner <f.ebner@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2024-04-29 15:29:52 +02:00
Fiona Ebner	f1eed34ac7	update submodule and patches to QEMU 8.2.2 This version includes both the AioContext lock and the block graph lock, so there might be some deadlocks lurking. It's not possible to disable the block graph lock like was done in QEMU 8.1, because there are no changes like the function bdrv_schedule_unref() that require it. QEMU 9.0 will finally get rid of the AioContext locking. During live-restore with a VirtIO SCSI drive with iothread there is a known racy deadlock related to the AioContext lock. Not new [1], but not sure if more likely now. Should be fixed in QEMU 9.0. The block graph lock comes with annotations that can be checked by clang's TSA. This required changes to the block drivers, i.e. alloc-track, pbs, zeroinit as well as taking the appropriate locks in pve-backup, savevm-async, vma-reader. Local variable shadowing is prohibited via a compiler flag now, required slight adaptation in vma.c. Major changes only affect alloc-track: * It is not possible to call a generated co-wrapper like bdrv_get_info() while holding the block graph lock exclusively [0], which does happen during initialization of alloc-track when the backing hd is set and the refresh_limits driver callback is invoked. The bdrv_get_info() call to get the cluster size is moved to directly after opening the file child in track_open(). The important thing is that at least the request alignment for the write target is used, because then the RMW cycle in bdrv_pwritev will gather enough data from the backing file. Partial cluster allocations in the target are not a fundamental issue, because the driver returns its allocation status based on the bitmap, so any other data that maps to the same cluster will still be copied later by a stream job (or during writes to that cluster). * Replacing the node cannot be done in the track_co_change_backing_file() callback, because it is a coroutine and cannot hold the block graph lock exclusively. So it is moved to the stream job itself with the auto-remove option not having an effect anymore (qemu-server would always set it anyways). In the future, there could either be a special option for the stream job, or maybe the upcoming blockdev-replace QMP command can be used. Replacing the backing child is actually already done in the stream job, so no need to do it in the track_co_change_backing_file() callback. It also cannot be called from a coroutine. Looking at the implementation in the qcow2 driver, it doesn't seem to be intended to change the backing child itself, just update driver-internal state. Other changes: * alloc-track: Error out early when used without auto-remove. Since replacing the node now happens in the stream job, where the option cannot be read from (it's internal to the driver), it will always be treated as 'on'. Makes sure to have users beside qemu-server notice the change (should they even exist). The option can be fully dropped in the future while adding a version guard in qemu-server. * alloc-track: Avoid seemingly superfluous child permission update. Doesn't seem necessary nowadays (maybe after commit "alloc-track: fix deadlock during drop" where the dropping is not rescheduled and delayed anymore or some upstream change). Replacing the block node will already update the permissions of the new node (which was the file child before). Should there really be some issue, instead of having a drop state, this could also be just based off the fact whether there is still a backing child. Dumping the cumulative (shared) permissions for the BDS with a debug print yields the same values after this patch and with QEMU 8.1, namely 3 and 5. * PBS block driver: compile unconditionally. Proxmox VE always needs it and something in the build process changed to make it not enabled by default. Probably would need to move the build option to meson otherwise. * backup: job unreferencing during cleanup needs to happen outside of coroutine, so it was moved to before invoking the clean * mirror: Cherry-pick stable fix to avoid potential deadlock. * savevm-async: migrate_init now can fail, so propagate potential error. * savevm-async: compression counters are not accessible outside migration/ram-compress now, so drop code that prophylactically set it to zero. [0]: https://lore.kernel.org/qemu-devel/220be383-3b0d-4938-b584-69ad214e5d5d@proxmox.com/ [1]: https://lore.kernel.org/qemu-devel/e13b488e-bf13-44f2-acca-e724d14f43fd@proxmox.com/ Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2024-04-26 14:14:06 +02:00
Fiona Ebner	ed159bc32a	add patch to fix deadlock with VirtIO block and iothread during QMP stop Backported from commit bfa36802d1 ("virtio-blk: avoid using ioeventfd state in irqfd conditional") because the rework/rename dataplane -> ioeventfd didn't happen yet. Reported in the community forum [0] and reproduced doing a backup loop to PBS with suspend mode with fio doing heavy IO in the guest and using an RBD storage (with krbd). [0]: https://forum.proxmox.com/threads/141320 Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2024-02-21 20:09:22 +01:00
Fiona Ebner	86460aef76	fix #4507 : add patch to automatically increase NOFILE soft limit In many configurations, e.g. multiple vNICs with multiple queues or with many Ceph OSDs, the default soft limit of 1024 is not enough. QEMU is supposed to work fine with file descriptors >= 1024 and does not use select() on POSIX. Bump the soft limit to the allowed hard limit to avoid issues with the aforementioned configurations. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2024-02-06 10:33:12 +01:00
Thomas Lamprecht	4ff04bdfa5	work around stuck guest IO with iothread and VirtIO block/SCSI This essentially repeats commit `6b7c181` ("add patch to work around stuck guest IO with iothread and VirtIO block/SCSI") with an added fix for the SCSI event virtqueue, which requires special handling. This is to avoid the issue [3] that made the revert `2a49e66` ("Revert "add patch to work around stuck guest IO with iothread and VirtIO block/SCSI"") necessary the first time around. When using iothread, after commits 1665d9326f ("virtio-blk: implement BlockDevOps->drained_begin()") 766aa2de0f ("virtio-scsi: implement BlockDevOps->drained_begin()") it can happen that polling gets stuck when draining. This would cause IO in the guest to get completely stuck. A workaround for users is stopping and resuming the vCPUs because that would also stop and resume the dataplanes which would kick the host notifiers. This can happen with block jobs like backup and drive mirror as well as with hotplug [2]. Reports in the community forum that might be about this issue[0][1] and there is also one in the enterprise support channel. As a workaround in the code, just re-enable notifications and kick the virt queue after draining. Draining is already costly and rare, so no need to worry about a performance penalty here. Take special care to attach the SCSI event virtqueue host notifier with the _no_poll() variant like in virtio_scsi_dataplane_start(). This avoids the issue from the first attempted fix where the iothread would suddenly loop with 100% CPU usage whenever some guest IO came in [3]. This is necessary because of commit 38738f7dbb ("virtio-scsi: don't waste CPU polling the event virtqueue"). See [4] for the relevant discussion. [0]: https://forum.proxmox.com/threads/137286/ [1]: https://forum.proxmox.com/threads/137536/ [2]: https://issues.redhat.com/browse/RHEL-3934 [3]: https://forum.proxmox.com/threads/138140/ [4]: https://lore.kernel.org/qemu-devel/bfc7b20c-2144-46e9-acbc-e726276c5a31@proxmox.com/ Link: https://lore.kernel.org/qemu-devel/20240202153158.788922-1-hreitz@redhat.com/ Originally-by: Fiona Ebner <f.ebner@proxmox.com> [ TL: Update to v2 and rebased patch series handling to v8.1.5 ] Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2024-02-02 19:35:34 +01:00
Fiona Ebner	5e8903f875	stable fixes for corner case in i386 emulation and crash with VNC clipboard Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2024-02-02 19:06:29 +01:00
Fiona Ebner	4b7975e75d	update submodule and patches to QEMU 8.1.5 Most notable fixes from a Proxmox VE perspective are: * "virtio-net: correctly copy vnet header when flushing TX" To prevent a stack overflow that could lead to leaking parts of the QEMU process's memory. * "hw/pflash: implement update buffer for block writes" To prevent an edge case for half-completed writes. This potentially affected EFI disks. * Fixes to i386 emulation and ARM emulation. No changes for patches were necessary (all are just automatic context changes). Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2024-02-02 19:06:29 +01:00
Fiona Ebner	dfac4f3593	pick fix for potential deadlock with QMP resize and iothread While the patch gives bdrv_graph_wrlock() as an example where the issue can manifest, something similar can happen even when that is disabled. Was able to reproduce the issue with while true; do qm resize 115 scsi0 +4M; sleep 1; done while running fio --name=make-mirror-work --size=100M --direct=1 --rw=randwrite \ --bs=4k --ioengine=psync --numjobs=5 --runtime=1200 --time_based in the VM. Fix picked up from: https://lists.nongnu.org/archive/html/qemu-devel/2023-12/msg01102.html Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-12-11 16:56:50 +01:00
Fiona Ebner	df2cc786ee	add fix for vnc clipboard This fixes the host->guest direction with noNVC as a client (and likely others). Reported-by: Friedrich Weber <f.weber@proxmox.com> Signed-off-by: Fiona Ebner <f.ebner@proxmox.com> Tested-by: Friedrich Weber <f.weber@proxmox.com>	2023-11-22 14:19:45 +01:00
Fiona Ebner	89b46e17ec	fix #5054 : backport fix for software reset with SATA The issue prevented FreeBSD 14 VMs with SATA disk from booting. The commit it fixes e2a5d9b3d9c3 ("hw/ide/ahci: simplify and document PxCI handling") is part of stable 8.1.2. The patch was already applied to the block branch upstream: https://lists.nongnu.org/archive/html/qemu-devel/2023-11/msg02711.html Signed-off-by: Fiona Ebner <f.ebner@proxmox.com> Tested-by: Friedrich Weber <f.weber@proxmox.com>	2023-11-20 10:35:00 +01:00
Fiona Ebner	c38e337f5d	revert commit breaking VirtIO network adapters for certain versions of Windows As reported in the community forum [0] and reproduced locally this breaks VirtIO network adapters in (at least) the German ISO of Windows Server 2022. The fix itself was for > Issue is not fatal but as result acpi-index/"PCI Label ID" property > is either not shown in device details page or shows incorrect value. so revert and tolerate that as a stop-gap, rather than have the devices not working at all. [0]: https://forum.proxmox.com/threads/92094/post-605684 Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-11-17 11:52:52 +01:00
Fiona Ebner	a36bda146c	add patch to avoid huge snapshot performance regression Taking a snapshot became prohibitively slow because of the migration_transferred_bytes() call in migration_rate_exceeded() [0]. This also applied to the async snapshot taking in Proxmox VE, so work around the issue until it is fixed upstream. [0]: https://gitlab.com/qemu-project/qemu/-/issues/1821 Signed-off-by: Fiona Ebner <f.ebner@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2023-10-24 15:01:23 +02:00
Fiona Ebner	03ff63aa61	add patch to disable graph locking There are still some issues with graph locking, e.g. deadlocks during backup canceling [0] and initial attempts to fix it didn't work [1]. Because the AioContext locks still exist, it should still be safe to disable graph locking. [0]: https://lists.nongnu.org/archive/html/qemu-devel/2023-09/msg00729.html [1]: https://lists.nongnu.org/archive/html/qemu-devel/2023-09/msg06905.html Signed-off-by: Fiona Ebner <f.ebner@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2023-10-24 15:01:23 +02:00
Fiona Ebner	10e1093325	update submodule and patches to QEMU 8.1.2 Bigger notable changes: * Commit 1a30b0f5d7 ("block: .bdrv_open is non-coroutine and unlocked") broke the PVE backup patches, in particular setting up the backup dump block driver, because bdrv_new_open_driver() cannot be called from a coroutine. To fix it, bdrv_co_open() is used instead, and while it's a much more involved function, the result should be essentially the same. The only difference I noticed is that the BDRV_O_ALLOW_RDWR flag is also set in the resulting bds (block driver state), but that shouldn't hurt. Smaller notable changes: * aio_set_fd_handler() dropped its 'is_external' parameter stating that all callers now pass false in 60f782b6b7 ("aio: remove aio_disable_external() API"). The calls in the PVE patches also passed false, so just drop the parameter too. * global_state_store() does not have a return value anymore, so the user in the PVE savevm-async patch was adapted. For context, see c33f1829f8 ("migration: never fail in global_state_store()"). * Renames affecting the PVE savevm-async patch: migrate_use_block() -> migrate_block() and ram_counters -> mig_stats 9d4b1e5f22 ("migration: Move migrate_use_block() to options.c") aff3f6606d ("migration: Rename ram_counters to mig_stats") Signed-off-by: Fiona Ebner <f.ebner@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2023-10-24 15:01:23 +02:00
Fiona Ebner	816077299c	fix #2874 : SATA: avoid unsolicited write to sector 0 during reset If there is a pending DMA operation during ide_bus_reset(), the fact that the IDEstate is already reset before the operation is canceled can be problematic. In particular, ide_dma_cb() might be called and then use the reset IDEstate which contains the signature after the reset. When used to construct the IO operation this leads to ide_get_sector() returning 0 and nsector being 1. This is particularly bad, because a write command will thus destroy the first sector which often contains a partition table or similar. Upstream discussion: https://lists.nongnu.org/archive/html/qemu-devel/2023-08/msg04239.html Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-09-26 11:30:22 +02:00
Fiona Ebner	df47146afe	add patch fixing fd leak for vhost Each pause+resume operation (which is also done as part of taking a VM snapshot) would increase the number of open file descriptors by the number of vhost devices (e.g. network devices by default). This could lead to crashes during backup and surely other issues once the system limit (default 1024) was reached [0]. [0]: https://forum.proxmox.com/threads/131603/ Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-08-03 17:40:13 +02:00
Fiona Ebner	5919ec1446	add patch fixing resume for snapshot and hibernate with drive with iothread and a dirty bitmap Not difficult to run into, just have a drive with iothread, take a PBS backup and then take a snapshot or hibernate. Resuming will fail with > qemu: qemu_mutex_unlock_impl: Operation not permitted because of not acquiring the correct AioContext first. Migration is not affected, because it runs in coroutine context. Reported in the community forum: https://forum.proxmox.com/threads/129899/ Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-07-28 12:00:50 +02:00
Fiona Ebner	a39364b9d1	update reentrancy patches to version in upstream git The previous version was picked from the mailing list and still had an object_dynamic_cast call in a hot path, which is avoided with the version that landed in git. Also adds a few more exceptions for devices that need reentrancy. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-06-06 16:32:38 +02:00
Fiona Ebner	0f693c2cab	update submodule and patches to QEMU 8.0.2 Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-06-06 16:32:38 +02:00
Fiona Ebner	53b56ca781	add stable patches for 8.0.0 Changes to other patches are all just metadata/context changes except for pvebackup_co_prepare() needing to call bdrv_co_unref() rather than bdrv_unref(), because it is a coroutine itself. This is documented in d6ee2e324e ("block-coroutine-wrapper: Introduce no_co_wrapper"). The change is necessary, because one of the stable fixes converts bdrv_unref and blk_unref into no_co_wrappers (in preparation for a second patch to fix a hang with the block resize QMP command). Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-05-22 15:09:14 +02:00
Fiona Ebner	bf251437e9	update submodule and patches to QEMU 8.0.0 Many changes were necessary this time around: * QAPI was changed to avoid redundant has_* variables, see commit 44ea9d9be3 ("qapi: Start to elide redundant has_FOO in generated C") for details. This affected many QMP commands added by Proxmox too. * Pending querying for migration got split into two functions, one to estimate, one for exact value, see commit c8df4a7aef ("migration: Split save_live_pending() into state_pending_") for details. Relevant for savevm-async and PBS dirty bitmap. Some block (driver) functions got converted to coroutines, so the Proxmox block drivers needed to be adapted. * Alloc track auto-detaching during PBS live restore got broken by AioContext-related changes resulting in a deadlock. The current, hacky method was replaced by a simpler one. Stefan apparently ran into a problem with that when he wrote the driver, but there were improvements in the stream job code since then and I didn't manage to reproduce the issue. It's a separate patch "alloc-track: fix deadlock during drop" for now, you can find the details there. * Async snapshot-related changes: - The pending querying got adapted to the above-mentioned split and a patch is added to optimize it/make it more similar to what upstream code does. - Added initialization of the compression counters (for future-proofing). - It's necessary the hold the BQL (big QEMU lock = iothread mutex) during the setup phase, because block layer functions are used there and not doing so leads to racy, hard-to-debug crashes or hangs. It's necessary to change some upstream code too for this, a version of the patch "migration: for snapshots, hold the BQL during setup callbacks" is intended to be upstreamed. - Need to take the bdrv graph read lock before flushing. * hmp_info_balloon was moved to a different file. * Needed to include a new headers from time to time to still get the correct functions. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-05-22 15:09:14 +02:00
Fiona Ebner	e752bbe5e2	cherry-pick TCG-related stable fixes for 7.2 When turning off the "KVM hardware virtualization" checkbox in Proxmox VE, the TCG accelerator is used, so these fixes are relevant then. The first patch is included to allow cherry-picking the others without changes. Reported-by: Thomas Lamprecht <t.lamprecht@proxmox.com> Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-03-17 15:46:20 +01:00
Fiona Ebner	72fc94c0c6	add patch fixing ACPI CPU hotplug issue with TCG Required for the debian/edk2-vars-generator.py script in the pve-edk2-firmware repository when building the edk2-stable202302 release. Without this patch, the QEMU process spawned by the script would hang indefinietly. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-03-17 12:06:22 +01:00
Fiona Ebner	ffda59f626	add patches to fix regression with LSI SCSI controller The patch 0008-memory-prevent-dma-reentracy-issues.patch introduced a regression for the LSI SCSI controller leading to boot failures [0], because, in its current form, it relies on reentrancy for a particular ram_io region. [0]: https://forum.proxmox.com/threads/123843 Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-03-13 17:36:22 +01:00
Fiona Ebner	3c4f941ac7	add more stable fixes The patches were selected from the recent "Patch Round-up for stable 7.2.1" [0]. Those that should be relevant for our supported use-cases (and the upcoming nvme use-case) were picked. Most of the patches added now have not been submitted to qemu-stable before. The follow-up for the virtio-rng-pci migration fix will break migration between versions with the fix and without the fix when a virtio-pci-rng(-non)-transitional device is used. Luckily Proxmox VE only uses the virtio-pci-rng device, and this was fixed by 0006-virtio-rng-pci-fix-migration-compat-for-vectors.patch which was applied before any public version of Proxmox VE's QEMU 7.2 package was released. [0]: https://lists.nongnu.org/archive/html/qemu-stable/2023-03/msg00010.html [1]: https://bugzilla.redhat.com/show_bug.cgi?id=2162569 Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-03-13 17:36:19 +01:00
Fiona Ebner	3a94e1a186	fixup patch "ide: avoid potential deadlock when draining during trim" The patch was incomplete and (re-)introduced an issue with a potential failing assertion upon cancelation of the DMA request. There is a patch on qemu-devel now[0], and it's the same as this one code-wise (except for comments). But the discussion is still ongoing. While there shouldn't be a real issue with the patch, there might be better approaches. The plan is to use this as a stop-gap for now and pick up the proper solution once it's ready. [0]: https://lists.nongnu.org/archive/html/qemu-devel/2023-03/msg03325.html Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-03-13 17:36:19 +01:00
Fiona Ebner	58659169de	add patch to avoid potential deadlock with trim for IDE/SATA and draining In particular, the deadlock can occur, together with unlucky timing between the QEMU threads, when the guest is issuing trim requests during the start of a backup operation. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com> [ T: resolve trivial merge conflict in series file ] Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2023-03-08 14:22:36 +01:00
Fiona Ebner	10691e04e9	add patch fixing Linux boot failures with megasas SCSI A regression in 7.2 and easily reproduced. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-03-07 19:50:12 +01:00
Fiona Ebner	00e2507aac	add fix for iscsi double free issue leading to crashes Reported here[0] and here[1]. [0]: https://gitlab.com/qemu-project/qemu/-/issues/1378 [1]: https://forum.proxmox.com/threads/122776/ Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-02-21 13:49:19 +01:00
Fiona Ebner	e7e5f63573	add patch fixing DMA reentrancy issues that could lead to use-after-frees and stack overflows with a malicious (or buggy) guest. See [0] for a good summary: [0]: https://lore.kernel.org/qemu-devel/CAFEAcA_23vc7hE3iaM-JVA6W38LK4hJoWae5KcknhPRD5fPBZA@mail.gmail.com Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-02-21 10:18:35 +01:00
Fiona Ebner	d5f6ef56f0	add patch to fix issue with VirtIO disk using detect-zeroes=unmap Affects Proxmox VE, when the discard disk setting is used for a VirtIO disk. Upstream bug report: https://gitlab.com/qemu-project/qemu/-/issues/1404 Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-01-27 09:36:41 +01:00
Fiona Ebner	48c307550a	add regression fix for migration with virtio-rng device between QEMU less than 7.2 and QEMU 7.2 without the fix (both directions are affected). As mentioned in the patch message, this fix itself will break migration between QEMU 7.2 and QEMU 7.2 with the fix (in both directions, if a virtio-rng device is attached), but this is fine, because no pve-qemu-kvm package with QEMU 7.2 has been publicly released yet. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-01-12 13:10:19 +01:00
Fiona Ebner	f64132208a	cherry-pick stable fixes for 7.2 Two for virtio-mem and one for vIOMMU. Both features are not yet exposed in PVE's qemu-server, but planned to be added. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-01-10 15:42:28 +01:00
Fiona Ebner	d03e1b3ce3	update submodule and patches to 7.2.0 User-facing breaking change: The slirp submodule for user networking got removed. It would be necessary to add the --enable-slirp option to the build and/or install the appropriate library to continue building it. Since PVE is not explicitly supporting it, it would require additionally installing the libslirp0 package on all installations and there is very little mention on the community forum when searching for "slirp" or "netdev user", the plan is to only enable it again if there is some real demand for it. Notable changes: * The big change for this release is the rework of job locking, using a job mutex and introducing _locked() variants of job API functions moving away from call-side AioContext locking. See (in the qemu submodule) commit 6f592e5aca ("job.c: enable job lock/unlock and remove Aiocontext locks") and previous commits for context. Changes required for the backup patches: * Use WITH_JOB_LOCK_GUARD() and call the _locked() variant of job API functions where appropriate (many are only availalbe as a _locked() variant). * Remove acquiring/releasing AioContext around functions taking the job mutex lock internally. The patch introducing sequential transaction support for jobs needs to temporarily unlock the job mutex to call job_start() when starting the next job in the transaction. * The zeroinit block driver now marks its child as primary. The documentation in include/block/block-common.h states: > Filter node has exactly one FILTERED\|PRIMARY child, and may have > other children which must not have these bits Without this, an assert will trigger when copying to a zeroinit target with qemu-img convert, because bdrv_child_cb_attach() expects any non-PRIMARY child to be not FILTERED: > qemu-img convert -n -p -f raw -O raw input.raw zeroinit:output.raw > qemu-img: ../block.c:1476: bdrv_child_cb_attach: Assertion > `!(child->role & BDRV_CHILD_FILTERED)' failed. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2022-12-16 11:47:20 +01:00
Thomas Lamprecht	8a38e1da9e	cherry-pick "block/block-backend: blk_set_enable_write_cache is IO_CODE" albeit I was short from disarming that GLOBAL_STATE_CODE assert completely, as its just bogus to assert that on runtime for a lot of call sites, rather it should be verified on compilation (function coloring with attributes and maybe a compiler plugin). But, as this is already solved upstream lets take in that patch. Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-11-22 09:19:00 +01:00
Thomas Lamprecht	509409fb64	init: daemonize: defuse PID file resolve error to warning fixes file restore, where we actively unlink the PID file of the transient VM ourself after opening it - while we use it only for tracking when the QEMU process itself has finished start up, it's easier and cleaner to fix this regression now, than to rework that to something that doesn't depends on the PID file at all. Applying Fiona's patch as patch-patch tracked under extra, as I expect that something similar to this gets accepted upstreamed. Link: https://lists.proxmox.com/pipermail/pve-devel/2022-October/054448.html Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-10-28 10:22:26 +02:00
Fiona Ebner	73912aee39	cherry-pick upstream fixes for 7.1.0 Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2022-10-14 14:52:32 +02:00
Fiona Ebner	5b15e2ecaf	update submodule and patches to 7.1.0 Notable changes: * The only big change is the switch to using a custom QIOChannel for savevm-async, because the previously used QEMUFileOps was dropped. Changes to the current implementation: * Switch to vector based methods as required for an IO channel. For short reads the passed-in IO vector is stuffed with zeroes at the end, just to be sure. * For reading: The documentation in include/io/channel.h states that at least one byte should be read, so also error out when whe are at the very end instead of returning 0. * For reading: Fix off-by-one error when request goes beyond end. The wrong code piece was: if ((pos + size) > maxlen) { size = maxlen - pos - 1; } Previously, the last byte would not be read. It's actually possible to get a snapshot .raw file that has content all the way up the final 512 byte (= BDRV_SECTOR_SIZE) boundary without any trailing zero bytes (I wrote a script to do it). Luckily, it didn't cause a real issue, because qemu_loadvm_state() is not interested in the final (i.e. QEMU_VM_VMDESCRIPTION) section. The buffer for reading it is simply freed up afterwards and the function will assume that it read the whole section, even if that's not the case. * For writing: Make use of the generated blk_pwritev() wrapper instead of manually wrapping the coroutine to simplify and save a few lines. * Adapt to changed interfaces for blk_{pread,pwrite}: * a9262f551e ("block: Change blk_{pread,pwrite}() param order") * 3b35d4542c ("block: Add a 'flags' param to blk_pread()") * bf5b16fa40 ("block: Make blk_{pread,pwrite}() return 0 on success") Those changes especially affected the qemu-img dd patches, because the context also changed, but also some of our block drivers used the functions. * Drop qemu-common.h include: it got renamed after essentially everything was moved to other headers. The only remaining user I could find for things dropped from the header between 7.0 and 7.1 was qemu_get_vm_name() in the iscsi-initiatorname patch, but it already includes the header to which the function was moved. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2022-10-14 14:52:29 +02:00
Fabian Ebner	0e88ec19db	add two more stable patches For the io_uring patch, it's not very clear which configurations can trigger it, but it should be rather uncommon. See qemu commit be6a166fde652589761cf70471bcde623e9bd72a for a bit more information. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2022-07-19 17:22:10 +02:00
Fabian Ebner	14ed554660	cherry-pick upstream fixes for 7.0.0 coming in via qemu-stable (except for the vdmk fix, which was tagged for-7.0 on the qemu-devel list, but didn't make it into the release). Also took the chance to switch the gluster fix to the version that made it into upstream. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com> Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2022-06-29 12:29:30 +02:00
Fabian Ebner	dc9827a6a4	update submodule and patches to 7.0.0 Only very minor changes needed: * Most patches in extra (or some version of them) are part of 7.0.0. * aio_set_fd_handler got an extra parameter, but can just pass NULL like we did for the related 'poll' parameter. See QEMU commit 826cc32423db2a99d184dbf4f507c737d7e7a4ae for more. * Add include for qemu/memalign.h in vma.c and vma-writer.c. * Add reverts for fixups of already reverted 0347a8fd4c ("block/rbd: implement bdrv_co_block_status") that came in with 7.0.0. Those fixups are not enough, see Proxmox bugzilla #4047. * Two trivial context changes for bitmap-mirror patches. * block_int.h got split up into multiple headers. * Some context changes in configure and meson.build. * Used the oppurtunity to squash fixup of bdrv_backuo_dump_create typo in a later patch into the patch introducing the function (had to move code to new header during rebase). Signed-off-by: Fabian Ebner <f.ebner@proxmox.com> Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2022-06-29 12:29:21 +02:00
Wolfgang Bumiller	53bff441c5	delete patches which were dropped from the series file Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2022-06-08 13:07:04 +02:00
Thomas Lamprecht	309b5c1694	backport various fixes for gluster, qxl and vnc Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-05-11 10:40:14 +02:00
Thomas Lamprecht	4de9440f87	various stable backports Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-04-22 10:22:39 +02:00
Thomas Lamprecht	c8ba14bed0	cherry-pick fix for passing some acpi slic tables Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-04-15 08:07:34 +02:00
Fabian Ebner	4567474e95	update submodule and patches to 6.2.0 Notable changes: * bdrv_co_p{discard,readv,writev,write_zeroes} function signatures changed, to using int64_t for offsets/bytes and some still had int rather than BrdvRequestFlags for the flags. * job_cancel_sync now has a force parameter. Commit messages in 73895f3838cd7fdaf185cf1dbc47be58844a966f 4cfb3f05627ad82af473e7f7ae113c3884cd04e3 sound like using force=true makes more sense. * Added 3 patches coming in via qemu-stable tag, most important one is to work around a librbd issue. * Added another 3 patches from qemu-devel to fix issue leading to crash when live migrating with iothread. * cluster_size calculation helper changed (see patch pve/0026). * QAPI's if conditionals now use 'CONFIG_FOO' rather than 'defined(CONFIG_FOO)' Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2022-02-15 14:03:07 +01:00
Thomas Lamprecht	ddbf7a872d	update submodule and patches to 6.1.1 Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-01-13 10:56:39 +01:00
Fabian Ebner	570d4ad51d	fix #3738 : cherry-pick "block: introduce max_hw_iov for use in scsi-generic" which fixes the bad commit 18473467d55a20d643b6c9b3a52de42f705b4d35 that was tracked down via bisecting, and has a Cc for qemu-stable as well. Issue was easy enough to reproduce with a single virtio-block disk using a few runs of dd if=/dev/urandom of=file bs=1M count=1000 Commit cc071629539dc1f303175a7e2d4ab854c0a8b20f upstream. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2021-12-01 15:34:27 +01:00
Fabian Grünbichler	edbcc10a69	cherry-pick segfault fix this was reported multiple times in our forums[1 with backtraces, 2 & 3 with same log messages], fix is taken from upstream master. 1: https://forum.proxmox.com/threads/pve-7-0-14-1-vm-not-running-live-migration-kills-vm-post-ssd-move-pre-ram-move.99704/ 2: https://forum.proxmox.com/threads/proxmox-7-0-14-1-crashes-vm-during-migrate-to-other-host.99678 3: https://forum.proxmox.com/threads/cannot-migrate-between-zfs-and-ceph.99685/#post-430152 Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2021-11-16 09:23:43 +01:00

1 2

97 Commits (master)