opengnsys_ipxe

Commit Graph

Author	SHA1	Message	Date
Michael Brown	bc75bbaf17	[efi] Add DNS headers and GUID definitions Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-06-07 12:57:51 +01:00
Michael Brown	e7adf5701f	[efi] Add Ip4Config2 header and GUID definition Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-06-07 12:57:51 +01:00
Michael Brown	92ab2de3a4	[efi] Add IPv6 versions of existing IPv4 headers and GUID definitions Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-06-07 12:27:06 +01:00
Michael Brown	3184ff74eb	[efi] Update to current EDK2 headers Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-06-07 12:24:42 +01:00
Michael Brown	9cb0a4b8ec	[efi] Disable static assertions in EFI headers on non-EFI platforms The EDK2 headers may be included even in builds for non-EFI platforms. Commits such as `9de6c45` ("[arm] Use -fno-short-enums for all 32-bit ARM builds") have so far ensured that the compile-time checks within the EDK2 headers will pass even when building for a non-EFI platform. As a more general solution, temporarily disable static assertions while including UefiBaseType.h if building on a non-EFI platform. This avoids the need to modify the ABI on other platforms. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-06-07 12:24:03 +01:00
Michael Brown	b0093571f8	[crypto] Add support for PKCS#8 private key format Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-06-02 13:54:42 +01:00
Michael Brown	6a7f560e60	[efi] Implement "shim" as a dummy command on non-EFI platforms The "shim" command will skip downloading the shim binary (and is therefore a conditional no-op) if there is already a selected EFI image that can be executed directly via LoadImage()/StartImage(). This allows the same iPXE script to be used with Secure Boot either enabled or disabled. Generalise this further to provide a dummy "shim" command that is an unconditional no-op on non-EFI platforms. This then allows the same iPXE script to be used for BIOS, EFI with Secure Boot disabled, or EFI with Secure Boot enabled. The same effect could be achieved by using "iseq ${platform} efi" within the script, but this would complicate end-user documentation. To minimise the code size impact, the dummy "shim" command is a pure no-op that does not call parse_options() and so will ignore even standardised arguments such as "--help". Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-05-24 10:20:31 +01:00
Michael Brown	5b43181436	[efi] Support versions of shim that perform SBAT verification The UEFI shim implements a fairly nicely designed revocation mechanism designed around the concept of security generations. Unfortunately nobody in the shim community has thus far added the relevant metadata to the Linux kernel, with the result that current versions of shim are incapable of booting current versions of the Linux kernel. Experience shows that there is unfortunately no point in trying to get a fix for this upstreamed into shim. We therefore default to working around this undesirable behaviour by patching data read from the "SbatLevel" variable used to hold SBAT configuration. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-05-23 15:27:20 +01:00
Michael Brown	d2e1601cf4	[efi] Separate GetMemoryMap() wrapper from shim unlocker Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-05-23 14:52:30 +01:00
Michael Brown	95b8338f0d	[efi] Add "shim" command Allow a shim to be used to facilitate booting a kernel using a script such as: kernel /images/vmlinuz console=ttyS0,115200n8 initrd /images/initrd.img shim /images/shimx64.efi boot Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-05-22 15:37:11 +01:00
Michael Brown	28184b7c22	[efi] Add support for executing images via a shim Add support for using a shim as a helper to execute an EFI image. When a shim has been specified via shim(), the shim image will be passed to LoadImage() instead of the selected EFI image and the command line will be prepended with the name of the selected EFI image. The selected EFI image will be accessible to the shim via the virtual filesystem as a hidden file. Reduce the Secure Boot attack surface by removing, where possible, the spurious requirement for a third party second stage loader binary such as GRUB to be used solely in order to call the "shim lock protocol" entry point. Do not install the EFI PXE APIs when using a shim, since if shim finds EFI_PXE_BASE_CODE_PROTOCOL on the loaded image's device handle then it will attempt to download files afresh instead of using the files already downloaded by iPXE and exposed via the EFI_SIMPLE_FILE_SYSTEM protocol. (Experience shows that there is no point in trying to get a fix for this upstreamed into shim.) Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-05-22 15:37:11 +01:00
Michael Brown	3c214f0465	[efi] Add definitions for the UEFI shim lock protocol The UEFI shim includes a "shim lock protocol" that can be used by a third party second stage loader such as GRUB to verify a kernel image. Add definitions for the relevant portions of this protocol interface. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-05-22 15:37:11 +01:00
Michael Brown	ce2200d5fb	[efi] Add efi_asprintf() and efi_vasprintf() Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-05-22 15:10:16 +01:00
Michael Brown	c4a8d90387	[image] Generalise concept of selected image Most image flags are independent values: any combination of flags may be set for any image, and the flags for one image are independent of the flags for any other image. The "selected" flag does not follow this pattern: at most one image may be marked as selected at any time. When invoking a kernel via the UEFI shim, there will be multiple "special" images: the selected kernel itself, the shim image, and potentially a shim-signed GRUB binary to be used as a crutch to assist shim in loading the kernel (since current versions of the UEFI shim are not capable of directly loading a Linux kernel). Remove the "selected" image flag and replace it with a general concept of an image tag with the same semantics: a given tag may be assigned to at most one image, an image may be found by its tag only while the image is currently registered, and a tag will survive unregistration and reregistration of an image (if it has not already been assigned to a new image). For visual consistency, also replace the current image pointer with a current image tag. The image pointer stored within the image tag holds only a weak reference to the image, since the selection of an image should not prevent that image from being freed. (The strong reference to the currently executing image is held locally within the execution scope of image_exec(), and is logically separate from the current image pointer.) Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-05-17 14:42:03 +01:00
Michael Brown	79d85e29aa	[efi] Attempt to detect EFI images that fail Secure Boot verification An EFI image that is rejected by LoadImage() due to failing Secure Boot verification is still an EFI image. Unfortunately, the extremely broken UEFI Secure Boot model provides no way for us to unambiguously determine that a valid EFI executable image was rejected only because it failed signature verification. We must therefore use heuristics to guess whether not an image that was rejected by LoadImage() could still be loaded via a separate PE loader such as the UEFI shim. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-05-17 14:40:50 +01:00
Michael Brown	03eea19c19	[efi] Allow currently selected image to be opened as "grub.efi" Versions 15.4 and earlier of the UEFI shim are incapable of correctly parsing the command line in order to extract the second stage loader filename, and will always attempt to load "grubx64.efi" or equivalent. Versions 15.3 and later of the UEFI shim are currently incapable of loading a Linux kernel directly anyway, since the kernel does not include SBAT metadata. These versions will require a genuine shim-signed GRUB binary to be used as a crutch to assist shim in loading a Linux kernel. This leaves versions 15.2 and earlier of the UEFI shim (as currently used in e.g. RHEL7) as being capable of directly loading a Linux kernel, but incorrectly attempting to load it using the filename "grubx64.efi" or equivalent. To support the bugs in these older versions of the UEFI shim, allow the currently selected image to be opened via any filename of the form "grub.efi". Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-05-05 14:54:20 +01:00
Michael Brown	0bb0aea878	[efi] Allow currently executing image to be opened via virtual filesystem When invoking a kernel via the UEFI shim, the kernel image must be accessible via EFI_SIMPLE_FILE_SYSTEM_PROTOCOL but must not be present in the magic initrd constructed from all registered images. Re-register a currently executing EFI image and mark it as hidden, thereby allowing it to be accessed via the virtual filesystem exposed via EFI_SIMPLE_FILE_SYSTEM_PROTOCOL without appearing in the magic initrd contents. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-05-05 14:54:20 +01:00
Michael Brown	f9beb20e99	[image] Allow for images to be hidden from lists of all images When invoking a kernel via the UEFI shim, the kernel (and potentially also a helper binary such as GRUB) must be accessible via the virtual filesystem exposed via EFI_SIMPLE_FILE_SYSTEM_PROTOCOL but must not be present in the magic initrd constructed from all registered images. Allow for images to be flagged as hidden, which will cause them to be excluded from API-level lists of all images such as the virtual filesystem directory contents, the magic initrd, or the Multiboot module list. Hidden images remain visible to iPXE commands including "imgstat", which will show a "[HIDDEN]" flag for such images. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-05-05 14:54:20 +01:00
Michael Brown	f93e6b712f	[efi] Show original filenames in debug messages Show the original filename as used by the consumer when calling our EFI_SIMPLE_FILE_SYSTEM_PROTOCOL's Open() method. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-05-05 13:05:28 +01:00
Michael Brown	22cc65535a	[efi] Allow downloaded images to take precedence over constructed files Try searching for a matching registered image before checking for fixed filenames (such as "initrd.magic" for the dynamically generated magic initrd file). This minimises surprise by ensuring that an explicitly downloaded image will always be used verbatim. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-05-05 13:05:28 +01:00
Michael Brown	bd13697446	[efi] Allow for sections to be excluded from the generated PE file Hybrid bzImage and UEFI binaries (such as wimboot) include a bzImage header within a section starting at offset zero, with the PE header effectively occupying unused space within this section. This section should not appear as a named section in the resulting PE file. Allow for the existence of hidden sections that do not result in a section header being written to the PE file. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-04-10 17:02:45 +01:00
Michael Brown	9fb28080d9	[efi] Allow elf2efi to be used for hybrid binaries Hybrid 32-bit BIOS and 64-bit UEFI binaries (such as wimboot) may include R_X86_64_32 relocation records for the 32-bit BIOS portions. These should be ignored when generating PE relocation records, since they apply only to code that cannot be executed within the context of the 64-bit UEFI binary, and creating a 4-byte relocation record is invalid in a binary that may be relocated anywhere within the 64-bit address space (see commit `907cffb` "[efi] Disallow R_X86_64_32 relocations"). Add a "--hybrid" option to elf2efi, which will cause R_X86_64_32 relocation records to be silently discarded. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-04-10 16:51:51 +01:00
Michael Brown	1e4c3789e9	[efi] Shrink size of data directory in PE header Hybrid bzImage and UEFI binaries (such as wimboot) require the PE header to be kept as small as possible, since the bzImage header starts at a fixed offset 0x1f1. The EFI_IMAGE_OPTIONAL_HEADER structures in PeImage.h define an optional header containing 16 data directory entries, of which the last eight are unused in binaries that we create. Shrink the data directory to contain only the first eight entries, to minimise the overall size of the PE header. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-04-10 16:51:49 +01:00
Michael Brown	0d04635ef0	[efi] Remove redundant zero padding in PE header Hybrid bzImage and UEFI binaries (such as wimboot) require the PE header to be kept as small as possible, since the bzImage header starts at a fixed offset 0x1f1. The PE header currently includes 128 bytes of zero padding between the DOS and NT header portions. This padding has been present since commit `81d92c6` ("[efi] Add EFI image format and basic runtime environment") first added support for EFI images in iPXE, and was included on the basis of matching the observed behaviour of the Microsoft toolchain. There appears to be no requirement for this padding to exist: EDK2 binaries built with gcc include only 64 bytes of zero padding, Linux kernel binaries include 66 bytes of non-zero padding, and wimboot binaries include no padding at all. Remove the unnecessary padding between the DOS and NT header portions to minimise the overall size of the PE header. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-04-10 16:50:10 +01:00
Michael Brown	1d1cf74a5e	[tls] Handle fragmented handshake records Originally-implemented-by: Christopher Schenk <christopher@cschenk.net> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-03-30 23:38:43 +01:00
Michael Brown	aa368ba529	[tls] Pass I/O buffer to received record handlers Prepare for the possibility that a record handler may choose not to consume the entire record by passing the I/O buffer and requiring the handler to mark consumed data using iob_pull(). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-03-30 23:37:55 +01:00
Michael Brown	2c6a15d2a3	[tls] Clean up change cipher spec record handling Define and use data structures and constants for the (single-byte) change cipher spec records. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-03-30 16:57:12 +01:00
Michael Brown	09e8a15408	[efi] Claim fixed device paths by uninstalling device path protocol As documented in commits `6a004be` ("[efi] Support the initrd autodetection mechanism in newer Linux kernels") and `04e60a2` ("[efi] Omit EFI_LOAD_FILE2_PROTOCOL for a zero-length initrd"), the choice in Linux of using a fixed device path requires bootloaders to allow for the fact that a previous bootloader may have already installed a handle with the fixed device path. We currently deal with this situation by reusing the existing handle, replacing the EFI_LOAD_FILE2_PROTOCOL instance with our own. Simplify the code by instead uninstalling the EFI_DEVICE_PATH_PROTOCOL instance from the existing handle (if present), thereby allowing the creation of a new handle to succeed. Create the new handle only if we have a non-empty initrd to provide. This works around bugs in bootloaders such as the systemd EFI stub that fail to allow for the existence of multiple-bootloader chains. (The workaround is not comprehensive: if the user has downloaded other images in iPXE before invoking the systemd Unified Kernel Image (UKI), then the systemd EFI stub will still crash and burn since it fails to allow for the fact that a previous bootloader has already installed a handle with the fixed device path.) Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-03-15 16:48:35 +00:00
Matt Parrella	bf25e23d07	[intel] Add workaround for I210 reset hardware bugs The Intel I210's packet buffer size registers reset only on power up, not when a reset signal is asserted. This can lead to the inability to pass traffic in the event that the DMA TX Maximum Packet Size (which does reset to its default value on reset) is bigger than the TX Packet Buffer Size. For example, an operating system may be using the time sensitive networking features of the I210 and the registers may be programmed correctly, but then a reset signal is asserted and iPXE on the next boot will be unable to use the I210. Mimic what Linux does and forcibly set the registers to their default values. Signed-off-by: Matt Parrella <parrella.matthew@gmail.com>	2023-03-14 14:44:32 +00:00
Michael Brown	8f1c120119	[dhcp] Unregister ProxyDHCP and PXEBS settings on a successful DHCPACK When a DHCP transaction does not result in the registration of a new "proxydhcp" or "pxebs" settings block, any existing settings blocks are currently left unaltered. This can cause surprising behaviour. For example: when chainloading iPXE, the "proxydhcp" and "pxebs" settings blocks may be prepopulated using cached values from the previous PXE bootloader. If iPXE performs a subsequent DHCP request, then the DHCP or ProxyDHCP servers may choose to respond differently to iPXE. The response may choose to omit the ProxyDHCP or PXEBS stages, in which case no new "proxydhcp" or "pxebs" settings blocks may be registered. This will result in iPXE using a combination of both old and new DHCP responses. Fix by assuming that a successful DHCPACK effectively acquires ownership of the "proxydhcp" and "pxebs" settings blocks, and that any existing settings blocks should therefore be unregistered. Reported-by: Henry Tung <htung@palantir.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-03-14 11:35:30 +00:00
Michael Brown	54fcb7c29c	[efi] Use image name instead of pointer value in debug messages Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-03-07 14:18:00 +00:00
Michael Brown	9e1f7a3659	[image] Always unregister currently executing image We unregister script images during their execution, to prevent a "boot" command from re-executing the containing script. This also has the side effect of preventing executing scripts from showing up within the Linux magic initrd image (or the Multiboot module list). Additional logic in bzimage.c and efi_file.c prevents a currently executing kernel from showing up within the magic initrd image. Similar logic in multiboot.c prevents the Multiboot kernel from showing up as a Multiboot module. This still leaves some corner cases that are not covered correctly. For example: when using a gzip-compressed kernel image, nothing will currently hide the original compressed image from the magic initrd. Fix by moving the logic that temporarily unregisters the current image from script_exec() to image_exec(), so that it applies to all image types, and simplify the magic initrd and Multiboot module list construction logic on the basis that no further filtering of the registered image list is necessary. This change has the side effect of hiding currently executing EFI images from the virtual filesystem exposed by iPXE. For example, when using iPXE to boot wimboot, the wimboot binary itself will no longer be visible within the virtual filesystem. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-03-07 12:22:19 +00:00
Michael Brown	e51e7bbad7	[image] Consistently use for_each_image() to iterate over images Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-03-06 16:56:37 +00:00
Forest Crossman	523788ccda	[intelx] Add PCI IDs for Intel 82599 10GBASE-T NIC Signed-off-by: Forest Crossman <cyrozap@gmail.com>	2023-03-05 18:22:18 -06:00
Michael Brown	96bb6ba441	[params] Allow for arbitrary HTTP request headers to be specified Extend the request parameter mechanism to allow for arbitrary HTTP headers to be specified via e.g.: params param --header Referer http://www.example.com imgfetch http://192.168.0.1/script.ipxe##params Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-03-01 12:20:02 +00:00
Michael Brown	33cb56cf1b	[params] Rename "form parameter" to "request parameter" Prepare for the parameter mechanism to be generalised to specifying request parameters that are passed via mechanisms other than an application/x-www-form-urlencoded form. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-03-01 11:55:04 +00:00
Michael Brown	60531ff6e2	[http] Use POST method only if the form parameter list is non-empty An attempt to use an existent but empty form parameter list will currently result in an invalid POST request since the Content-Length header will be missing. Fix by using GET instead of POST if the form parameter list is empty. This is a non-breaking change (since the current behaviour produces an invalid request), and simplifies the imminent generalisation of the parameter list concept to handle both header and form parameters. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-03-01 11:12:44 +00:00
Michael Brown	04e60a278a	[efi] Omit EFI_LOAD_FILE2_PROTOCOL for a zero-length initrd When the Linux kernel is being used with no initrd, iPXE will still provide a zero-length initrd.magic file within the virtual filesystem. As of commit `6a004be` ("[efi] Support the initrd autodetection mechanism in newer Linux kernels"), this zero-length file will also be exposed via an EFI_LOAD_FILE2_PROTOCOL instance on a handle with a fixed device path. The correct handling of zero-length files via EFI_LOAD_FILE2_PROTOCOL is unfortunately not well defined. Linux expects the first call to LoadFile() to always fail with EFI_BUFFER_TOO_SMALL. When the initrd is genuinely zero-length, iPXE will return success since the buffer is not too small to hold the (zero-length) file. This causes Linux to immediately report a spurious EFI_LOAD_ERROR boot failure. We could change the logic in iPXE's efi_file_load() to always return EFI_BUFFER_TOO_SMALL if Buffer is NULL on entry. Since the correct behaviour of LoadFile() in the corner case of a zero-length file is left undefined by the UEFI specification, this would be permissible. Unfortunately this approach would not fix the problem. If we return EFI_BUFFER_TOO_SMALL and set the file length to zero, then Linux will call the boot services AllocatePages() method with a zero length. In at least the EDK2 implementation, this combination of parameters will cause AllocatePages() to return EFI_OUT_OF_RESOURCES, and Linux will again report a boot failure. Another approach would be to install the initrd device path handle only if we have a non-empty initrd to offer. Unfortunately this would lead to a failure in yet another corner case: if a previous bootloader has installed an initrd device path handle (e.g. to pass a boot script to iPXE) then we must not leave that initrd in place, since then our loaded kernel would end up seeing the wrong initrd content. The cleanest fix seems to be to ensure that the initrd device path handle is installed with the EFI_DEVICE_PATH_PROTOCOL instance present but with the EFI_LOAD_FILE2_PROTOCOL instance absent (and forcibly uninstalled if necessary), matching the state in which we leave the handle after uninstalling our virtual filesystem. Linux will then not find any handle that supports EFI_LOAD_FILE2_PROTOCOL within the fixed device path, and so will fall through to trying other mechanisms to locate the initrd. Reported-by: Chris Bradshaw <cwbshaw@gmail.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-28 12:30:54 +00:00
Michael Brown	471599dc77	[efi] Split out EFI_RNG_PROTOCOL as a separate entropy source Commit `7ca801d` ("[efi] Use the EFI_RNG_PROTOCOL as an entropy source if available") added EFI_RNG_PROTOCOL as an alternative entropy source via an ad-hoc mechanism specific to efi_entropy.c. Split out EFI_RNG_PROTOCOL to a separate entropy source, and allow the entropy core to handle the selection of RDRAND, EFI_RNG_PROTOCOL, or timer ticks as the active source. The fault detection logic added in commit `a87537d` ("[efi] Detect and disable seriously broken EFI_RNG_PROTOCOL implementations") may be removed completely, since the failure will already be detected by the generic ANS X9.82-mandated repetition count test and will now be handled gracefully by the entropy core. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-20 14:53:10 +00:00
Michael Brown	7d71cf318a	[rng] Allow for entropy sources that fail during startup tests Provide per-source state variables for the repetition count test and adaptive proportion test, to allow for the situation in which an entropy source can be enabled but then fails during the startup tests, thereby requiring an alternative entropy source to be used. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-20 14:53:10 +00:00
Michael Brown	6625e49cea	[tables] Allow any lvalue to be used as a table iterator Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-20 13:46:45 +00:00
Michael Brown	9f17d1116d	[rng] Allow entropy source to be selected at runtime As noted in commit `3c83843` ("[rng] Check for several functioning RTC interrupts"), experimentation shows that Hyper-V cannot be trusted to reliably generate RTC interrupts. (As noted in commit `f3ba0fb` ("[hyperv] Provide timer based on the 10MHz time reference count MSR"), Hyper-V appears to suffer from a general problem in reliably generating any legacy interrupts.) An alternative entropy source is therefore required for an image that may be used in a Hyper-V Gen1 virtual machine. The x86 RDRAND instruction provides a suitable alternative entropy source, but may not be supported by all CPUs. We must therefore allow for multiple entropy sources to be compiled in, with the single active entropy source selected only at runtime. Restructure the internal entropy API to allow a working entropy source to be detected and chosen at runtime. Enable the RDRAND entropy source for all x86 builds, since it is likely to be substantially faster than any other source. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-17 21:29:51 +00:00
Michael Brown	2733c4763a	[iscsi] Limit maximum transfer size to MaxBurstLength We currently specify only the iSCSI default value for MaxBurstLength and ignore any negotiated value, since our internal block device API allows only for receiving directly into caller-allocated buffers and so we have no intrinsic limit on burst length. A conscientious target may however refuse to attempt a transfer that we request for a number of blocks that would exceed the negotiated maximum burst length. Fix by recording the negotiated maximum burst length and using it to limit the maximum number of blocks per transfer as reported by the SCSI layer. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-16 13:27:25 +00:00
Michael Brown	cff857461b	[rng] Add RDRAND as an entropy source Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-15 22:43:33 +00:00
Michael Brown	6a004be0cc	[efi] Support the initrd autodetection mechanism in newer Linux kernels Linux 5.7 added the ability to autodetect an initrd by searching for a handle via a fixed vendor-specific "Linux initrd device path" and then locating and using the EFI_LOAD_FILE2_PROTOCOL instance on that handle. This maps quite naturally onto our existing concept of a "magic initrd" as introduced for EFI in commit `e5f0255` ("[efi] Provide an "initrd.magic" file for use by UEFI kernels"). Add an EFI_LOAD_FILE2_PROTOCOL instance to our EFI virtual files (backed by simply calling the existing EFI_SIMPLE_FILE_SYSTEM_PROTOCOL method to read from the file), and install the protocol instance for the "initrd.magic" virtual file onto a new device handle that also provides the Linux initrd device path. The design choice in Linux of using a single fixed device path makes this unfortunately messy to support, since device paths must be unique within a system. When multiple bootloaders are used (e.g. GRUB loading iPXE loading Linux) then only one bootloader can ever install the device path onto a handle. Subsequent bootloaders must locate the existing handle and replace the load file protocol instance with their own. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-15 17:36:47 +00:00
Michael Brown	cf9ad00afc	[efi] Fix debug message when reading from EFI virtual files Show the requested range when a caller reads from a virtual file via the EFI_SIMPLE_FILE_SYSTEM_PROTOCOL interface. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-15 17:20:39 +00:00
Michael Brown	76a286530a	[image] Check delimiters when parsing command-line key-value arguments The Linux kernel bzImage image format and the CPIO archive constructor will parse the image command line for certain arguments of the form "key=value". This parsing is currently implemented using strstr() in a way that can cause a false positive suffix match. For example, a command line containing "highmem=<n>" would erroneously be treated as containing a value for "mem=<n>". Fix by centralising the logic used for parsing such arguments, and including a check that the argument immediately follows a whitespace delimiter (or is at the start of the string). Reported-by: Filippo Giunchedi <filippo@esaurito.net> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-14 11:13:45 +00:00
Michael Brown	3c83843e11	[rng] Check for several functioning RTC interrupts Commit `74222cd` ("[rng] Check for functioning RTC interrupt") added a check that the RTC is capable of generating interrupts via the legacy PIC, since this mechanism appears to be broken in some Hyper-V virtual machines. Experimentation shows that the RTC is sometimes capable of generating a single interrupt, but will then generate no subsequent interrupts. This currently causes rtc_entropy_check() to falsely detect that the entropy gathering mechanism is functional. Fix by checking for several RTC interrupts before declaring that it is a functional entropy source. Reported-by: Andreas Hammarskjöld <junior@2PintSoftware.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-11 15:11:51 +00:00
Michael Brown	be8ecaf805	[eisa] Check for system board presence before probing for slots EISA expansion slot I/O port addresses overlap space that may be assigned to PCI devices, which can lead to register reads and writes with unwanted side effects during EISA probing. Reduce the chances of performing EISA probing on PCI devices by probing EISA slot vendor and product ID registers only if the EISA system board vendor ID register indicates that the motherboard supports EISA. Debugged-by: Václav Ovsík <vaclav.ovsik@gmail.com> Tested-by: Václav Ovsík <vaclav.ovsik@gmail.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-10 23:34:59 +00:00
Xiaotian Wu	62a1d5c0f5	[loong64] Add initial support for LoongArch64 Add support for building a LoongArch64 Linux userspace binary. Signed-off-by: Xiaotian Wu <wuxiaotian@loongson.cn> Modified-by: Michael Brown <mcb30@ipxe.org> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-06 21:14:17 +00:00
Michael Brown	84cb774390	[test] Include build architecture in test suite banner The test suites for the various architectures are often run back to back, and there is currently nothing to visually distinguish one test run from another. Include the architecture name within the self-test startup banner, to aid in visual identification of test results. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-06 21:06:00 +00:00
Michael Brown	ef0a6f4792	[ioapi] Move PAGE_SHIFT to bits/io.h The PAGE_SHIFT definition is an architectural property, rather than an aspect of a particular I/O API implementation (of which, in theory, there may be more than one per architecture). Reflect this by moving the definition to the top-level bits/io.h for each architecture. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-06 12:34:21 +00:00
Michael Brown	c6901792f0	[build] Allow for per-architecture unprefixed constant operand modifier Over the years, the undocumented operand modifier used to produce the unprefixed constant values in __einfo_error() has varied from "%c0" to "%a0" in commit `1a77466` ("[build] Fix use of inline assembly on GCC 4.8 ARM64 builds") and back to "%c0" in commit `3fb3ffc` ("[build] Fix use of inline assembly on GCC 8 ARM64 builds"), according to the evolving demands of the toolchain. LoongArch64 suffers from a similar issue: GCC 13 will allow either, but the currently released GCC 12 allows only the "%a0" form. Introduce a macro ASM_NO_PREFIX, defined in bits/compiler.h, to abstract away this difference and allow different architectures to use different operand modifiers. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-05 23:55:14 +00:00
Michael Brown	a2bed43939	[xen] Allow for platforms that have no Xen support The Xen headers support only x86 and ARM. Allow for platforms such as LoongArch64 to build despite the absence of Xen support by providing an architecture-specific <bits/xen.h> that simply does: #ifndef _BITS_XEN_H #define _BITS_XEN_H #include <ipxe/nonxen.h> #endif /* _BITS_XEN_H */ Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-05 22:21:36 +00:00
Michael Brown	7cc305f7b4	[efi] Enable NET_PROTO_LLDP by default Requested-by: Christian I. Nilsson <nikize@gmail.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-05 18:54:39 +00:00
Michael Brown	dc16de3204	[lldp] Add support for the Link Layer Discovery Protocol Add support for recording LLDP packets and exposing TLV values via the settings mechanism. LLDP settings are encoded as ${netX.lldp/<prefix>.<type>.<index>.<offset>.<length>} where <type> is the TLV type <offset> is the starting offset within the TLV value <length> is the length (or zero to read the from <offset> to the end) <prefix>, if it has a non-zero value, is the subtype byte string of length <offset> to match at the start of the TLV value, up to a maximum matched length of 4 bytes <index> is the index of the entry matching <type> and <prefix> to be accessed, with zero indicating the first matching entry The <prefix> is designed to accommodate both matching of the OUI within an organization-specific TLV (e.g. 0x0080c2 for IEEE 802.1 TLVs) and of a subtype byte as found within many TLVs. This encoding allows most LLDP values to be extracted easily. For example System name: ${netX.lldp/5.0.0.0:string} System description: ${netX.lldp/6.0.0.0:string} Port description: ${netX.lldp/4.0.0.0:string} Port interface name: ${netX.lldp/5.2.0.1.0:string} Chassis MAC address: ${netX.lldp/4.1.0.1.0:hex} Management IPv4 address: ${netX.lldp/5.1.8.0.2.4:ipv4} Port VLAN ID: ${netX.lldp/0x0080c2.1.127.0.4.2:int16} Port VLAN name: ${netX.lldp/0x0080c2.3.127.0.7.0:string} Maximum frame size: ${netX.lldp/0x00120f.4.127.0.4.2:uint16} Originally-implemented-by: Marin Hannache <git@mareo.fr> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-05 18:18:02 +00:00
Michael Brown	8450fa4a7b	[dhcp] Ignore DHCPNAK unless originating from the selected DHCP server RFC 2131 leaves undefined the behaviour of the client in response to a DHCPNAK that comes from a server other than the selected DHCP server. A substantial amount of online documentation suggests using multiple independent DHCP servers with non-overlapping ranges in the same subnet in order to provide some minimal redundancy. Experimentation shows that in this setup, at least ISC dhcpd will send a DHCPNAK in response to the client's DHCPREQUEST for an address that is not within the range defined on that server. (Since the requested address does lie within the subnet defined on that server, this will happen regardless of the "authoritative" parameter.) The client will therefore receive a DHCPACK from the selected DHCP server along with one or more DHCPNAKs from each of the non-selected DHCP servers. Filter out responses from non-selected DHCP servers before checking for a DHCPNAK, so that these arguably spurious DHCPNAKs will not cause iPXE to return to the discovery state. Continue to check for DHCPNAK before filtering out responses for non-selected lease addresses, since experimentation shows that the DHCPNAK will usually have an empty yiaddr field. Reported-by: Anders Blomdell <anders.blomdell@control.lth.se> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-03 19:51:58 +00:00
Michael Brown	4e456d9928	[efi] Do not attempt to drive PCI bridge devices The "bridge" driver introduced in `3aa6b79` ("[pci] Add minimal PCI bridge driver") is required only for BIOS builds using the ENA driver, where experimentation shows that we cannot rely on the BIOS to fully assign MMIO addresses. Since the driver is a valid PCI driver, it will end up binding to all PCI bridge devices even on a UEFI platform, where the firmware is likely to have completed MMIO address assignment correctly. This has no impact on most systems since there is generally no UEFI driver for PCI bridges: the enumeration of the whole PCI bus is handled by the PciBusDxe driver bound to the root bridge. Experimentation shows that at least one laptop will freeze at the point that iPXE attempts to bind to the bridge device. No deeper investigation has been carried out to find the root cause. Fix by causing efipci_supported() to return an error unless the configuration space header type indicates a non-bridge device. Reported-by: Marcel Petersen <mp@sbe.de> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-03 16:10:31 +00:00
Xiaotian Wu	d405a0bd84	[util] Add support for LoongArch64 binaries Signed-off-by: Xiaotian Wu <wuxiaotian@loongson.cn> Modified-by: Michael Brown <mcb30@ipxe.org> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-03 12:44:11 +00:00
Michael Brown	8b645eea16	[xen] Update to current Xen headers Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-02 11:19:44 +00:00
Michael Brown	6f250be279	[efi] Allow autoexec script to be located alongside iPXE binary Try loading the autoexec.ipxe script first from the directory containing the iPXE binary (based on the relative file path provided to us via EFI_LOADED_IMAGE_PROTOCOL), then fall back to trying the root directory. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-01 23:54:19 +00:00
Michael Brown	b6304f2984	[realtek] Explicitly disable VLAN offload Some cards seem to have the receive VLAN tag stripping feature enabled by default, which causes received VLAN packets to be misinterpreted as being received by the trunk device. Fix by disabling VLAN tag stripping in the C+ Command Register. Debugged-by: Xinming Lai <yiyihu@gmail.com> Tested-by: Xinming Lai <yiyihu@gmail.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-01 19:09:30 +00:00
Michael Brown	aa85c2918a	[efi] Update to current EDK2 headers Update to pick up the upstream commit bda715b ("MdePkg: Fix UINT64 and INT64 word length for LoongArch64"). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-02-01 10:50:47 +00:00
Michael Brown	66a2ff442d	[tests] Verify ability to sleep the CPU The self-test suite does not currently ever attempt to sleep the CPU. This is an operation that may fail (e.g. by attempting to execute a privileged instruction while running as a Linux userspace binary, or by halting the CPU with all interrupts disabled). Add a trivial self-test to exercise the ability to sleep the CPU without crashing or halting forever. Inspired-by: Xiaotian Wu <wuxiaotian@loongson.cn> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-31 10:17:57 +00:00
Michael Brown	3bcd0d3271	[dhcp] Add IANA-defined values for all current EFI client architectures Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-31 02:00:12 +00:00
Michael Brown	4bb521a8c4	[efi] Accept a command line passed to an iPXE image via LoadOptions Treat a command line passed to iPXE via UEFI LoadOptions as an image to be registered at startup, as is already done for the .lkrn, .pxe, and .exe BIOS images. Originally-implemented-by: Ladi Prosek <lprosek@redhat.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-29 18:56:11 +00:00
Michael Brown	b9be454010	[la64] Import LoongArch64 ProcessorBind.h from EDK2 headers Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-28 19:14:00 +00:00
Michael Brown	e3d543437e	[efi] Update to current EDK2 headers Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-28 17:22:25 +00:00
Michael Brown	137ca5d877	[efi] Mark ConsoleControl.h as a non-imported header The obsolete ConsoleControl.h header is no longer present in the current EDK2 codebase, but is still required for interoperability with old iMacs. Add an iPXE include guard to this file so that the EDK2 header import script will no longer attempt to import it from the EDK2 tree. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-28 17:22:25 +00:00
Michael Brown	900379594a	[efi] Remove deleted directories from EDK2 header import script The IntelFrameworkPkg and EdkCompatibilityPkg directories have been removed from the EDK2 codebase. Remove these directories from the EDK2 header import script. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-28 17:22:25 +00:00
Michael Brown	91944c6341	[efi] Allow for whitespace before #include in imported EDK2 header files Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-28 17:22:25 +00:00
Michael Brown	dac41fc4ec	[efi] Detect SPDX licence identifiers in imported EDK2 headers Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-28 17:22:25 +00:00
Michael Brown	5220bdc524	[legal] Add missing FILE_LICENCE declaration to efi_path.c Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-28 17:15:16 +00:00
Michael Brown	38f54fb413	[legal] Add support for the BSD-2-Clause-Patent licence Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-28 17:07:40 +00:00
Michael Brown	5bf8b11527	[efi] Build util/efirom as a host-only binary As with util/elf2efi32 and util/elf2efi64 in commit `a99e435` ("[efi] Do not rely on ProcessorBind.h when building host binaries"), build util/efirom without using any architecture-specific EDK2 headers since the build host's CPU architecture may not be supported by EDK2. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-28 16:26:28 +00:00
Michael Brown	2d180ce233	[tcp] Update maximum window size to 2MB The current maximum window size of 256kB was calculated based on rough link bandwidth and RTT measurements taken in 2012, and is too small to avoid filling the TCP window on some modern links. Update the list of typical link bandwidth and RTT figures to reflect the modern world, and increase the maximum window size accordingly. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-25 18:34:01 +00:00
Michael Brown	4bffe0f0d9	[pxe] Discard queued PXE UDP packets when under memory pressure The PXE UDP receive queue may grow without limit if the PXE NBP does not call PXENV_UDP_READ sufficiently frequently. Fix by implementing a cache discarder for received PXE UDP packets (similar to the TCP cache discarder). Reported-by: Tal Shorer <shorer@amazon.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-25 10:03:09 +00:00
Mohammed Taha	c5426cdaa9	[golan] Add new PCI ID for NVIDIA BlueField-3 network device Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-23 22:52:30 +00:00
Michael Brown	e72670ad7b	[pxe] Avoid drawing menu items on bottom row of screen Many consoles will scroll immediately upon drawing a character in the rightmost column of the bottom row of the display, in order to be able to advance the cursor to the next character (even if the cursor is disabled). This causes PXE menus to display incorrectly. Specifically, pressing the down arrow key while already on the last menu item may cause the whole screen to scroll and the line to be duplicated. Fix by moving the PXE menu one row up from the bottom of the screen. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-23 20:30:59 +00:00
Michael Brown	68734b9a4d	[efi] Bind to only the topmost instance of the SNP or NII protocols UEFI has the mildly annoying habit of installing copies of the EFI_SIMPLE_NETWORK_PROTOCOL instance on the IPv4 and IPv6 child device handles. This can cause iPXE's SNP driver to attempt to bind to a copy of the EFI_SIMPLE_NETWORK_PROTOCOL that iPXE itself provided on a different handle. Fix by refusing to bind to an SNP (or NII) handle if there exists another instance of the same protocol further up the device path (on the basis that we always want to bind to the highest possible device). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-23 19:27:13 +00:00
Michael Brown	2fef0c541e	[efi] Extend efi_locate_device() to allow searching up the device path Extend the functionality of efi_locate_device() to allow callers to find instances of the protocol that may exist further up the device path. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-23 19:27:13 +00:00
Michael Brown	1cd0a248cc	[efi] Add efi_path_prev() utility function Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-23 19:27:13 +00:00
Michael Brown	204d39222a	[efi] Add efi_path_terminate() utility function Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-23 19:27:11 +00:00
Michael Brown	fcfb70bfb2	[arm] Inhibit linker warnings about an implied executable stack Some versions of the 32-bit ARM linker seem to treat the absence of a .note.GNU-stack section as implying an executable stack, and will print a warning that this is deprecated behaviour. Silence the warning by adding a .note.GNU-stack section to each assembly file and retaining the sections in the Linux linker script. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-23 12:55:44 +00:00
Michael Brown	c5e1f007ac	[arm] Use -mfloat-abi=soft only for EFI builds The EFI ABI requires the use of -mfloat-abi=soft, but other platforms may require -mfloat-abi=hard. Allow for this by using -mfloat-abi=soft only for EFI builds. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-23 01:32:14 +00:00
Michael Brown	9de6c45dd3	[arm] Use -fno-short-enums for all 32-bit ARM builds The EFI ABI requires the use of -fno-short-enums, and the EDK2 headers will perform a compile-time check that enums are 32 bits. The EDK2 headers may be included even in builds for non-EFI platforms, and so the -fno-short-enums flag must be used in all 32-bit ARM builds. Fortunately, nothing else currently cares about enum sizes. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-23 01:26:46 +00:00
Michael Brown	8f59911b20	[arm] Support building as a Linux userspace binary for AArch64 Add support for building as a Linux userspace binary for AArch64. This allows the self-test suite to be more easily run for the 64-bit ARM code. For example: # On a native AArch64 system: # make bin-arm64-efi/tests.linux && ./bin-arm64-efi/tests.linux # On a non-AArch64 system (e.g. x86_64) via cross-compilation, # assuming that kernel and glibc headers are present within # /usr/aarch64-linux-gnu/sys-root/: # make bin-arm64-linux/tests.linux CROSS=aarch64-linux-gnu- && \ qemu-aarch64 -L /usr/aarch64-linux-gnu/sys-root/ \ ./bin-arm64-linux/tests.linux Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-22 20:36:57 +00:00
Michael Brown	2061d658b3	[dhcp] Simplify platform-specific client architecture definitions Move the platform-specific DHCP client architecture definitions to header files of the form <ipxe/$(PLATFORM)/dhcparch.h>. This simplifies the directory structure and allows the otherwise unused arch/$(ARCH)/include/$(PLATFORM) to be removed from the include directory search path, which avoids the confusing situation in which a header file may potentially be accessed through more than one path. For Linux userspace binaries on any architecture, use the EFI values for that architecture by delegating to the EFI header file. This avoids the need to explicitly select values for Linux userspace binaries for each architecture. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-22 17:45:34 +00:00
Michael Brown	2ef5f5e05e	[build] Move -Ulinux to common Makefile The requirement to undo the implicit "-Dlinux" is not specific to the x86 architecture. Move this out of the x86-specific Makefile. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-22 16:19:22 +00:00
Michael Brown	475c0dfa8e	[linux] Centralise the linker script for Linux binaries Reduce duplication between i386 and x86_64 by providing a single shared linker script that both architectures can include. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-22 12:38:03 +00:00
Michael Brown	a99e435c8e	[efi] Do not rely on ProcessorBind.h when building host binaries We cannot rely on the EDK2 ProcessorBind.h headers when compiling a binary for execution on the build host itself (e.g. elf2efi), since the host's CPU architecture may not even be supported by EDK2. Fix by skipping ProcessorBind.h when building a host binary, and defining the bare minimum required to allow other EDK2 headers to compile cleanly. Reported-by: Michal Suchánek <msuchanek@suse.de> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-20 00:17:49 +00:00
Alexander Graf	6b977d1250	[ena] Allocate an unused Asynchronous Event Notification Queue (AENQ) We currently don't allocate an Asynchronous Event Notification Queue (AENQ) because we don't actually care about any of the events that may come in. The ENA firmware found on Graviton instances requires the AENQ to exist, otherwise all admin queue commands will fail. Fix by allocating an AENQ and disabling all events (so that we do not need to include code to acknowledge any events that may arrive). Signed-off-by: Alexander Graf <graf@amazon.com>	2023-01-18 22:47:58 +00:00
Michael Brown	08740220ba	[netdevice] Ensure consistent interpretation of "netX" device name Ensure that the "${netX/...}" settings mechanism always uses the same interpretation of the network device corresponding to "netX" as any other mechanism that performs a name-based lookup of a network device. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-17 12:42:46 +00:00
Michael Brown	2dcef4b7a1	[efi] Create VLAN autoboot device automatically When chainloading iPXE from an EFI VLAN device, configure the corresponding iPXE VLAN device to be created automatically. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-15 22:42:30 +00:00
Michael Brown	f07630c74f	[vlan] Support automatic VLAN device creation Add the ability to automatically create a VLAN device for a specified trunk device link-layer address and VLAN tag. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-15 22:35:44 +00:00
Michael Brown	5a2fa6040e	[autoboot] Include VLAN tag in filter for identifying autoboot device When chainloading iPXE from a VLAN device, the MAC address of the loaded image's device handle will match the MAC address of the trunk device created by iPXE, and the autoboot process will then erroneously consider the trunk device to be an autoboot device. Fix by recording the VLAN tag along with the MAC address, and treating the VLAN tag as part of the filter used to match the MAC address against candidate network devices. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-15 21:36:08 +00:00
Michael Brown	c4c03e5be8	[netdevice] Allow duplicate MAC addresses Many laptops now include the ability to specify a "system-specific MAC address" (also known as "pass-through MAC"), which is supposed to be used for both the onboard NIC and for any attached docking station or other USB NIC. This is intended to simplify interoperability with software or hardware that relies on a MAC address to recognise an individual machine: for example, a deployment server may associate the MAC address with a particular operating system image to be deployed. This therefore creates legitimate situations in which duplicate MAC addresses may exist within the same system. As described in commit `98d09a1` ("[netdevice] Avoid registering duplicate network devices"), the Xen netfront driver relies on the rejection of duplicate MAC addresses in order to inhibit registration of the emulated PCI devices that a Xen PV-HVM guest will create to shadow each of the paravirtual network devices. Move the code that rejects duplicate MAC addresses from the network device core to the Xen netfront driver, to allow for the existence of duplicate MAC addresses in non-Xen setups. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-15 00:42:52 +00:00
Michael Brown	47af48012e	[netdevice] Separate concept of scope ID from network device name index The network device index currently serves two purposes: acting as a sequential index for network device names ("net0", "net1", etc), and acting as an opaque unique integer identifier used in socket address scope IDs. There is no particular need for these usages to be linked, and it can lead to situations in which devices are named unexpectedly. For example: if a system has two network devices "net0" and "net1", a VLAN is created as "net1-42", and then a USB NIC is connected, then the USB NIC will be named "net3" rather than the expected "net2" since the VLAN device "net1-42" will have consumed an index. Separate the usages: rename the "index" field to "scope_id" (matching its one and only use case), and assign the name without reference to the scope ID by finding the first unused name. For consistency, assign the scope ID by similarly finding the first unused scope ID. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-14 00:09:20 +00:00
Michael Brown	ab19546386	[efi] Disable receive filters to work around buggy UNDI drivers Some UNDI drivers (such as the AMI UsbNetworkPkg currently in the process of being upstreamed into EDK2) have a bug that will prevent any packets from being received unless at least one attempt has been made to disable some receive filters. Work around these buggy drivers by attempting to disable receive filters before enabling them. Ignore any errors, since we genuinely do not care whether or not the disabling succeeds. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2023-01-11 00:18:18 +00:00
Michael Brown	7147532c3f	[cachedhcp] Retain cached DHCPACK after startup if not already consumed We currently free an unclaimed cached DHCPACK immediately after startup, in order to free up memory. This prevents the cached DHCPACK from being applied to a device that is created after startup, such as a VLAN device created via the "vcreate" command. Retain any unclaimed DHCPACK after startup to allow it to be matched against (and applied to) any device that gets created at runtime. Free the DHCPACK during shutdown if it still remains unclaimed, in order to exit with memory cleanly freed. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-12-22 15:12:34 +00:00
Michael Brown	60b5532cfc	[cachedhcp] Include VLAN tag in filter for applying cached DHCPACK When chainloading iPXE from a VLAN device, the MAC address within the cached DHCPACK will match the MAC address of the trunk device created by iPXE, and the cached DHCPACK will then end up being erroneously applied to the trunk device. This tends to break outbound IPv4 routing, since both the trunk and VLAN devices will have the same assigned IPv4 address. Fix by recording the VLAN tag along with the cached DHCPACK, and treating the VLAN tag as part of the filter used to match the cached DHCPACK against candidate network devices. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-12-22 14:59:29 +00:00
Michael Brown	b9571ca12e	[efi] Add efi_path_vlan() utility function EFI provides no API for determining the VLAN tag (if any) for a specified device handle. There is the EFI_VLAN_CONFIG_PROTOCOL, but that exists only on the trunk device handle (not on the VLAN device handle), and provides no way to match VLAN tags against the trunk device's child device handles. The EDK2 codebase seems to rely solely on the device path to determine the VLAN tag for a specified device handle: both NetLibGetVlanId() and BmGetNetworkDescription() will parse the device path to search for a VLAN_DEVICE_PATH component. Add efi_path_vlan() which uses the same device path parsing logic to determine the VLAN tag. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-12-22 14:27:56 +00:00
Michael Brown	099e4d39b3	[efi] Expose efi_path_next() utility function Provide a single central implementation of the logic for stepping through elements of an EFI device path. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-12-22 13:34:28 +00:00
Michael Brown	0f3ace92c6	[efi] Allow passing a NULL device path to path utility functions Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-12-22 13:30:02 +00:00
Michael Brown	d879c8e4d9	[efi] Provide VLAN configuration protocol UEFI implements VLAN support within the Managed Network Protocol (MNP) driver, which may create child VLAN devices automatically based on stored UEFI variables. These child devices do not themselves provide a raw-packet interface via EFI_SIMPLE_NETWORK_PROTOCOL, and may be consumed only via the EFI_MANAGED_NETWORK_PROTOCOL interface. The device paths constructed for these child devices may conflict with those for the EFI_SIMPLE_NETWORK_PROTOCOL instances that iPXE attempts to install for its own VLAN devices. The upshot is that creating an iPXE VLAN device (e.g. via the "vcreate" command) will fail if the UEFI Managed Network Protocol has already created a device for the same VLAN tag. Fix by providing our own EFI_VLAN_CONFIG_PROTOCOL instance on the same device handle as EFI_SIMPLE_NETWORK_PROTOCOL. This causes the MNP driver to treat iPXE's device as supporting hardware VLAN offload, and it will therefore not attempt to install its own instance of the protocol. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-12-14 11:51:52 +00:00
Michael Brown	5e62b4bc6c	[vlan] Allow external code to identify VLAN priority as well as tag Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-12-14 11:05:37 +00:00
Michael Brown	b0ded89e91	[build] Disable dangling pointer checking for GCC The dangling pointer warning introduced in GCC 12 reports false positives that result in build failures. In particular, storing the address of a local code label used to record the current state of a state machine (as done in crypto/deflate.c) is reported as an error. There seems to be no way to mark the pointer type as being permitted to hold such a value, so unconditionally disable the warning. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-12-14 01:29:49 +00:00
Michael Brown	54c4c1d403	[build] Disable array bounds checking for GCC The array bounds checker on GCC 12 and newer reports a very large number of false positives that result in build failures. In particular, accesses through pointers to zero-length arrays (such as those used by the linker table mechanism in include/ipxe/tables.h) are reported as errors, contrary to the GCC documentation. Work around this GCC issue by unconditionally disabling the warning. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-12-14 00:54:13 +00:00
Christian I. Nilsson	563bff4722	[intel] Add PCI ID for I219-V and -LM 16,17 Signed-off-by: Christian I. Nilsson <nikize@gmail.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-15 13:05:28 +00:00
Michael Brown	2ae5355321	[pci] Backup and restore standard config space across PCIe FLR The behaviour of PCI devices across a function-level reset seems to be inconsistent in practice: some devices will preserve PCI BARs, some will not. Fix the behaviour of FLR on devices that do not preserve PCI BARs by backing up and restoring PCI configuration space across the reset. Preserve only the standard portion of the configuration space, since there may be registers with unexpected side effects in the remaining non-standardised space. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-13 21:38:41 +00:00
Michael Brown	ca2be7e094	[pci] Allow PCI config space backup to be limited by maximum offset Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-13 20:42:09 +00:00
Michael Brown	688646fe6d	[tls] Add GCM cipher suites Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-10 09:58:44 +00:00
Michael Brown	f5c829b6f8	[tests] Verify ability to perform in-place encryption and decryption TLS relies upon the ability of ciphers to perform in-place decryption, in order to avoid allocating additional I/O buffers for received data. Add verification of in-place encryption and decryption to the cipher self-tests. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-10 09:58:44 +00:00
Michael Brown	4acded7e57	[crypto] Support in-place decryption for GCM ciphers The hash calculation is currently performed incorrectly when decrypting in place, since the ciphertext will have been overwritten with the plaintext before being used to update the hash value. Restructure the code to allow for in-place encryption and decryption. Choose to optimise for the decryption case, since we are likely to decrypt much more data than we encrypt. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-10 09:58:37 +00:00
Michael Brown	63fdd9b581	[tests] Verify ability to reset cipher initialisation vector TLS relies upon the ability to reuse a cipher by resetting only the initialisation vector while reusing the existing key. Add verification of resetting the initialisation vector to the cipher self-tests. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-09 16:54:13 +00:00
Michael Brown	63577207ab	[crypto] Ensure relevant GCM cipher state is cleared by cipher_setiv() Reset the accumulated authentication state when cipher_setiv() is called, to allow the cipher to be reused without resetting the key. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-09 16:48:50 +00:00
Michael Brown	7256a6eb24	[tls] Allow handshake digest algorithm to be specified by cipher suite All existing cipher suites use SHA-256 as the TLSv1.2 and above handshake digest algorithm (even when using SHA-1 as the MAC digest algorithm). Some GCM cipher suites use SHA-384 as the handshake digest algorithm. Allow the cipher suite to specify the handshake (and PRF) digest algorithm to be used for TLSv1.2 and above. This requires some restructuring to allow for the fact that the ClientHello message must be included within the handshake digest, even though the relevant digest algorithm is not yet known at the point that the ClientHello is sent. Fortunately, the ClientHello may be reproduced verbatim at the point of receiving the ServerHello, so we rely on reconstructing (rather than storing) this message. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-09 14:49:42 +00:00
Michael Brown	51ecc05490	[tls] Always send maximum supported version in ClientHello Always send the maximum supported version in our ClientHello message, even when performing renegotiation (in which case the current version may already be lower than the maximum supported version). This is permitted by the specification, and allows the ClientHello to be reconstructed verbatim at the point of selecting the handshake digest algorithm in tls_new_server_hello(). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-09 14:49:42 +00:00
Michael Brown	54d83e92f0	[tls] Add support for AEAD ciphers Allow for AEAD cipher suites where the MAC length may be zero and the authentication is instead provided by an authenticating cipher, with the plaintext authentication tag appended to the ciphertext. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-08 15:14:19 +00:00
Michael Brown	186306d619	[tls] Treat invalid block padding as zero length padding Harden against padding oracle attacks by treating invalid block padding as zero length padding, thereby deferring the failure until after computing the (incorrect) MAC. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-08 15:14:06 +00:00
Michael Brown	634a86093a	[tls] Allow for arbitrary-length initialisation vectors Restructure the encryption and decryption operations to allow for the use of ciphers where the initialisation vector is constructed by concatenating the fixed IV (derived as part of key expansion) with a record IV (prepended to the ciphertext). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-08 15:14:04 +00:00
Michael Brown	c453b4c284	[tls] Add MAC length as a cipher suite parameter TLS stream and block ciphers use a MAC with a length equal to the output length of the digest algorithm in use. For AEAD ciphers there is no MAC, with the equivalent functionality provided by the cipher algorithm's authentication tag. Allow for the existence of AEAD cipher suites by making the MAC length a parameter of the cipher suite. Assume that the MAC key length is equal to the MAC length, since this is true for all currently supported cipher suites. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-08 14:09:18 +00:00
Michael Brown	b6eef14858	[tls] Abstract out concept of a TLS authentication header All TLS cipher types use a common structure for the per-record data that is authenticated in addition to the plaintext itself. This data is used as a prefix in the HMAC calculation for stream and block ciphers, or as additional authenticated data for AEAD ciphers. Define a "TLS authentication header" structure to hold this data as a contiguous block, in order to meet the alignment requirement for AEAD ciphers such as GCM. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-08 13:48:45 +00:00
Michael Brown	6a360ebfde	[tls] Ensure cipher alignment size is respected Adjust the length of the first received ciphertext data buffer to ensure that all decryption operations respect the cipher's alignment size. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-07 11:19:49 +00:00
Michael Brown	30243ad739	[crypto] Add concept of cipher alignment size The GCM cipher mode of operation (in common with other counter-based modes of operation) has a notion of blocksize that does not neatly fall into our current abstraction: it does operate in 16-byte blocks but allows for an arbitrary overall data length (i.e. the final block may be incomplete). Model this by adding a concept of alignment size. Each call to encrypt() or decrypt() must begin at a multiple of the alignment size from the start of the data stream. This allows us to model GCM by using a block size of 1 byte and an alignment size of 16 bytes. As a side benefit, this same concept allows us to neatly model the fact that raw AES can encrypt only a single 16-byte block, by specifying an alignment size of zero on this cipher. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-07 11:19:48 +00:00
Michael Brown	d1bc872a2e	[tls] Formalise notions of fixed and record initialisation vectors TLS block ciphers always use CBC (as per RFC 5246 section 6.2.3.2) with a record initialisation vector length that is equal to the cipher block size, and no fixed initialisation vector. The initialisation vector for AEAD ciphers such as GCM is less straightforward, and requires both a fixed and per-record component. Extend the definition of a cipher suite to include fixed and record initialisation vector lengths, and generate the fixed portion (if any) as part of key expansion. Do not add explicit calls to cipher_setiv() in tls_assemble_block() and tls_split_block(), since the constraints imposed by RFC 5246 are specifically chosen to allow implementations to avoid doing so. (Instead, add a sanity check that the record initialisation vector length is equal to the cipher block size.) Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-07 11:19:48 +00:00
Michael Brown	f8565a655e	[tls] Remove support for TLSv1.0 The TLSv1.0 protocol was deprecated by RFC 8996 (along with TLSv1.1), and has been disabled by default in iPXE since commit `dc785b0fb` ("[tls] Default to supporting only TLSv1.1 or above") in June 2020. While there is value in continuing to support older protocols for interoperability with older server appliances, the additional complexity of supporting the implicit initialisation vector for TLSv1.0 is not worth the cost. Remove support for the obsolete TLSv1.0 protocol, to reduce complexity of the implementation and simplify ongoing maintenance. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-07 11:19:48 +00:00
Michael Brown	7b60a48752	[efi] Clear DMA-coherent buffers before mapping The DMA mapping is performed implicitly as part of the call to dma_alloc(). The current implementation creates the IOMMU mapping for the allocated and potentially uninitialised data before returning to the caller (which will immediately zero out or otherwise initialise the buffer). This leaves a small window within which a malicious PCI device could potentially attempt to retrieve firmware-owned secrets present in the uninitialised buffer. (Note that the hypothetically malicious PCI device has no viable way to know the address of the buffer from which to attempt a DMA read, rendering the attack extremely implausible.) Guard against any such hypothetical attacks by zeroing out the allocated buffer prior to creating the coherent DMA mapping. Suggested-by: Mateusz Siwiec <Mateusz.Siwiec@ioactive.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-11-04 20:28:09 +00:00
Michael Brown	f48b01cb01	[bzimage] Fix parsing of "vga=..." when not at end of command line bzimage_parse_cmdline() uses strcmp() to identify the named "vga=..." kernel command line option values, which will give a false negative if the option is not last on the command line. Fix by temporarily changing the relevant command line separator (if any) to a NUL terminator. Debugged-by: Simon Rettberg <simon.rettberg@rz.uni-freiburg.de> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-10-27 13:05:35 +01:00
Michael Brown	8fce26730c	[crypto] Add block cipher Galois/Counter mode of operation Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-10-25 13:21:30 +01:00
Michael Brown	da81214cec	[crypto] Add concept of authentication tag to cipher algorithms Some ciphers (such as GCM) support the concept of a tag that can be used to authenticate the encrypted data. Add a cipher method for generating an authentication tag. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-10-25 13:21:30 +01:00
Michael Brown	0c383bf00a	[crypto] Add concept of additional data to cipher algorithms Some ciphers (such as GCM) support the concept of additional authenticated data, which does not appear in the ciphertext but may affect the operation of the cipher. Allow cipher_encrypt() and cipher_decrypt() to be called with a NULL destination buffer in order to pass additional data. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-10-25 13:21:30 +01:00
Michael Brown	8e478e648f	[crypto] Allow initialisation vector length to vary from cipher blocksize Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-10-25 13:21:28 +01:00
Michael Brown	52f72d298a	[crypto] Expose null crypto algorithm methods for reuse Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-10-25 13:20:22 +01:00
Michael Brown	2c78242732	[tls] Add support for DHE variants of the existing cipher suites Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-10-11 15:42:13 +01:00
Michael Brown	6b2c94d3a7	[tls] Add support for Ephemeral Diffie-Hellman key exchange Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-10-11 15:42:11 +01:00
Michael Brown	ea33ea33c0	[tls] Add key exchange mechanism to definition of cipher suite Allow for the key exchange mechanism to vary depending upon the selected cipher suite. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-10-11 14:37:12 +01:00
Michael Brown	80c45c5c71	[tls] Record ServerKeyExchange record, if provided Accept and record the ServerKeyExchange record, which is required for key exchange mechanisms such as Ephemeral Diffie-Hellman (DHE). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-10-11 14:37:12 +01:00
Michael Brown	028aac99a3	[tls] Generate pre-master secret at point of sending ClientKeyExchange The pre-master secret is currently constructed at the time of instantiating the TLS connection. This precludes the use of key exchange mechanisms such as Ephemeral Diffie-Hellman (DHE), which require a ServerKeyExchange message to exchange additional key material before the pre-master secret can be constructed. Allow for the use of such cipher suites by deferring generation of the master secret until the point of sending the ClientKeyExchange message. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-10-11 14:37:12 +01:00
Michael Brown	1a7317e7d4	[tls] Generate master secret at point of sending ClientKeyExchange The master secret is currently constructed upon receiving the ServerHello message. This precludes the use of key exchange mechanisms such as Ephemeral Diffie-Hellman (DHE), which require a ServerKeyExchange message to exchange additional key material before the pre-master secret and master secret can be constructed. Allow for the use of such cipher suites by deferring generation of the master secret until the point of sending the ClientKeyExchange message. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-10-11 14:37:12 +01:00
Michael Brown	18b861024a	[crypto] Add Ephemeral Diffie-Hellman key exchange algorithm Add an implementation of the Ephemeral Diffie-Hellman key exchange algorithm as defined in RFC2631, with test vectors taken from the NIST Cryptographic Toolkit. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-10-11 14:33:19 +01:00
Michael Brown	007d3cb800	[crypto] Simplify internal HMAC API Simplify the internal HMAC API so that the key is provided only at the point of calling hmac_init(), and the (potentially reduced) key is stored as part of the context for later use by hmac_final(). This simplifies the calling code, and avoids the need for callers such as TLS to allocate a potentially variable length block in order to retain a copy of the unmodified key. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-10-10 12:21:54 +01:00
Michael Brown	88419b608d	[test] Add HMAC self-tests The HMAC code is already tested indirectly via several consuming algorithms that themselves provide self-tests (e.g. HMAC-DRBG, NTLM authentication, and PeerDist content identification), but lacks any direct test vectors. Add explicit HMAC tests and ensure that corner cases such as empty keys, block-length keys, and over-length keys are all covered. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-10-10 12:17:39 +01:00
Michael Brown	081b3eefc4	[ena] Assign memory BAR if left empty by BIOS Some BIOSes in AWS EC2 (observed with a c6i.metal instance in eu-west-2) will fail to assign an MMIO address to the ENA device, which causes ioremap() to fail. Experiments show that the ENA device is the only device behind its bridge, even when multiple ENA devices are present, and that the BIOS does assign a memory window to the bridge. We may therefore choose to assign the device an MMIO address at the start of the bridge's memory window. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-09-19 17:49:25 +01:00
Michael Brown	3aa6b79c8d	[pci] Add minimal PCI bridge driver Add a minimal driver for PCI bridges that can be used to locate the bridge to which a PCI device is attached. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-09-19 17:47:57 +01:00
Michael Brown	649176cd60	[pci] Select PCI I/O API at runtime for cloud images Pretty much all physical machines and off-the-shelf virtual machines will provide a functional PCI BIOS. We therefore default to using only the PCI BIOS, with no fallback to an alternative mechanism if the PCI BIOS fails. AWS EC2 provides the opportunity to experience some exceptions to this rule. For example, the t3a.nano instances in eu-west-1 have no functional PCI BIOS at all. As of commit `83516ba` ("[cloud] Use PCIAPI_DIRECT for cloud images") we therefore use direct Type 1 configuration space accesses in the images built and published for use in the cloud. Recent experience has discovered yet more variation in AWS EC2 instances. For example, some of the metal instance types have multiple PCI host bridges and the direct Type 1 accesses therefore see only a subset of the PCI devices. Attempt to accommodate future such variations by making the PCI I/O API selectable at runtime and choosing ECAM (if available), falling back to the PCI BIOS (if available), then finally falling back to direct Type 1 accesses. This is implemented as a dedicated PCIAPI_CLOUD API, rather than by having the PCI core select a suitable API at runtime (as was done for timers in commit `302f1ee` ("[time] Allow timer to be selected at runtime"). The common case will remain that only the PCI BIOS API is required, and we would prefer to retain the optimisations that come from inlining the configuration space accesses in this common case. Cloud images are (at present) disk images rather than ROM images, and so the increased code size required for this design approach in the PCIAPI_CLOUD case is acceptable. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-09-18 13:41:21 +01:00
Michael Brown	9448ac5445	[bios] Allow pcibios_discover() to return an empty range Allow pcibios_discover() to return an empty range if the INT 1A,B101 PCI BIOS installation check call fails. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-09-18 13:35:58 +01:00
Michael Brown	be667ba948	[pci] Add support for the Enhanced Configuration Access Mechanism (ECAM) The ACPI MCFG table describes a direct mapping of PCI configuration space into MMIO space. This mapping allows access to extended configuration space (up to 4096 bytes) and also provides for the existence of multiple host bridges. Add support for the ECAM mechanism described by the ACPI MCFG table, as a selectable PCI I/O API alongside the existing PCI BIOS and Type 1 mechanisms. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-09-16 01:05:47 +01:00
Michael Brown	ff228f745c	[pci] Generalise pci_num_bus() to pci_discover() Allow pci_find_next() to discover devices beyond the first PCI segment, by generalising pci_num_bus() (which implicitly assumes that there is only a single PCI segment) with pci_discover() (which has the ability to return an arbitrary contiguous chunk of PCI bus:dev.fn address space). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-09-15 16:49:47 +01:00
Michael Brown	56b30364c5	[pci] Check for wraparound in callers of pci_find_next() The semantics of the bus:dev.fn parameter passed to pci_find_next() are "find the first existent PCI device at this address or higher", with the caller expected to increment the address between finding devices. This does not allow the parameter to distinguish between the two cases "start from address zero" and "wrapped after incrementing maximal possible address", which could therefore lead to an infinite loop in the degenerate case that a device with address ffff:ff:1f.7 really exists. Fix by checking for wraparound in the caller (which is already responsible for performing the increment). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-09-15 15:20:58 +01:00
Michael Brown	8fc3c26eae	[pci] Allow pci_find_next() to return non-zero PCI segments Separate the return status code from the returned PCI bus:dev.fn address, in order to allow pci_find_next() to be used to find devices with a non-zero PCI segment number. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-09-15 15:20:58 +01:00
Michael Brown	6459e3b7b1	[linux] Add missing PROVIDE_PCIAPI_INLINE() macros Ensure type consistency of the PCI I/O API methods by adding the missing PROVIDE_PCIAPI_INLINE() macros. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-09-15 15:20:58 +01:00
Michael Brown	8f5fc16143	[ipv6] Ignore SLAAC on prefixes with an incompatible prefix length Experience suggests that routers are often misconfigured to advertise SLAAC even on prefixes that do not have a SLAAC-compatible prefix length. iPXE will currently treat this as an error, resulting in the prefix being ignored completely. Handle this misconfiguration by ignoring the autonomous address flag when the prefix length is unsuitable for SLAAC. Reported-by: Malte Janduda <mail@janduda.net> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-09-13 13:25:19 +01:00
Michael Brown	bc19aeca5f	[ipv6] Fix mask calculation when prefix length is not a multiple of 8 Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-09-06 13:04:19 +01:00
Michael Brown	131daf1aae	[test] Validate constructed IPv6 routing table entries Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-09-06 12:31:32 +01:00
Michael Brown	a80124456e	[ena] Increase receive ring size to 128 entries Some versions of the ENA hardware (observed on a c6i.large instance in eu-west-2) seem to require a receive ring containing at least 128 entries: any smaller ring will never see receive completions or will stall after the first few completions. Increase the receive ring size to 128 entries (determined empirically) for compatibility with these hardware versions. Limit the receive ring fill level to 16 (as at present) to avoid consuming more memory than will typically be available in the internal heap. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-26 19:38:27 +01:00
Michael Brown	3b81a4e256	[ena] Provide a host information page Some versions of the ENA firmware (observed on a c6i.large instance in eu-west-2) seem to require a host information page, without which the CREATE_CQ command will fail with ENA_ADMIN_UNKNOWN_ERROR. These firmware versions also seem to require us to claim that we are a Linux kernel with a specific driver major version number. This appears to be a firmware bug, as revealed by Linux kernel commit 1a63443af ("net/amazon: Ensure that driver version is aligned to the linux kernel"): this commit changed the value of the driver version number field to be the Linux kernel version, and was hastily reverted in commit 92040c6da ("net: ena: fix broken interface between ENA driver and FW") which clarified that the version number field does actually have some undocumented significance to some versions of the firmware. Fix by providing a host information page via the SET_FEATURE command, incorporating the apparently necessary lies about our identity. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-26 19:38:27 +01:00
Michael Brown	9f81e97af5	[ena] Specify the unused completion queue MSI-X vector as 0xffffffff Some versions of the ENA firmware (observed on a c6i.large instance in eu-west-2) will complain if the completion queue's MSI-X vector field is left empty, even though the queue configuration specifies that interrupts are not used. Work around these firmware versions by passing in what appears to be the magic "no MSI-X vector" value in this field. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-26 19:38:27 +01:00
Michael Brown	6d2cead461	[ena] Allow for out-of-order completions The ENA data path design has separate submission and completion queues. Submission queues must be refilled in strict order (since there is only a single linear tail pointer used to communicate the existence of new entries to the hardware), and completion queue entries include a request identifier copied verbatim from the submission queue entry. Once the submission queue doorbell has been rung, software never again reads from the submission queue entry and nothing ever needs to write back to the submission queue entry since completions are reported via the separate completion queue. This design allows the hardware to complete submission queue entries out of order, provided that it internally caches at least as many entries as it leaves gaps. Record and identify I/O buffers by request identifier (using a circular ring buffer of unique request identifiers), and remove the assumption that submission queue entries will be completed in order. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-26 19:38:25 +01:00
Michael Brown	856ffe000e	[ena] Limit submission queue fill level to completion queue size The CREATE_CQ command is permitted to return a size smaller than requested, which could leave us in a situation where the completion queue could overflow. Avoid overflow by limiting the submission queue fill level to the actual size of the completion queue. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-26 19:37:54 +01:00
Michael Brown	c5af41a6f5	[intelxl] Explicitly request a single queue pair for virtual functions Current versions of the E810 PF driver fail to set the number of in-use queue pairs in response to the CONFIG_VSI_QUEUES message. When the number of in-use queue pairs is less than the number of available queue pairs, this results in some packets being directed to nonexistent receive queues and hence silently dropped. Work around this PF driver bug by explicitly configuring the number of available queue pairs via the REQUEST_QUEUES message. This message triggers a VF reset that, in turn, requires us to reopen the admin queue and issue an additional GET_RESOURCES message to restore the VF to a functional state. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-16 19:31:06 +01:00
Michael Brown	04879352c4	[intelxl] Allow for admin commands that trigger a VF reset The RESET_VF admin queue command does not complete via the usual mechanism, but instead requires us to poll registers to wait for the reset to take effect and then reopen the admin queue. Allow for the existence of other admin queue commands that also trigger a VF reset, by separating out the logic that waits for the reset to complete. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-16 19:29:01 +01:00
Michael Brown	491c075f7f	[intelxl] Negotiate virtual function API version 1.1 Negotiate API version 1.1 in order to allow access to virtual function opcodes that are disallowed by default on the E810. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-16 17:58:52 +01:00
Michael Brown	b52ea20841	[intelxl] Show virtual function packet statistics for debugging Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-16 17:58:46 +01:00
Michael Brown	cad1cc6b44	[intelxl] Add driver for Intel 100 Gigabit Ethernet NICs Add a driver for the E810 family of 100 Gigabit Ethernet NICs. The core datapath is identical to that of the 40 Gigabit XL710, and this part of the code is shared between both drivers. The admin queue mechanism is sufficiently similar to make it worth reusing substantial portions of the code, with separate implementations for several commands to handle the (unnecessarily) breaking changes in data structure layouts. The major differences are in the mechanisms for programming queue contexts (where the E810 abandons TX/RX symmetry) and for configuring the transmit scheduler and receive filters: these portions are sufficiently different to justify a separate driver. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-12 16:15:17 +01:00
Michael Brown	6871a7de70	[intelxl] Use admin queue to set port MAC address and maximum frame size Remove knowledge of the PRTGL_SA[HL] registers, and instead use the admin queue to set the MAC address and maximum frame size. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-12 13:24:06 +01:00
Michael Brown	727b034f11	[intelxl] Use admin queue to get port MAC address Remove knowledge of the PRTPM_SA[HL] registers, and instead use the admin queue to retrieve the MAC address. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-12 13:03:12 +01:00
Michael Brown	06467ee70f	[intelxl] Defer fetching MAC address until after opening admin queue Allow for the MAC address to be fetched using an admin queue command, instead of reading the PRTPM_SA[HL] registers directly. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-12 13:03:12 +01:00
Michael Brown	d6e36a2d73	[intelxl] Set maximum frame size to 9728 bytes as per datasheet The PRTGL_SAH register contains the current maximum frame size, and is not guaranteed on reset to contain the actual maximum frame size supported by the hardware, which the datasheet specifies as 9728 bytes (including the 4-byte CRC). Set the maximum packet size to a hardcoded 9728 bytes instead of reading from the PRTGL_SAH register. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-12 13:03:12 +01:00
Michael Brown	99242bbe2e	[intelxl] Always issue "clear PXE mode" admin queue command Remove knowledge of the GLLAN_RCTL_0 register (which changes location between the XL810 and E810 register maps), and instead unconditionally issue the "clear PXE mode" command with the EEXIST error silenced. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-11 15:28:03 +01:00
Michael Brown	faf26bf8b8	[intelxl] Allow expected admin queue command errors to be silenced The "clear PXE mode" admin queue command will return an EEXIST error if the device is already in non-PXE mode, but there is no other admin queue command that can be used to determine whether the device has already been switched into non-PXE mode. Provide a mechanism to allow expected errors from a command to be silenced, to allow the "clear PXE mode" command to be cleanly used without needing to first check the GLLAN_RCTL_0 register value. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-11 15:28:03 +01:00
Michael Brown	f0ea19b238	[intelxl] Increase data buffer size to 4kB At least one E810 admin queue command (Query Default Scheduling Tree Topology) insists upon being provided with a 4kB data buffer, even when the data to be returned is much smaller. Work around this requirement by increasing the admin queue data buffer size to 4kB. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-11 15:24:29 +01:00
Michael Brown	fb69d14002	[intelxl] Separate virtual function driver definitions Move knowledge of the virtual function data structures and admin command definitions from intelxl.h to intelxlvf.h. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-11 14:53:57 +01:00
Michael Brown	c220b93f31	[intelxl] Reuse admin command descriptor and buffer for VF responses Remove the large static admin data buffer structure embedded within struct intelxl_nic, and instead copy the response received via the "send to VF" admin queue event to the (already consumed and completed) admin command descriptor and data buffer. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-11 14:53:57 +01:00
Michael Brown	67f8878e10	[intelxl] Handle admin events via a callback The physical and virtual function drivers each care about precisely one admin queue event type. Simplify event handling by using a per-driver callback instead of the existing weak function symbol. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-11 14:53:54 +01:00
Michael Brown	9e46ffa924	[intelxl] Rename 8086:1889 PCI ID to "iavf" The PCI device ID 8086:1889 is for the Intel Ethernet Adaptive Virtual Function, which is a generic virtual function that can be exposed by different generations of Intel hardware. Rename the PCI ID from "xl710-vf-ad" to "iavf" to reflect that the driver is not XL710-specific. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-10 12:29:47 +01:00
Michael Brown	ef70667557	[intelxl] Increase receive descriptor ring size to 64 entries The E810 requires that receive descriptor rings have at least 64 entries (and are a multiple of 32 entries). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-10 12:29:47 +01:00
Michael Brown	9f5b9e3abb	[intelxl] Negotiate API version for virtual function via admin queue Do not attempt to use the admin commands to get the firmware version and report the driver version for the virtual function driver, since these will be rejected by the E810 firmware as invalid commands when issued by a virtual function. Instead, use the mailbox interface to negotiate the API version with the physical function driver. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-10 12:29:47 +01:00
Michael Brown	b4216fa506	[intelxl] Use non-zero MSI-X vector for virtual function interrupts The 100 Gigabit physical function driver requires a virtual function driver to request that transmit and receive queues are mapped to MSI-X vector 1 or higher. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-10 12:29:47 +01:00
Michael Brown	1b61c2118c	[intelxl] Fix invocation of intelxlvf_admin_queues() The second parameter to intelxlvf_admin_queues() is a boolean used to select the VF opcode, rather than the raw VF opcode itself. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-10 12:29:45 +01:00
Michael Brown	a202de385d	[intelxl] Use function-level reset instead of PFGEN_CTRL.PFSWR Remove knowledge of the PFGEN_CTRL register (which changes location between XL710 and E810 register maps), and instead use PCIe FLR to reset the physical function. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-08 16:43:36 +01:00
Michael Brown	0965cec53c	[pci] Generalise function-level reset mechanism Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-08 16:39:40 +01:00
Michael Brown	9dfcdc04c8	[intelxl] Update list of PCI IDs Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-08 15:59:55 +01:00
Michael Brown	d8014b1801	[intelxl] Include admin command response data buffer in debug output Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-08 15:59:55 +01:00
Michael Brown	319caeaa7b	[intelxl] Identify rings consistently in debug messages Use the tail register offset (which exists for all ring types) as the ring identifier in all relevant debug messages. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-08 15:59:55 +01:00
Michael Brown	814aef68c5	[intelxl] Add missing padding bytes to receive queue context For the sake of completeness, ensure that all 32 bytes of the receive queue context are programmed (including the unused final 8 bytes). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-08 15:59:55 +01:00
Michael Brown	725f0370fa	[intelxl] Fix bit width of function number in PFFUNC_RID register Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-08 15:59:55 +01:00
Michael Brown	5d3fad5c10	[intelxl] Fix retrieval of switch configuration via admin queue Commit `8f3e648` ("[intelxl] Use one admin queue buffer per admin queue descriptor") changed the API for intelxl_admin_command() such that the caller now constructs the command directly within the next available descriptor ring entry, rather than relying on intelxl_admin_command() to copy the descriptor to and from the descriptor ring. This introduced a regression in intelxl_admin_switch(), since the second and subsequent iterations of the loop will not have constructed a valid command in the new descriptor ring entry before calling intelxl_admin_command(). Fix by constructing the command within the loop. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-08-08 15:59:55 +01:00
Michael Brown	d3c8944d5c	[acpi] Expose system MAC address via ${sysmac} setting Expose the system MAC address (if any) via the ${sysmac} setting. This allows scripts to access the system MAC address even when iPXE has decided not to apply it to a network device (e.g. because the cached DHCPACK MAC address was selected in order to match the behaviour of a previous boot stage). The setting is named ${sysmac} rather than ${acpimac} in order to allow for forward compatibility with non-ACPI mechanisms that may exist in future for specifying a system MAC address. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-06-10 13:44:40 +01:00
Michael Brown	d72c8fdc90	[cachedhcp] Allow cached DHCPACK to override a temporary MAC address When running on a system with an ACPI-provided system-specific MAC address, iPXE will apply this address to an ECM or NCM USB NIC. If iPXE has been chainloaded from a previous stage that does not understand the ACPI MAC mechanism then this can result in iPXE using a different MAC address than the previous stage, which is surprising to users. Attempt to minimise surprise by allowing the MAC address found in a cached DHCPACK packet to override a temporary MAC address, if the DHCPACK MAC address matches the network device's permanent MAC address. When a previous stage has chosen to use the network device's permanent MAC address (e.g. because it does not understand the ACPI MAC mechanism), this will cause iPXE to make the same choice. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-05-23 13:05:24 +01:00
Michael Brown	87f1796f15	[ecm] Treat ACPI MAC address as being a non-permanent MAC address When applying an ACPI-provided system-specific MAC address, apply it to netdev->ll_addr rather than netdev->hw_addr. This allows iPXE scripts to access the permanent MAC address via the ${netX/hwaddr} setting (and thereby provides scripts with a mechanism to ascertain that the NIC is using a MAC address other than its own permanent hardware address). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-05-23 12:23:53 +01:00
Michael Brown	f58b5109f4	[acpi] Support the "_RTXMAC_" format for ACPI-based MAC addresses Some newer HP products expose the host-based MAC (HBMAC) address using an ACPI method named "RTMA" returning a part-binary string of the form "_RTXMAC_#<mac>#", where "<mac>" comprises the raw MAC address bytes. Extend the existing support to handle this format alongside the older "_AUXMAC_" format (which uses a base16-encoded MAC address). Reported-by: Andreas Hammarskjöld <junior@2PintSoftware.com> Tested-by: Andreas Hammarskjöld <junior@2PintSoftware.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-03-25 16:47:06 +00:00
Michael Brown	614c3f43a1	[acpi] Add MAC address extraction self-tests Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-03-24 12:58:52 +00:00
Michael Brown	1e1b9593e6	[linux] Add stub phys_to_user() implementation For symmetry with the stub user_to_phys() implementation, provide phys_to_user() with the same underlying assumption that virtual addresses are physical (since there is no way to know the real physical address when running as a Linux userspace executable). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-03-24 12:58:52 +00:00
Michael Brown	27825e5557	[acpi] Allow for the possibility of overriding ACPI tables at link time Allow for linked-in code to override the mechanism used to locate an ACPI table, thereby opening up the possibility of ACPI self-tests. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-03-24 12:58:52 +00:00
Michael Brown	dd35475438	[efi] Support Unicode character output via framebuffer console Extend the glyph cache to include a number of dynamic entries that are populated on demand whenever a non-ASCII character needs to be drawn. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-03-15 17:30:52 +00:00
Michael Brown	ba93c9134c	[fbcon] Support Unicode character output Accumulate UTF-8 characters in fbcon_putchar(), and require the frame buffer console's .glyph() method to accept Unicode character values. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-03-15 17:27:18 +00:00
Michael Brown	2ff3385e00	[efi] Support Unicode character output via text console Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-03-15 17:09:58 +00:00
Michael Brown	7e9631b60f	[utf8] Add UTF-8 accumulation self-tests Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-03-15 16:25:13 +00:00
Michael Brown	3cd3a73261	[utf8] Add ability to accumulate Unicode characters from UTF-8 bytes Signed-off-by: Michael Brown <mcb30@ipxe.org>	2022-03-01 15:57:33 +00:00

... 2 3 4 5 6 ...

6589 Commits (dcad73ca5ad3e1fe011c52a24036f67ad69fadc1)