opengnsys_ipxe

Commit Graph

Author	SHA1	Message	Date
Michael Brown	c49acbb4d2	[http] Gracefully handle offers of multiple authentication schemes Servers may provide multiple WWW-Authenticate headers, each offering a different authentication scheme. We currently fail the request as soon as we encounter an unrecognised scheme, which prevents subsequent offers from succeeding. Fix by silently ignoring headers for schemes that we do not recognise. If no schemes are recognised then the request will eventually fail anyway due to the 401 response code. If multiple schemes are supported, arbitrarily choose the scheme appearing first within the response headers. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-11-12 18:52:03 +00:00
Ladi Prosek	0631a46a94	[crypto] Fail fast if cross-certificate source is empty In fully self-contained deployments it may be desirable to build iPXE with an empty CROSSCERT source to avoid talking to external services. Add an explicit check for this case and make validator_start_download fail immediately if the base URI is empty. Signed-off-by: Ladi Prosek <lprosek@redhat.com> Modified-by: Michael Brown <mcb30@ipxe.org> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-09-24 17:56:04 +01:00
Michael Brown	af02a8d071	[dns] Ensure DNS names are NUL-terminated when used as diagnostic strings Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-09-07 12:19:35 +01:00
Michael Brown	9faf069126	[dns] Report current DNS query as job progress status message Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-09-06 11:46:13 +01:00
Michael Brown	8047baf7c6	[netdevice] Add "hwaddr" setting Expose the underlying hardware address as a setting. For IPoIB devices, this provides scripts with access to the Infiniband GUID. Requested-by: Allen, Benjamin S. <bsallen@alcf.anl.gov> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-09-06 10:52:30 +01:00
Michael Brown	7e673a6b67	[peerdist] Gather and report peer statistics during download Record and report the number of peers (calculated as the maximum number of peers discovered for a block's segment at the time that the block download is complete), and the percentage of blocks retrieved from peers rather than from the origin server. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-09-05 23:23:22 +01:00
Michael Brown	97f0f56a34	[netdevice] Cancel all pending transmissions on any transmit error Some external code (such as the UEFI UNDI driver for the Realtek USB NIC on a Microsoft Surface Book) will block during transmission attempts and can take several seconds to report a transmit error. If there is a large queue of pending transmissions, then the accumulated time from a series of such failures can easily exceed the EFI watchdog timeout, resulting in what appears to be a system lockup followed by a reboot. Work around this problem by immediately cancelling any pending transmissions as soon as any transmit error occurs. The only expected transmit error under normal operation is ENOBUFS arising when the hardware transmit queue is full. By definition, this can happen only for drivers that do not utilise deferred transmissions, and so this new behaviour will not affect these drivers. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-09-05 12:30:04 +01:00
Michael Brown	1e4a3f5bab	[tls] Support RFC5746 secure renegotiation Support renegotiation with servers supporting RFC5746. This allows for the use of per-directory client certificates. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-07-04 19:54:34 +01:00
Michael Brown	2f12690455	[tls] Keep cipherstream window open until TLS negotiation is complete When performing a SAN boot, the plainstream window size will be zero (since this is the mechanism used internally to indicate that no data should be fetched via the initial request). This zero value currently propagates to the advertised TCP window size, which prevents the TLS negotiation from completing. Fix by ensuring that the cipherstream window is held open until TLS negotiation is complete, and only then falling back to passing through the plainstream window size. Reported-by: John Wigley <johnwigley#ipxe@acorna.co.uk> Tested-by: John Wigley <johnwigley#ipxe@acorna.co.uk> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-05-22 13:17:23 +01:00
Michael Brown	785389c2ba	[iscsi] Always send FirstBurstLength parameter As of kernel 4.11, the LIO target will propose a value for FirstBurstLength if the initiator did not do so. This is entirely redundant in our case, since FirstBurstLength is defined by RFC 3720 to be "Irrelevant when: ( InitialR2T=Yes and ImmediateData=No )" and we already enforce both InitialR2T=Yes and ImmediateData=No in our initial proposal. However, LIO (arguably correctly) complains when we do not respond to its redundant proposal of an already-irrelevant value. Fix by always proposing the default value for FirstBurstLength. Debugged-by: Patrick Seeburger <info@8bit.de> Tested-by: Patrick Seeburger <info@8bit.de> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-05-03 13:01:11 +01:00
Michael Brown	c8cae7cc17	[http] Notify data transfer interface when underlying connection is ready HTTP implements xfer_window_changed() on the underlying server connection using http_step(), which does not propagate the window change notification to the data transfer interface. This breaks the multipath-capable SAN boot code, which relies on the window change notification to discover that the HTTP block device is ready for commands to be issued. Fix by sending xfer_window_changed() in http_step() once the underlying connection has been determined to be ready. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-03-28 23:40:52 +03:00
Michael Brown	7cfdd769aa	[block] Describe all SAN devices via ACPI tables Describe all SAN devices via ACPI tables such as the iBFT. For tables that can describe only a single device (i.e. the aBFT and sBFT), one table is installed per device. For multi-device tables (i.e. the iBFT), all devices are described in a single table. An underlying SAN device connection may be closed at the time that we need to construct an ACPI table. We therefore introduce the concept of an "ACPI descriptor" which enables the SAN boot code to maintain an opaque pointer to the underlying object, and an "ACPI model" which can build tables from a list of such descriptors. This separates the lifecycles of ACPI descriptions from the lifecycles of the block device interfaces, and allows for construction of the ACPI tables even if the block device interface has been closed. For a multipath SAN device, iPXE will wait until sufficient information is available to describe all devices but will not wait for all paths to connect successfully. For example: with a multipath iSCSI boot iPXE will wait until at least one path has become available and name resolution has completed on all other paths. We do this since the iBFT has to include IP addresses rather than DNS names. We will commence booting without waiting for the inactive paths to either become available or close; this avoids unnecessary boot delays. Note that the Linux kernel will refuse to accept an iBFT with more than two NIC or target structures. We therefore describe only the NICs that are actually required in order to reach the described targets. Any iBFT with at most two targets is therefore guaranteed to describe at most two NICs. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-03-28 19:12:48 +03:00
Michael Brown	75bb948008	[tcp] Use correct length for memset() Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-03-22 15:11:05 +02:00
Michael Brown	c26c1fd07c	[infiniband] Return status code from ib_create_mi() Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-03-22 11:18:23 +02:00
Michael Brown	39ef530088	[infiniband] Return status code from ib_create_cq() and ib_create_qp() Any underlying errors arising during ib_create_cq() or ib_create_qp() are lost since the functions simply return NULL on error. This makes debugging harder, since a debug-enabled build is required to discover the root cause of the error. Fix by returning a status code from these functions, thereby allowing any underlying errors to be propagated. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-03-22 11:18:02 +02:00
Michael Brown	f17cf0ecd0	[http] Add missing check for memory allocation failure Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-03-21 15:20:59 +02:00
Michael Brown	64de7dc7fd	[slam] Avoid NULL pointer dereference in slam_pull_value() Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-03-21 14:57:36 +02:00
Michael Brown	60561d0f3d	[slam] Fix resource leak on error path Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-03-21 14:53:13 +02:00
Michael Brown	9b581158b5	[802.11] Remove redundant NULL pointer check after dereference Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-03-21 14:01:08 +02:00
Michael Brown	e500e5dd07	[nfs] Fix double free bug on error path Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-03-21 13:46:26 +02:00
Michael Brown	de2c6fa240	[dhcp] Allow vendor class to be changed in DHCP requests Allow the DHCPv4 vendor class to be specified via the "vendor-class" setting. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-03-20 13:58:59 +02:00
Vishvananda Ishaya Abrams	4524cc11bf	[iscsi] Don't close when receiving NOP-In Some iSCSI targets send NOP-In. Rather than closing the connection when we receive one, it is more user friendly to log a debug message and keep the connection open. Eventually, it would be nice if iPXE supported replying to NOP-Ins, but we might as well keep the connection open until the target disconnects us. Modified-by: Michael Brown <mcb30@ipxe.org> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-03-09 14:23:22 +00:00
Michael Brown	a29bdb3a92	[iscsi] Use intfs_shutdown() when shutting down multiple interfaces Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-03-09 12:16:15 +00:00
Michael Brown	4a4da573dd	[http] Cleanly shut down potentially looped interfaces Use intfs_shutdown() and intfs_restart() to cleanly shut down multiple interfaces that may loop back to the same object. This fixes a regression introduced by commit `daa8ed9` ("[interface] Provide intf_reinit() to reinitialise nullified interfaces") which broke the use of HTTP Basic and Digest authentication. Reported-by: murmansk <murmansk@hotmail.com> Reported-by: Brett Waldo <brettwaldo@gmail.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-02-02 16:58:00 +00:00
Michael Brown	302f1eeb80	[time] Allow timer to be selected at runtime Allow the active timer (providing udelay() and currticks()) to be selected at runtime based on probing during the INIT_EARLY stage of initialisation. TICKS_PER_SEC is now a fixed compile-time constant for all builds, and is independent of the underlying clock tick rate. We choose the value 1024 to allow multiplications and divisions on seconds to be converted to bit shifts. TICKS_PER_MS is defined as 1, allowing multiplications and divisions on milliseconds to be omitted entirely. The 2% inaccuracy in this definition is negligible when using the standard BIOS timer (running at around 18.2Hz). TIMER_RDTSC now checks for a constant TSC before claiming to be a usable timer. (This timer can be tested in KVM via the command-line option "-cpu host,+invtsc".) Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-01-26 08:17:37 +00:00
Michael Brown	70fc25ad6e	[netdevice] Limit MTU by hardware maximum frame length Separate out the concept of "hardware maximum supported frame length" and "configured link MTU", and limit the latter according to the former. In networks where the DHCP-supplied link MTU is inconsistent with the hardware or driver capabilities (e.g. a network using jumbo frames), this will result in iPXE advertising a TCP MSS consistent with a size that can actually be received. Note that the term "MTU" is typically used to refer to the maximum length excluding the link-layer headers; we adopt this usage. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-01-25 14:55:09 +00:00
Michael Brown	16aed6e5ce	[netdevice] Allow MTU to be changed at runtime Provide a settings applicator to modify netdev->max_pkt_len in response to changes to the "mtu" setting (DHCP option 26). Note that as with MAC address changes, drivers are permitted to completely ignore any changes in the MTU value. The net result will be that iPXE effectively uses the smaller of either the hardware default MTU or the software configured MTU. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-01-23 17:47:28 +00:00
Michael Brown	de85336abb	[cloud] Add ability to retrieve Google Compute Engine metadata For some unspecified "security" reason, the Google Compute Engine metadata server will refuse any requests that do not include the non-standard HTTP header "Metadata-Flavor: Google". Attempt to autodetect such requests (by comparing the hostname against "metadata.google.internal"), and add the "Metadata-Flavor: Google" header if applicable. Enable this feature in the CONFIG=cloud build, and include a sample embedded script allowing iPXE to boot from a script configured as metadata via e.g. # Create shared boot image make bin/ipxe.usb CONFIG=cloud EMBED=config/cloud/gce.ipxe # Configure per-instance boot script gcloud compute instances add-metadata <instance> \ --metadata-from-file ipxeboot=boot.ipxe Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-01-23 14:43:20 +00:00
David Decotigny	04c7befa73	[build] Return const char * from uuid_ntoa() Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-01-22 13:45:00 +00:00
Michael Brown	43b2d8eafb	[ipv4] Accept unicast packets for the local network broadcast address The ISC Kea DHCP server transmits its DHCPOFFER as a unicast packet with a broadcast IPv4 destination address (255.255.255.255). This combination is currently rejected by iPXE. Fix by explicitly accepting the local network broadcast address (255.255.255.255) as a valid unicast destination address. Reported-by: Roy Ledochowski <roy.ledochowski@hpe.com> Tested-by: Roy Ledochowski <roy.ledochowski@hpe.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2017-01-22 09:12:52 +00:00
Michael Brown	81fceaec6e	[iscsi] Avoid potential infinite loops during shutdown The command and data interfaces may be connected to the same object. Nullify the data interface before shutting down the control interface to avoid potential infinite loops. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-11-16 23:03:37 +00:00
Michael Brown	daa8ed9274	[interface] Provide intf_reinit() to reinitialise nullified interfaces Provide an abstraction intf_reinit() to restore the descriptor of a previously nullified interface. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-11-16 22:22:13 +00:00
Michael Brown	ff28b22568	[crypto] Generalise X.509 "valid" field to a "flags" field Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-08-25 15:41:57 +01:00
Michael Brown	a4c4f72297	[ipv6] Allow for multiple routers Select the IPv6 source address and corresponding router (if any) using a very simplified version of the algorithm from RFC6724: - Ignore any source address that has a smaller scope than the destination address. For example, do not use a link-local source address when sending to a global destination address. - If we have a source address which is on the same link as the destination address, then use that source address. - If we are left with multiple possible source addresses, then choose the address with the smallest scope. For example, if we are sending to a site-local destination address and we have both a global source address and a site-local source address, then use the site-local source address. - If we are still left with multiple possible source addresses, then choose the address with the longest matching prefix. For the purposes of this algorithm, we treat RFC4193 Unique Local Addresses as having organisation-local scope. Since we use only link-local scope for our multicast transmissions, this approximation should remain valid in all practical situations. Originally-implemented-by: Thomas Bächler <thomas@archlinux.org> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-07-25 15:20:22 +01:00
Michael Brown	daa1a59310	[ipv6] Rename ipv6_scope to ipv6_settings_scope Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-07-21 15:47:45 +01:00
Michael Brown	c34d1518eb	[ipv6] Create routing table based on IPv6 settings Use the IPv6 settings to construct the routing table, in a matter analogous to the construction of the IPv4 routing table. This allows for manual assignment of IPv6 addresses via e.g. set net0/ip6 2001:ba8:0:1d4::6950:5845 set net0/len6 64 set net0/gateway6 fe80::226:bff:fedd:d3c0 The prefix length ("len6") may be omitted, in which case a default prefix length of 64 will be assumed. Multiple IPv6 addresses may be assigned manually by implicitly creating child settings blocks. For example: set net0/ip6 2001:ba8:0:1d4::6950:5845 set net0.ula/ip6 fda4:2496:e992::6950:5845 Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-07-20 13:02:44 +01:00
Michael Brown	4ad3c73b30	[ipv6] Match user expectations for IPv6 settings priorities A reasonable user expectation is that ${net0/ip6} should show the "highest-priority" of the IPv6 addresses, even when multiple IPv6 addresses are active. The expected order of priority is likely to be manually-assigned addresses first, then stateful DHCPv6 addresses, then SLAAC addresses, and lastly link-local addresses. Using ${priority} to enforce an ordering is undesirable since that would affect the priority assigned to each of the net<N> blocks as a whole, so use the sibling ordering capability instead. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-07-19 17:07:53 +01:00
Michael Brown	1fdc7da435	[ipv6] Expose IPv6 link-local address settings Originally-implemented-by: Hannes Reinecke <hare@suse.de> Originally-implemented-by: Marin Hannache <git@mareo.fr> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-07-19 14:35:30 +01:00
Michael Brown	03d19cf14d	[dhcpv6] Expose IPv6 address setting acquired through DHCPv6 Originally-implemented-by: Hannes Reinecke <hare@suse.de> Originally-implemented-by: Marin Hannache <git@mareo.fr> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-07-19 01:20:34 +01:00
Michael Brown	3b783d7fd2	[ipv6] Expose IPv6 settings acquired through NDP Expose the IPv6 address (or prefix) as ${ip6}, the prefix length as ${len6}, and the router address as ${gateway6}. Originally-implemented-by: Hannes Reinecke <hare@suse.de> Originally-implemented-by: Marin Hannache <git@mareo.fr> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-07-19 00:13:00 +01:00
Michael Brown	ee54ab5be6	[ipv6] Allow settings to comprise arbitrary subsets of NDP options Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-07-19 00:13:00 +01:00
Michael Brown	129206f476	[ipv6] Rename ipv6_scope to dhcpv6_scope The settings scope ipv6_scope refers specifically to IPv6 settings that have a corresponding DHCPv6 option. Rename to dhcpv6_scope to more accurately reflect this purpose. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-07-16 12:42:08 +01:00
Michael Brown	ecfc81d76f	[settings] Create space for IPv6 in settings display order Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-07-15 17:39:49 +01:00
Michael Brown	c53a209a42	[ipv6] Perform SLAAC only during autoconfiguration We currently perform IPv6 stateless address autoconfiguration (SLAAC) in response to any router advertisement with the relevant flags set. This can result in the local IPv6 source address changing midway through a TCP connection, since our connections bind only to a local port number and do not store a local network address. In addition, this behaviour for SLAAC is inconsistent with that for DHCPv4 and stateful DHCPv6, both of which will be performed only as a result of an explicit autoconfiguration action (e.g. via the default autoboot sequence, or the "ifconf" command). Fix by ignoring router advertisements arriving outside the context of an ongoing autoconfiguration attempt. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-07-15 15:58:47 +01:00
Michael Brown	45dd627689	[ipv4] Send gratuitous ARPs whenever a new IPv4 address is applied In a busy network (such as a public cloud), IPv4 addresses may be recycled rapidly. When this happens, unidirectional traffic (such as UDP syslog) will succeed, but bidirectional traffic (such as TCP connections) may fail due to stale ARP cache entries on other nodes. The remote ARP cache expiry timeout is likely to exceed iPXE's connection timeout, meaning that boot attempts can fail before the problem is automatically resolved. Fix by sending gratuitous ARPs whenever an IPv4 address is changed, to attempt to update stale remote ARP cache entries. Note that this is not a guaranteed fix, since ARP is an unreliable protocol. We avoid sending gratuitous ARPs unconditionally, since otherwise any unrelated settings change (e.g. "set dns 192.168.0.1") would cause unexpected gratuitous ARPs to be sent. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-07-12 09:01:01 +01:00
Michael Brown	55f7a675d6	[iscsi] Treat redirection failures as fatal Debugged-by: Robin Smidsrød <robin@smidsrod.no> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-07-04 16:20:07 +01:00
Michael Brown	aeb6203811	[dhcp] Automatically generate vendor class identifier string The vendor class identifier strings in DHCP_ARCH_VENDOR_CLASS_ID are out of sync with the (correct) client architecture values in DHCP_ARCH_CLIENT_ARCHITECTURE. Fix by removing all definitions of DHCP_ARCH_VENDOR_CLASS_ID, and instead generating the vendor class identifier string automatically based on DHCP_ARCH_CLIENT_ARCHITECTURE and DHCP_ARCH_CLIENT_NDI. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-07-04 15:07:05 +01:00
Michael Brown	d7f1834b5e	[dhcpv6] Include vendor class identifier option in DHCPv6 requests RFC3315 defines DHCPv6 option 16 (vendor class identifier) but does not define any direct relationship with the roughly equivalent DHCPv4 option 60. The PXE specification predates IPv6, and the UEFI specification is expectedly vague on the subject. Examination of the reference EDK2 codebase suggests that the DHCPv6 vendor class identifier will be formatted in accordance with RFC3315, using a single vendor-class-data item in which the opaque-data field is the string as would appear in DHCPv4 option 60. RFC3315 requires the vendor class identifier to specify an IANA enterprise number, as a way of disambiguating the vendor-class-data namespace. The EDK2 code uses the value 343, described as: // TODO: IANA TBD: temporarily using Intel's Since this "TODO" has been present since at least 2010, it is probably safe to assume that it has now become a de facto standard. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-07-04 14:08:26 +01:00
Michael Brown	fda8916c83	[dhcpv6] Include RFC5970 client architecture options in DHCPv6 requests RFC5970 defines DHCPv6 options 61 (client system architecture type) and 62 (client network interface identifier), with contents equivalent to DHCPv4 options 93 and 94 respectively. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-07-04 13:18:49 +01:00
Michael Brown	3d9f094022	[dhcp] Allow for variable encapsulation of architecture-specific options DHCPv4 and DHCPv6 share some values in common for the architecture- specific options (such as the client system architecture type), but use different encapsulations: DHCPv4 has a single byte for the option length while DHCPv6 has a 16-bit field for the option length. Move the containing DHCP_OPTION() and related wrappers from the individual dhcp_arch.h files to dhcp.c, thus allowing for the architecture-specific values to be reused in dhcpv6.c. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-07-04 13:15:05 +01:00
Michael Brown	fce6117ad9	[ntp] Add simple NTP client Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-06-13 15:55:49 +01:00
Michael Brown	188789eb3c	[tcp] Send TCP keepalives on idle established connections In some circumstances, intermediate devices may lose state in a way that temporarily prevents the successful delivery of packets from a TCP peer. For example, a firewall may drop a NAT forwarding table entry. Since iPXE spends most of its time downloading files (and hence purely receiving data, sending only TCP ACKs), this can easily happen in a situation in which there is no reason for iPXE's TCP stack to generate any retransmissions. The temporary loss of connectivity can therefore effectively become permanent. Work around this problem by sending TCP keepalives after a period of inactivity on an established connection. TCP keepalives usually send a single garbage byte in sequence number space that has already been ACKed by the peer. Since we do not need to elicit a response from the peer, we instead send pure ACKs (with no garbage data) in order to keep the transmit code path simple. Originally-implemented-by: Ladi Prosek <lprosek@redhat.com> Debugged-by: Ladi Prosek <lprosek@redhat.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-06-13 09:58:32 +01:00
Michael Brown	b42e71921f	[http] Accept headers with no whitespace following the colon Reported-by: Raphael Cohn <raphael.cohn@stormmq.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-06-09 12:27:04 +01:00
Michael Brown	f42b2585fe	[http] Ignore unrecognised "Connection" header tokens Some HTTP/2 servers send the header "Connection: upgrade, close". This currently causes iPXE to fail due to the unrecognised "upgrade" token. Fix by ignoring any unrecognised tokens in the "Connection" header. Reported-by: Ján ONDREJ (SAL) <ondrejj@salstar.sk> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-05-25 15:35:43 +01:00
Michael Brown	231adda40f	[netdevice] Fix failure path in register_netdev() Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-05-23 14:17:47 +01:00
Michael Brown	40a8a5294c	[ethernet] Make LACP support configurable at build time Add a build configuration option NET_PROTO_LACP to control whether or not LACP support is included for Ethernet devices. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-04-18 10:08:46 +01:00
Michael Brown	70509e6a03	[netdevice] Return ENOENT for an unknown bus type It is possible for the preloaded UNDI device to end up with no specified bus type, since it may not be recognised as either a PCI or an ISAPnP device. This will result in a bus type value of zero, which currently results in NULL being treated as a string pointer by netdev_fetch_bustype(). Fix by returning ENOENT if an unknown bus type is specified. Reported-by: Todd Stansell <todd@stansell.org> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-29 20:59:30 +01:00
Michael Brown	f8e1678b84	[crypto] Allow cross-certificate source to be configured at build time Provide a build option CROSSCERT in config/crypto.h to allow the default cross-signed certificate source to be configured at build time. The ${crosscert} setting may still be used to reconfigure the cross-signed certificate source at runtime. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-24 19:25:03 +00:00
Michael Brown	64acfd9ddd	[arp] Validate length of ARP packet There is no practical way to generate an underlength ARP packet since an ARP packet is always padded up to the minimum Ethernet frame length (or dropped by the receiving Ethernet hardware if incorrectly padded), but the absence of an explicit check causes warnings from some analysis tools. Fix by adding an explicit check on the I/O buffer length. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-12 01:24:03 +00:00
Michael Brown	05dcb07cb2	[tls] Avoid potential out-of-bound reads in length fields Many TLS records contain variable-length fields. We currently validate the overall record length, but do so only after reading the length of the variable-length field. If the record is too short to even contain the length field, then we may read uninitialised data from beyond the end of the record. This is harmless in practice (since the subsequent overall record length check would fail regardless of the value read from the uninitialised length field), but causes warnings from some analysis tools. Fix by validating that the overall record length is sufficient to contain the length field before reading from the length field. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-11 16:09:40 +00:00
Michael Brown	e44f6dcb89	[xsigo] Add support for Xsigo virtual Ethernet (XVE) EoIB devices Add support for EoIB devices as implemented by Xsigo. Based on the public (but out-of-tree) Linux kernel drivers at https://oss.oracle.com/git/?p=linux-uek.git;a=log;h=v4.1.12-32.2.1 Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-09 08:46:24 +00:00
Michael Brown	5bcaa1e4d4	[infiniband] Make IPoIB support configurable at build time Add a build configuration option VNIC_IPOIB to control whether or not IPoIB support is included for Infiniband devices. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-09 08:43:40 +00:00
Michael Brown	076d772648	[infiniband] Retrieve GID flag from cached path entries Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-08 17:40:52 +00:00
Michael Brown	6a3ffa0114	[infiniband] Assign names to queue pairs Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-08 15:51:53 +00:00
Michael Brown	174bf6b569	[infiniband] Assign names to CMRC connections Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-08 15:51:19 +00:00
Michael Brown	5a7fd2cc90	[infiniband] Allow for the creation of multicast groups Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-08 12:23:30 +00:00
Michael Brown	14ad9cbd67	[infiniband] Parse MLID, rate, and SL from multicast membership record Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-08 12:23:30 +00:00
Michael Brown	c335f8eae4	[infiniband] Record multicast GID attachment as part of group membership Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-08 12:23:30 +00:00
Michael Brown	114a2f19a6	[infiniband] Do not use GRH for local paths Avoid including an unnecessary GRH in packets sent to unicast destinations within the local subnet. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-08 12:23:24 +00:00
Michael Brown	bd1687465c	[infiniband] Use correct transaction identifier in CM responses Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-08 12:08:58 +00:00
Michael Brown	8336186564	[infiniband] Use connection's local ID as debug message identifier Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-08 12:08:58 +00:00
Michael Brown	36c4779356	[infiniband] Use "%d" as format specifier for LIDs Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-08 12:08:58 +00:00
Michael Brown	7aef4d4c94	[infiniband] Use "%#lx" as format specifier for queue pair numbers Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-08 12:08:58 +00:00
Michael Brown	d7794dcac7	[infiniband] Assign names to Infiniband devices for debug messages Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-08 12:08:58 +00:00
Michael Brown	ff13eeb747	[infiniband] Add support for performing service record lookups Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-08 12:08:58 +00:00
Michael Brown	7544763626	[infiniband] Avoid multiple calls to ib_cmrc_shutdown() When a CMRC connection is closed, the deferred shutdown process calls ib_destroy_qp(). This will cause the receive work queue entries to complete in error (since they are being cancelled), which will in turn reschedule the deferred shutdown process. This eventually leads to ib_destroy_conn() being called on a connection that has already been freed. Fix by explicitly cancelling any pending shutdown process after the shutdown process has completed. Ironically, this almost exactly reverts commit `019d4c1` ("[infiniband] Use a one-shot process for CMRC shutdown"); prior to the introduction of one-shot processes the only way to achieve a one-shot process was for the process to cancel itself. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-08 12:07:03 +00:00
Michael Brown	fcf3b03544	[netdevice] Refuse to create duplicate network device names Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-03-07 21:04:40 +00:00
Michael Brown	4ddd3d99c3	[slam] Avoid potential division by zero Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-01-27 23:27:47 +00:00
Michael Brown	fef8e34b6f	[tcp] Guard against malformed TCP options Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-01-27 23:06:50 +00:00
Michael Brown	f0e9e55442	[tftp] Mangle initial slash on TFTP URIs TFTP URIs are intrinsically problematic, since: - TFTP servers may use either normal slashes or backslashes as a directory separator, - TFTP servers allow filenames to be specified using relative paths (with no initial directory separator), - TFTP filenames present in a DHCP filename field may use special characters such as "?" or "#" that prevent parsing as a generic URI. As of commit `7667536` ("[uri] Refactor URI parsing and formatting"), we have directly constructed TFTP URIs from DHCP next-server and filename pairs, avoiding the generic URI parser. This eliminated the problems related to special characters, but indirectly made it impossible to parse a "tftp://..." URI string into a TFTP URI with a non-absolute path. Re-introduce the convention of requiring an extra slash in a "tftp://..." URI string in order to specify a TFTP URI with an initial slash in the filename. For example: tftp://192.168.0.1/boot/pxelinux.0 => RRQ "boot/pxelinux.0" tftp://192.168.0.1//boot/pxelinux.0 => RRQ "/boot/pxelinux.0" This is ugly, but there seems to be no other sensible way to provide the ability to specify all possible TFTP filenames. A side-effect of this change is that format_uri() will no longer add a spurious initial "/" when formatting a relative URI string. This improves the console output when fetching an image specified via a relative URI. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-01-21 18:00:33 +00:00
Andrew Widdersheim	3fd81799ba	[netdevice] Add "ifname" setting Expose the network interface name (e.g. "net0") as a setting. This allows a script to obtain the name of the most recently opened network interface via ${netX/ifname}. Signed-off-by: Andrew Widdersheim <amwiddersheim@gmail.com> Modified-by: Michael Brown <mcb30@ipxe.org> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-01-18 08:50:44 +00:00
Michael Brown	8af8886d0a	[stp] Fix incorrectly disambiguated errors The three nominally-disambiguated ENOTSUP errors accidentally all used the same error disambiguator, rendering them identical. Fix by changing all three values. We avoid reusing the 0x01 disambiguator value, since that remains ambiguous in older binaries. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-01-14 12:39:35 +00:00
Michael Brown	7c6858e95d	[infiniband] Profile post work queue entry operations Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-01-10 15:44:00 +00:00
Michael Brown	0af0888832	[tftp] Do not change current working URI when TFTP server is cleared For historical reasons, iPXE sets the current working URI to the root of the TFTP server whenever the TFTP server address is changed. This was originally implemented in the hope of allowing a DHCP-provided TFTP filename to be treated simply as a relative URI. This usage turns out to be impractical since DHCP-provided TFTP filenames may include characters which would have special significance to the URI parser, and so the DHCP next-server+filename combination is now handled by the dedicated pxe_uri() function instead. The practice of setting the current working URI to the root of the TFTP server is potentially helpful for interactive uses of iPXE, allowing a user to type e.g. iPXE> dhcp Configuring (net0 52:54:00:12:34:56)... ok iPXE> chain pxelinux.0 and have the URI "pxelinux.0" interpreted as being relative to the root of the TFTP server provided via DHCP. The current implementation of tftp_apply_settings() has an unintended flaw. When the "dhcp" command is used to renew a DHCP lease (or to pick up potentially modified DHCP options), the old settings block will be unregistered before the new settings block is registered. This causes tftp_apply_settings() to believe that the TFTP server has been changed twice (to 0.0.0.0 and back again), and so the current working URI will always be set to the root of the TFTP server, even if the DHCP response provides exactly the same TFTP server as previously. Fix by doing nothing in tftp_apply_settings() whenever there is no TFTP server address. Debugged-by: Andrew Widdersheim <awiddersheim@inetu.net> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-01-09 14:51:21 +00:00
Michael Brown	74c812a68c	[http] Handle relative redirection URIs Resolve redirection URIs as being relative to the original HTTP request URI, rather than treating them as being implicitly relative to the current working URI. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2016-01-09 13:20:55 +00:00
Michael Brown	ed0d7c4f6f	[dhcp] Limit maximum number of DHCP discovery deferrals For switches which remain permanently in the non-forwarding state (or which erroneously report a non-forwarding state), ensure that iPXE will eventually give up waiting for the link to become unblocked. Originally-fixed-by: Wissam Shoukair <wissams@mellanox.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-11-10 14:05:46 +00:00
Michael Brown	7cc7e0ec86	[dhcp] Reset start time when deferring discovery If we detect (via STP) that a switch port is in a non-forwarding state, then the link is marked as being temporarily blocked and DHCP discovery will be deferred until the link becomes unblocked. The timer used to decide when to give up waiting for ProxyDHCPOFFERs is currently based on the time that DHCP discovery was started, and makes no allowances for any time spent waiting for the link to become unblocked. Consequently, if STP is used then the timeout for ProxyDHCPOFFERs becomes essentially zero. Fix by resetting the recorded start time whenever DHCP discovery is deferred due to a blocked link. Debugged-by: Sebastian Roth <sebastian.roth@zoho.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-10-30 13:29:03 +00:00
Michael Brown	3bd0d340f4	[http] Verify server port when reusing a pooled connection Reported-by: Allen <allen@gtf.org> Reported-by: Andreas Hammarskjöld <junior@2PintSoftware.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-10-02 07:54:51 +01:00
Michael Brown	0a4805bf94	[peerdist] Avoid NULL pointer dereference for plaintext blocks Avoid accidentally dereferencing a NULL cipher context pointer for plaintext blocks (which are usually messages with a block length of zero, indicating a missing block). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-09-29 01:24:36 +01:00
Michael Brown	8baefad659	[tcpip] Avoid generating positive zero for transmitted UDP checksums TCP/IP checksum fields are one's complement values and therefore have two possible representations of zero: positive zero (0x0000) and negative zero (0xffff). In RFC768, UDP over IPv4 exploits this redundancy to repurpose the positive representation of zero (0x0000) to mean "no checksum calculated"; checksums are optional for UDP over IPv4. In RFC2460, checksums are made mandatory for UDP over IPv4. The wording of the RFC is such that the UDP header is mandated to use only the negative representation of zero (0xffff), rather than simply requiring the checksum to be correct but allowing for either representation of zero to be used. In RFC1071, an example algorithm is given for calculating the TCP/IP checksum. This algorithm happens to produce only the positive representation of zero (0x0000); this is an artifact of the way that unsigned arithmetic is used to calculate a signed one's complement sum (and its final negation). A common misconception has developed (exemplified in RFC1624) that this artifact is part of the specification. Many people have assumed that the checksum field should never contain the negative representation of zero (0xffff). A sensible receiver will calculate the checksum over the whole packet and verify that the result is zero (in whichever representation of zero happens to be generated by the receiver's algorithm). Such a receiver will not care which representation of zero happens to be used in the checksum field. However, there are receivers in existence which will verify the received checksum the hard way: by calculating the checksum over the remainder of the packet and comparing the result against the checksum field. If the representation of zero used by the receiver's algorithm does not match the representation of zero used by the transmitter (and so placed in the checksum field), and if the receiver does not explicitly allow for both representations to compare as equal, then the receiver may reject packets with a valid checksum. For UDP, the combined RFCs effectively mandate that we should generate only the negative representation of zero in the checksum field. For IP, TCP and ICMP, the RFCs do not mandate which representation of zero should be used, but the misconceptions which have grown up around RFC1071 and RFC1624 suggest that it would be least surprising to generate only the positive representation of zero in the checksum field. Fix by ensuring that all of our checksum algorithms generate only the positive representation of zero, and explicitly inverting this in the case of transmitted UDP packets. Reported-by: Wissam Shoukair <wissams@mellanox.com> Tested-by: Wissam Shoukair <wissams@mellanox.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-09-10 14:46:54 +01:00
Michael Brown	be51713474	[pxe] Populate ciaddr in fake PXE Boot Server ACK packet We currently do not populate the ciaddr field in the constructed PXE Boot Server ACK packet. This causes a WDS server to respond with a broadcast packet, which is then ignored by wdsmgfw.efi since it does not match the specified IP address filter. Fix by populating ciaddr within the constructed PXE Boot Server ACK packet. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-09-01 21:24:02 +01:00
Michael Brown	8430642642	[tcpip] Allow supported address families to be detected at runtime Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-09-01 21:04:45 +01:00
Michael Brown	f0c6c4efd8	[dhcp] Do not skip ProxyDHCPREQUEST if next-server is empty We attempt to mimic the behaviour of Intel's PXE ROM by skipping the separate ProxyDHCPREQUEST if the ProxyDHCPOFFER already contains a boot filename or a PXE boot menu. Experimentation reveals that Intel's PXE ROM will also check for a non-empty next-server address alongside the boot filename. Update our test to match this behaviour. Reported-by: Wissam Shoukair <wissams@mellanox.com> Tested-by: Wissam Shoukair <wissams@mellanox.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-08-26 16:08:58 +01:00
Michael Brown	ba3695353a	[settings] Re-add "uristring" setting type Commit `09b057c` ("[settings] Remove "uristring" setting type") removed support for URI-encoded settings via the "uristring" setting type, on the basis that such encoding was no longer necessary to avoid problems with the command line parser. Other valid use cases for the "uristring" setting type do exist: for example, a password containing a '/' character expanded via chain http://username:${password:uristring}@server.name/boot.php Restore the existence of the "uristring" setting, avoiding the potentially large stack allocations that were used in the old code prior to commit `09b057c` ("[settings] Remove "uristring" setting type"). Requested-by: Robin Smidsrød <robin@smidsrod.no> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-08-25 13:31:46 +01:00
Michael Brown	0a34c2aab9	[dhcp] Ignore ProxyDHCPACKs without PXE options Suggested-by: Wissam Shoukair <wissams@mellanox.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-08-18 17:18:38 +01:00
Michael Brown	60e2b71471	[dhcp] Allow pseudo-DHCP servers to use pseudo-identifiers Some ProxyDHCP servers and PXE boot servers do not specify a DHCP server identifier via option 54. We currently work around this in a variety of ad-hoc ways: - if a ProxyDHCPACK has no server identifier then we treat it as having the correct server identifier, - if a boot server ACK has no server identifier then we use the packet's source IP address as the server identifier. Introduce the concept of a DHCP server pseudo-identifier, defined as being: - the server identifier (option 54), or - if there is no server identifier, then the next-server address (siaddr), - if there is no server identifier or next-server address, then the DHCP packet's source IP address. Use the pseudo-identifier in place of the server identifier when handling ProxyDHCP and PXE boot server responses. Originally-fixed-by: Wissam Shoukair <wissams@mellanox.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-08-18 15:43:06 +01:00
Wissam Shoukair	eb8df9a046	[ipoib] Fix a race when chain-loading undionly.kpxe in IPoIB The Infiniband link status change callback ipoib_link_state_changed() may be called while the IPoIB device is closed, in which case there will not be an IPoIB queue pair to be joined to the IPv4 broadcast group. This leads to NULL pointer dereferences in ib_mcast_attach() and ib_mcast_detach(). Fix by not attempting to join (or leave) the broadcast group unless we actually have an IPoIB queue pair. Signed-off-by: Wissam Shoukair <wissams@mellanox.com> Modified-by: Michael Brown <mcb30@ipxe.org> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-08-17 14:42:36 +01:00
Michael Brown	fd18417cf1	[peerdist] Add support for PeerDist (aka BranchCache) HTTP content encoding Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-08-17 13:24:40 +01:00
Michael Brown	d2b2a0adae	[peerdist] Add block download multiplexer Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-08-17 13:24:39 +01:00
Michael Brown	4d032d5db8	[peerdist] Add individual block download mechanism Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-08-17 13:24:39 +01:00
Michael Brown	dc9d24e7d2	[peerdist] Add segment discovery mechanism Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-08-17 13:24:39 +01:00
Michael Brown	518a98eb56	[http] Rewrite HTTP core to support content encodings Rewrite the HTTP core to allow for the addition of arbitrary content encoding mechanisms, such as PeerDist and gzip. The core now exposes http_open() which can be used to create requests with an explicitly selected HTTP method, an optional requested content range, and an optional request body. A simple wrapper provides the preexisting behaviour of creating either a GET request or an application/x-www-form-urlencoded POST request (if the URI includes parameters). The HTTP SAN interface is now implemented using the generic block device translator. Individual blocks are requested using http_open() to create a range request. Server connections are now managed via a connection pool; this allows for multiple requests to the same server (e.g. for SAN blocks) to be completely unaware of each other. Repeated HTTPS connections to the same server can reuse a pooled connection, avoiding the per-connection overhead of establishing a TLS session (which can take several seconds if using a client certificate). Support for HTTP SAN booting and for the Basic and Digest authentication schemes is now optional and can be controlled via the SANBOOT_PROTO_HTTP, HTTP_AUTH_BASIC, and HTTP_AUTH_DIGEST build configuration options in config/general.h. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-08-17 13:24:33 +01:00
Michael Brown	b1caa48e4b	[crypto] Support SHA-{224,384,512} in X.509 certificates Add support for SHA-224, SHA-384, and SHA-512 as digest algorithms in X.509 certificates, and allow the choice of public-key, cipher, and digest algorithms to be configured at build time via config/crypto.h. Originally-implemented-by: Tufan Karadere <tufank@gmail.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-08-02 16:54:24 +01:00
Michael Brown	fc7885ed9e	[tls] Report supported signature algorithms in ClientHello Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-08-02 14:17:24 +01:00
Michael Brown	1ac7434111	[tls] Do not access beyond the end of a 24-bit integer The current implementation handles big-endian 24-bit integers (which occur in several TLS record types) by treating them as big-endian 32-bit integers which are shifted by 8 bits. This can result in "Invalid read" errors when running under valgrind, if the 24-bit field happens to be exactly at the end of an I/O buffer. Fix by ensuring that we touch only the three bytes which comprise the 24-bit integer. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-08-01 00:06:58 +01:00
Michael Brown	51b99d8bc8	[peerdist] Add support for constructing and decoding discovery messages Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-07-28 16:09:14 +01:00
Michael Brown	f0d594557c	[peerdist] Include trimmed range within content information block Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-07-28 15:22:26 +01:00
Michael Brown	b20d4a1522	[netdevice] Allow network devices to disclaim IRQ support at runtime VLAN and 802.11 devices use a network device operations structure that wraps an underlying structure. For example, the vlan_operations structure wraps the network device operations structure of the underlying trunk device. This can cause false positives from the current implementation of netdev_irq_supported(), which will always report that VLAN devices support interrupts since it has no visibility into the support provided by the underlying trunk device. Fix by allowing network devices to explicitly flag that interrupts are not supported, despite the presence of an irq() method. Originally-fixed-by: Wissam Shoukair <wissams@mellanox.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-07-28 15:14:40 +01:00
Michael Brown	76338543f9	[iscsi] Add missing "break" statements iscsi_tx_done() is missing "break" statements at the end of each case. (Fortunately, this happens not to cause a bug in practice, since iscsi_login_request_done() is effectively a no-op when completing a data-out PDU.) Reported-by: Wissam Shoukair <wissams@mellanox.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-07-28 14:15:14 +01:00
Michael Brown	2bcf13f13a	[ipv4] Allow IPv4 socket addresses to include a scope ID Extend the IPv6 concept of "scope ID" (indicating the network device index) to IPv4 socket addresses, so that IPv4 multicast transmissions may specify the transmitting network device. The scope ID is not (currently) exposed via the string representation of the socket address, since IPv4 does not use the IPv6 concept of link-local addresses (which could legitimately be specified in a URI). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-07-28 13:48:29 +01:00
Michael Brown	6efcabd415	[ipv4] Redefine IP address constants to avoid unnecessary byte swapping Redefine various IPv4 address constants and testing macros to avoid unnecessary byte swapping at runtime, and slightly rename the macros to prevent code from accidentally using the old definitions. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-07-28 13:48:29 +01:00
Michael Brown	9c185e2eac	[netdevice] Avoid using zero as a network device index Avoid using zero as a network device index, so that a zero sin6_scope_id can be used to mean "unspecified" (rather than unintentionally meaning "net0"). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-07-28 13:48:29 +01:00
Michael Brown	41670ca2fe	[ipv6] Treat a missing network device name as "netX" When an IPv6 socket address string specifies a link-local or multicast address but does not specify the requisite network device name (e.g. "fe80::69ff:fe50:5845" rather than "fe80::69ff:fe50:5845%net0"), assume the use of "netX". Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-07-28 13:48:23 +01:00
Michael Brown	1a30c20daf	[802.11] Use correct SHA1_DIGEST_SIZE constant name The constant SHA1_SIZE is defined only as part of the imported AXTLS code. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-07-27 15:59:10 +01:00
Michael Brown	cbbd6b761e	[xferbuf] Generalise to handle umalloc()-based buffers Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-07-22 21:17:47 +01:00
Michael Brown	d0325b1da6	[fault] Generalise NETDEV_DISCARD_RATE fault injection mechanism Provide a generic inject_fault() function that can be used to inject random faults with configurable probabilities. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-07-22 21:17:47 +01:00
Michael Brown	9546b0c17b	[tcp] Ensure FIN is actually sent if connection is closed while idle Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-07-22 21:16:40 +01:00
Michael Brown	38afcc51ea	[tcp] Gracefully close connections during shutdown We currently do not wait for a received FIN before exiting to boot a loaded OS. In the common case of booting from an HTTP server, this means that the TCP connection is left consuming resources on the server side: the server will retransmit the FIN several times before giving up. Fix by initiating a graceful close of all TCP connections and waiting (for up to one second) for all connections to finish closing gracefully (i.e. for the outgoing FIN to have been sent and ACKed, and for the incoming FIN to have been received and ACKed at least once). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-07-04 12:51:23 +01:00
Michael Brown	8829634bd7	[ipoib] Attempt to generate ARPs as needed to repopulate REMAC cache The only way to map an eIPoIB MAC address (REMAC) to an IPoIB MAC address is to intercept an incoming ARP request or reply. If we do not have an REMAC cache entry for a particular destination MAC address, then we cannot transmit the packet. This can arise in at least two situations: - An external program (e.g. a PXE NBP using the UNDI API) may attempt to transmit to a destination MAC address that has been obtained by some method other than ARP. - Memory pressure may have caused REMAC cache entries to be discarded. This is fairly likely on a busy network, since REMAC cache entries are created for all received (broadcast) ARP requests. (We can't sensibly avoid creating these cache entries, since they are required in order to send an ARP reply, and when we are being used via the UNDI API we may have no knowledge of which IP addresses are "ours".) Attempt to ameliorate the situation by generating a semi-spurious ARP request whenever we find a missing REMAC cache entry. This will hopefully trigger an ARP reply, which would then provide us with the information required to populate the REMAC cache. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-06-29 14:50:16 +01:00
Michael Brown	d73982f098	[dhcp] Defer discovery if link is blocked If the link is blocked (e.g. due to a Spanning Tree Protocol port not yet forwarding packets) then defer DHCP discovery until the link becomes unblocked. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-06-25 17:32:24 +01:00
Michael Brown	94dbfb4374	[stp] Fix interpretaton of hello time Times in STP packets are expressed in units of 1/256 of a second. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-06-25 17:32:24 +01:00
Michael Brown	fb28c4a979	[stp] Add support for detecting Spanning Tree Protocol non-forwarding ports A fairly common end-user problem is that the default configuration of a switch may leave the port in a non-forwarding state for a substantial length of time (tens of seconds) after link up. This can cause iPXE to time out and give up attempting to boot. We cannot force the switch to start forwarding packets sooner, since any attempt to send a Spanning Tree Protocol bridge PDU may cause the switch to disable our port (if the switch happens to have the Bridge PDU Guard feature enabled for the port). For non-ancient versions of the Spanning Tree Protocol, we can detect whether or not the port is currently forwarding and use this to inform the network device core that the link is currently blocked. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-06-25 16:58:38 +01:00
Michael Brown	f3812395a2	[netdevice] Add a generic concept of a "blocked link" When Spanning Tree Protocol (STP) is used, there may be a substantial delay (tens of seconds) from the time that the link goes up to the time that the port starts forwarding packets. Add a generic concept of a "blocked link" (i.e. a link which is up but which is not expected to communicate successfully), and allow "ifstat" to indicate when a link is blocked. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-06-25 16:46:47 +01:00
Michael Brown	7e7870984b	[ethernet] Add minimal support for receiving LLC frames In some Ethernet framing variants the two-byte protocol field is used as a length, with the Ethernet header being followed by an IEEE 802.2 LLC header. The first two bytes of the LLC header are the DSAP and SSAP. If the received Ethernet packet appears to use this framing, then interpret the two-byte DSAP and SSAP as being the network-layer protocol. This allows support for receiving Spanning Tree Protocol frames (which use an LLC header with {DSAP,SSAP}=0x4242) to be added without requiring a full LLC protocol layer. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-06-25 15:28:42 +01:00
Michael Brown	c117b25e0b	[tcp] Do not shrink window when discarding received packets We currently shrink the TCP window permanently if we are ever forced (by a low-memory condition) to discard a previously received TCP packet. This behaviour was intended to reduce the number of retransmissions in a lossy network, since lost packets might potentially result in the entire window contents being retransmitted. Since commit `e0fc8fe` ("[tcp] Implement support for TCP Selective Acknowledgements (SACK)") the cost of lost packets has been reduced by around one order of magnitude, and the reduction in the window size (which affects the maximum throughput) is now the more significant cost. Remove the code which reduces the TCP maximum window size when a received packet is discarded. Reported-by: Wissam Shoukair <wissams@mellanox.com> Tested-by: Wissam Shoukair <wissams@mellanox.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-06-25 10:20:48 +01:00
Michael Brown	15759e539e	[neighbour] Return success when deferring a packet Deferral of a packet for neighbour discovery is not really an error. If we fail to discover a neighbour then the failure will eventually be reported by the call to neighbour_destroy() when any outstanding I/O buffers are discarded. The current behaviour breaks PXE booting on FreeBSD, which seems to treat the error return from PXENV_UDP_WRITE as a fatal error and so never proceeds to poll PXENV_UDP_READ (and hence never allows iPXE to receive the ARP reply and send the deferred UDP packet). Change neighbour_tx() to return success when deferring a packet. This fixes interoperability with FreeBSD and removes transient neighbour cache misses from the "ifstat" error output, while leaving genuine neighbour discovery failures visible via "ifstat" (once neighbour discovery times out, or the interface is closed). Debugged-by: Wissam Shoukair <wissams@mellanox.com> Tested-by: Wissam Shoukair <wissams@mellanox.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-05-20 15:29:36 +01:00
Michael Brown	86aa959561	[ipv6] Disambiguate received ICMPv6 errors Originally-implemented-by: Hannes Reinecke <hare@suse.de> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-05-11 12:45:14 +01:00
Michael Brown	1205721cbd	[base64] Add buffer size parameter to base64_encode() and base64_decode() Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-04-24 15:32:04 +01:00
Michael Brown	9aa8090d06	[base16] Add buffer size parameter to base16_encode() and base16_decode() The current API for Base16 (and Base64) encoding requires the caller to always provide sufficient buffer space. This prevents the use of the generic encoding/decoding functionality in some situations, such as in formatting the hex setting types. Implement a generic hex_encode() (based on the existing format_hex_setting()), implement base16_encode() and base16_decode() in terms of the more generic hex_encode() and hex_decode(), and update all callers to provide the additional buffer length parameter. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-04-24 14:41:32 +01:00
Christian Hesse	bf40b79734	[build] Add missing "const" qualifiers This fixes "initialization discards 'const' qualifier from pointer target type" warnings with GCC 5.1.0. Signed-off-by: Christian Hesse <mail@eworm.de> Modified-by: Michael Brown <mcb30@ipxe.org> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-04-24 13:03:28 +01:00
Michael Brown	d9166bbcae	[peerdist] Add support for decoding PeerDist Content Information Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-04-13 12:26:05 +01:00
Michael Brown	c492a9fd92	[netdevice] Add missing bus types to netdev_fetch_bustype() Reported-by: Robin Smidsrød <robin@smidsrod.no> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-03-18 16:42:39 +00:00
Michael Brown	57bab4e1d3	[tcpip] Fix dubious calculation of min_port Detected using sparse. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-03-13 10:19:44 +00:00
Michael Brown	e0fc8fe781	[tcp] Implement support for TCP Selective Acknowledgements (SACK) The TCP Selective Acknowledgement option (specified in RFC2018) provides a mechanism for the receiver to indicate packets that have been received out of order (e.g. due to earlier dropped packets). iPXE often operates in environments in which there is a high probability of packet loss. For example, the legacy USB keyboard emulation in some BIOSes involves polling the USB bus from within a system management interrupt: this introduces an invisible delay of around 500us which is long enough for around 40 full-length packets to be dropped. Similarly, almost all 1Gbps USB2 devices will eventually end up dropping packets because the USB2 bus does not provide enough bandwidth to sustain a 1Gbps stream, and most devices will not provide enough internal buffering to hold a full TCP window's worth of received packets. Add support for sending TCP Selective Acknowledgements. This provides the sender with more detailed information about which packets have been lost, and so allows for a more efficient retransmission strategy. We include a SACK-permitted option in our SYN packet, since experimentation shows that at least Linux peers will not include a SACK-permitted option in the SYN-ACK packet if one was not present in the initial SYN. (RFC2018 does not seem to mandate this behaviour, but it is consistent with the approach taken in RFC1323.) We ignore any received SACK options; this is safe to do since SACK is only ever advisory and we never have to send non-trivial amounts of data. Since our TCP receive queue is a candidate for cache discarding under low memory conditions, we may end up discarding data that has been reported as received via a SACK option. This is permitted by RFC2018. We follow the stricture that SACK blocks must not report data which is no longer held by the receiver: previously-reported blocks are validated against the current receive queue before being included within the current SACK block list. Experiments in a qemu VM using forced packet drops (by setting NETDEV_DISCARD_RATE to 32) show that implementing SACK improves throughput by around 400%. Experiments with a USB2 NIC (an SMSC7500) show that implementing SACK improves throughput by around 700%, increasing the download rate from 35Mbps up to 250Mbps (which is approximately the usable bandwidth limit for USB2). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-03-11 23:14:39 +00:00
Michael Brown	042a982c4d	[http] Support MD5-sess Digest authentication Microsoft IIS supports only MD5-sess for Digest authentication. Requested-by: Andreas Hammarskjöld <junior@2PintSoftware.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-03-09 13:45:09 +00:00
Michael Brown	42ea20afee	[http] Abstract out HTTP Digest hash algorithm operations Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-03-09 13:21:27 +00:00
Michael Brown	1a4e94a828	[legal] Relicense files under GPL2_OR_LATER_OR_UBDL Relicense files with kind permission from Stefan Hajnoczi <stefanha@redhat.com> alongside the contributors who have already granted such relicensing permission. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-03-05 11:40:13 +00:00
Michael Brown	93b4586447	[retry] Colourise debug output Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-03-05 11:25:54 +00:00
Michael Brown	47ad8fc1ba	[retry] Rewrite unrelicensable portions of retry.c Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-03-05 11:06:03 +00:00
Michael Brown	fbc4ba4b4e	[build] Fix the REQUIRE_SYMBOL mechanism At some point in the past few years, binutils became more aggressive at removing unused symbols. To function as a symbol requirement, a relocation record must now be in a section marked with @progbits and must not be in a section which gets discarded during the link (either via --gc-sections or via /DISCARD/). Update REQUIRE_SYMBOL() to generate relocation records meeting these criteria. To minimise the impact upon the final binary size, we use existing symbols (specified via the REQUIRING_SYMBOL() macro) as the relocation targets where possible. We use R_386_NONE or R_X86_64_NONE relocation types to prevent any actual unwanted relocation taking place. Where no suitable symbol exists for REQUIRING_SYMBOL() (such as in config.c), the macro PROVIDE_REQUIRING_SYMBOL() can be used to generate a one-byte-long symbol to act as the relocation target. If there are versions of binutils for which this approach fails, then the fallback will probably involve killing off REQUEST_SYMBOL(), redefining REQUIRE_SYMBOL() to use the current definition of REQUEST_SYMBOL(), and postprocessing the linked ELF file with something along the lines of "nm -u \| wc -l" to check that there are no undefined symbols remaining. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-03-05 00:59:38 +00:00
Michael Brown	86ae6e6c18	[build] Use REQUIRE_OBJECT() to drag in per-object configuration Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-03-05 00:57:44 +00:00
Michael Brown	c1ac466838	[iscsi] Rewrite unrelicensable portions of iscsi.c Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-03-02 20:40:31 +00:00
Michael Brown	01d16d821f	[libc] Rewrite byte-swapping code Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-03-02 16:35:37 +00:00
Michael Brown	2f020a8df3	[legal] Relicense files under GPL2_OR_LATER_OR_UBDL These files cannot be automatically relicensed by util/relicense.pl since they either contain unusual but trivial contributions (such as the addition of __nonnull function attributes), or contain lines dating back to the initial git revision (and so require manual knowledge of the code's origin). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-03-02 16:35:29 +00:00
Michael Brown	626ccf76ea	[legal] Relicense files under GPL2_OR_LATER_OR_UBDL Relicence files with kind permission from the following contributors: Alex Williamson <alex.williamson@redhat.com> Eduardo Habkost <ehabkost@redhat.com> Greg Jednaszewski <jednaszewski@gmail.com> H. Peter Anvin <hpa@zytor.com> Marin Hannache <git@mareo.fr> Robin Smidsrød <robin@smidsrod.no> Shao Miller <sha0.miller@gmail.com> Thomas Horsten <thomas@horsten.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-03-02 14:50:42 +00:00
Michael Brown	b6ee89ffb5	[legal] Relicense files under GPL2_OR_LATER_OR_UBDL Relicense files for which I am the sole author (as identified by util/relicense.pl). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-03-02 14:17:31 +00:00
Alex Williamson	47aebc24d3	[dhcp] Extract timing parameters out to config/dhcp.h iPXE uses DHCP timeouts loosely based on values recommended by the specification, but often abbreviated to reduce timeouts for reliable and/or simple network topologies. Extract the DHCP timing parameters to config/dhcp.h and document them. The resulting default iPXE behavior is exactly the same, but downstreams are now afforded the opportunity to implement spec-compliant behavior via config file overrides. Signed-off-by: Alex Williamson <alex.williamson@redhat.com> Modified-by: Michael Brown <mcb30@ipxe.org> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-02-25 16:58:43 +00:00
Michael Brown	bb1abb2b21	[ipv4] Rewrite inet_aton() The implementation of inet_aton() has an unknown provenance. Rewrite this code to avoid potential licensing uncertainty. Also move the code from core/misc.c to its logical home in net/ipv4.c, and add a few extra test cases. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-02-19 14:02:07 +00:00
Michael Brown	095c007aa3	[legal] Add missing copyright header to net/ipv4.c Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-02-18 14:16:59 +00:00
Michael Brown	f3725a86e0	[rndis] Add rndis_rx_err() Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-02-11 17:26:51 +00:00
Michael Brown	2dfdcae938	[tftp] Explicitly abort connection whenever parent interface is closed Fetching the TFTP file size is currently implemented via a custom "tftpsize://" protocol hack. Generalise this approach to instead close the TFTP connection whenever the parent data-transfer interface is closed. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2015-02-06 12:08:54 +00:00
Michael Brown	f6a3bc0aa1	[rndis] Ignore start-of-day RNDIS_INDICATE_STATUS_MSG with status 0x40020006 Windows Server 2012 R2 generates an RNDIS_INDICATE_STATUS_MSG with a status code of 0x4002006. This status code does not appear to be documented anywhere within the sphere of human knowledge. Explicitly ignore this status code in order to avoid unnecessarily cluttering the display when RNDIS debugging is enabled. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-12-20 21:33:59 +00:00
Michael Brown	639632b059	[hyperv] Assume that VMBus xfer page ranges correspond to RNDIS messages The (undocumented) VMBus protocol seems to allow for transfer page-based packets where the data payload is split into an arbitrary set of ranges within the transfer page set. The RNDIS protocol includes a length field within the header of each message, and it is known from observation that multiple RNDIS messages can be concatenated into a single VMBus message. iPXE currently assumes that the transfer page range boundaries are entirely arbitrary, and uses the RNDIS header length to determine the RNDIS message boundaries. Windows Server 2012 R2 generates an RNDIS_INDICATE_STATUS_MSG for an undocumented and unknown status code (0x40020006) with a malformed RNDIS header length: the length does not cover the StatusBuffer portion of the message. This causes iPXE to report a malformed RNDIS message and to discard any further RNDIS messages within the same VMBus message. The Linux Hyper-V driver assumes that the transfer page range boundaries correspond to RNDIS message boundaries, and so does not notice the malformed length field in the RNDIS header. Match the behaviour of the Linux Hyper-V driver: assume that the transfer page range boundaries correspond to the RNDIS message boundaries and ignore the RNDIS header length. This avoids triggering the "malformed packet" error and also avoids unnecessary data copying: since we now have one I/O buffer per RNDIS message, there is no longer any need to use iob_split(). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-12-20 21:33:53 +00:00
Michael Brown	67291465ea	[rndis] Clear receive filter when closing the device On Windows Server 2012 R2, closing and reopening the device will sometimes result in a non-functional RX datapath. The root cause is unknown. Clearing the receive filter before closing the device seems to fix the problem. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-12-20 12:06:35 +00:00
Michael Brown	4de0e273a7	[rndis] Send RNDIS_HALT_MSG The RNDIS specification requires that we send RNDIS_HALT_MSG. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-12-19 18:09:04 +00:00
Michael Brown	1d0ade42db	[rndis] Send RNDIS_INITIALISE_MSG The Hyper-V RNDIS implementation on Windows Server 2012 R2 requires that we send an explicit RNDIS initialisation message in order to get a working RX datapath. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-12-19 17:05:56 +00:00
Michael Brown	1d2b7c91f7	[rndis] Add generic RNDIS device abstraction RNDIS provides an abstraction of a network device on top of a generic packet transmission mechanism. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-12-18 14:46:38 +00:00
Michael Brown	14722c27d6	[netdevice] Fix erroneous use of free(iobuf) instead of free_iob(iobuf) Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-12-12 10:18:03 +00:00
Michael Brown	6a22170085	[dhcp] Remove obsolete dhcp_chaddr() function As of commit `03f0c23` ("[ipoib] Expose Ethernet-compatible eIPoIB link-layer addresses and headers"), all link layers have used addresses which fit within the DHCP chaddr field. The dhcp_chaddr() function was therefore made obsolete by this commit, but was accidentally left present (though unused) in the source code. Remove the dhcp_chaddr() function and the only remaining use of it, unnecessarily introduced in commit `08bcc0f` ("[dhcp] Check for matching chaddr in received DHCP packets"). Reported-by: Wissam Shoukair <wissams@mellanox.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-09-22 16:48:50 +01:00
Michael Brown	08bcc0fe01	[dhcp] Check for matching chaddr in received DHCP packets On large networks a DHCP XID collision is possible. Fix by explicitly checking the chaddr in received DHCP packets. Originally-fixed-by: Wissam Shoukair <wissams@mellanox.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-09-22 15:29:13 +01:00
Michael Brown	98d09a1e03	[netdevice] Avoid registering duplicate network devices Reject network devices which appear to be duplicates of those already available via a different underlying hardware device. On a Xen PV-HVM system, this allows us to filter out the emulated PCI NICs (which would otherwise appear alongside the netfront NICs). Note that we cannot use the Xen facility to "unplug" the emulated PCI NICs, since there is no guarantee that the OS we subsequently load will have a native netfront driver. We permit devices with the same MAC address if they are attached to the same underlying hardware device (e.g. VLAN devices). Inspired-by: Marin Hannache <git@mareo.fr> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-07-30 18:22:09 +01:00
Sven Ulland	de65a240b9	[lacp] Set "aggregatable" flag in response LACPDU Some switches do not allow an individual link (as defined in IEEE Std 802.3ad-2000 section 43.3.5) to work alone in a link aggregation group as described in section 43.3.6. This is verified on Dell's PowerConnect M6220, based on the Broadcom Strata XGS-IV chipset. Set the LACP_STATE_AGGREGATABLE flag in the actor.state field to announce link aggregation in the response LACPDU, which will have the switch enable the link aggregation group and allow frames to pass. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-07-23 11:56:04 +01:00
Michael Brown	c4af977271	[netdevice] Reset network device index when last device is unregistered When functioning as an EFI driver, drivers can be disconnected and reconnected multiple times (e.g. via the EFI shell "connect" command, or by running an executable such as ipxe.efi which will temporarily disconnect existing drivers). Minimise surprise by resetting the network device index to zero whenever the last device is unregistered. This is not foolproof, but it does handle the common case of having all devices unregistered and then reregistered in the original order. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-07-14 12:17:19 +01:00
Michael Brown	8290a95513	[build] Expose build timestamp, build name, and product names Expose the build timestamp (measured in seconds since the Epoch) and the build name (e.g. "rtl8139.rom" or "ipxe.efi"), and provide the product name and product short name in a single centralised location. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-06-24 15:32:35 +01:00
Michael Brown	e047811c85	[scsi] Improve sense code parsing Parse the sense data to extract the reponse code, the sense key, the additional sense code, and the additional sense code qualifier. Originally-implemented-by: Hannes Reinecke <hare@suse.de> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-06-03 02:04:46 +01:00
Hannes Reinecke	d630052e6f	[ethernet] Provide eth_random_addr() to generate random Ethernet addresses Modified-by: Michael Brown <mcb30@ipxe.org> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-06-01 23:32:24 +01:00
Michael Brown	7627f6c071	[ipv6] Avoid potentially copying from a NULL pointer in ipv6_tx() If ipv6_tx() is called with a non-NULL network device, a NULL or unspecified source address, and a destination address which does not match any routing table entry, then it will attempt to copy the source address from a NULL pointer. I don't think that there is currently any code path which could trigger this behaviour, but we should probably ensure that it can never happen. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-05-23 14:11:17 +01:00
Michael Brown	3a1adea036	[ipv6] Include network device when transcribing multicast addresses Destination multicast addresses require a sin6_scope_id, which should therefore be transcribed to a network device name by ipv6_sock_ntoa(). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-05-23 14:11:17 +01:00
Michael Brown	6c7146695d	[ipv6] Do not set sin6_scope_id on source address The transmitting network device is specified via the destination address, not the source address. There is no reason to set sin6_scope_id on the source address. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-05-23 14:11:17 +01:00
Michael Brown	6206f8f0f9	[dhcpv6] Do not set sin6_scope_id on the unspecified client socket address Setting sin6_scope_id to a non-zero value will cause the check against the "empty socket address" in udp_demux() to fail, and incoming DHCPv6 responses on interfaces other than net0 will be rejected with a spurious "No UDP connection listening on port 546" error. The transmitting network device is specified via the destination address, not the source address. Fix by simply not setting sin6_scope_id on the client socket address. Reported-by: Anton D. Kachalov <mouse@yandex-team.ru> Tested-by: Anton D. Kachalov <mouse@yandex-team.ru> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-05-23 14:11:11 +01:00
Marin Hannache	f4e069bf2e	[nfs] Rewrite NFS URI handling Get the NFS URI manipulation code out of nfs_open.c. The resulting code is now much more readable. Signed-off-by: Marin Hannache <git@mareo.fr> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-05-18 21:53:39 +01:00
Michael Brown	e5878ce65d	[syslog] Strip invalid characters from hostname Avoid generating syntactically invalid log messages by ensuring that invalid characters are not present in the hostname. In particular, ensure that any whitespace is stripped, since whitespace functions as a field separator for syslog messages. Reported-by: Alex Davies <adavies@jumptrading.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-05-16 13:45:52 +01:00
Marin Hannache	ca93505a78	[nfs] Fix an invalid free() when loading a regular (non-symlink) file An invalid free() was ironically introduced by fixing another invalid free in commit `7aa69c4` ("[nfs] Fix an invalid free() when loading a symlink"). Signed-off-by: Marin Hannache <git@mareo.fr> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-05-16 11:01:39 +01:00
Michael Brown	d28bb51f44	[tcp] Defer sending ACKs until all received packets have been processed When running inside a virtual machine (or when using the UNDI driver), transmitting packets can be expensive. When we receive several packets in one poll (e.g. because a slow BIOS timer interrupt routine has caused us to fall behind in processing), we can safely send just a single ACK to cover all of the received packets. This reduces the time spent transmitting and allows us to clear the backlog much faster. Various RFCs (starting with RFC1122) state that there should be an ACK for at least every second segment. We choose not to enforce this rule. Under normal operation each poll should find at most one received packet, and we will then not delay any ACKs. We delay (i.e. omit) ACKs only when under sufficiently heavy load that we are finding multiple packets per poll; under these conditions it is important to clear the backlog quickly since any delay may lead to dropped packets. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-05-12 17:19:26 +01:00
Marin Hannache	7aa69c4d0d	[nfs] Fix an invalid free() when loading a symlink Signed-off-by: Marin Hannache <git@mareo.fr> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-05-12 17:09:37 +01:00
Michael Brown	e825a96a25	[http] Profile receive datapath Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-04-28 12:31:23 +01:00
Michael Brown	767f2acb98	[tcp] Profile transmit and receive datapaths Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-04-28 12:30:57 +01:00
Michael Brown	f65c81b1d0	[ipv4] Profile transmit and receive datapaths Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-04-28 12:30:30 +01:00
Michael Brown	2c820d684a	[netdevice] Profile common operations Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-04-27 23:14:47 +01:00
Michael Brown	bc8ca6b8ce	[crypto] Generalise X.509 cache to a full certificate store Expand the concept of the X.509 cache to provide the functionality of a certificate store. Certificates in the store will be automatically used to complete certificate chains where applicable. The certificate store may be prepopulated at build time using the CERT=... build command line option. For example: make bin/ipxe.usb CERT=mycert1.crt,mycert2.crt Certificates within the certificate store are not implicitly trusted; the trust list is specified using TRUST=... as before. For example: make bin/ipxe.usb CERT=root.crt TRUST=root.crt This can be used to embed the full trusted root certificate within the iPXE binary, which is potentially useful in an HTTPS-only environment in which there is no HTTP server from which to automatically download cross-signed certificates or other certificate chain fragments. This usage of CERT= extends the existing use of CERT= to specify the client certificate. The client certificate is now identified automatically by checking for a match against the private key. For example: make bin/ipxe.usb CERT=root.crt,client.crt TRUST=root.crt KEY=client.key Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-03-28 17:09:40 +00:00
Michael Brown	e1ebc50f81	[crypto] Remove dynamically-allocated storage for certificate OCSP URI Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-03-25 16:30:43 +00:00
Michael Brown	01fa7efa38	[crypto] Remove dynamically-allocated storage for certificate name iPXE currently allocates a copy the certificate's common name as a string. This string is used by the TLS and CMS code to check certificate names against an expected name, and also appears in debugging messages. Provide a function x509_check_name() to centralise certificate name checking (in preparation for adding subjectAlternativeName support), and a function x509_name() to provide a name to be used in debugging messages, and remove the dynamically allocated string. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-03-25 16:30:43 +00:00
Michael Brown	e845b7da9b	[http] Accept Content-Length header with trailing whitespace At least one HTTP server (Google's OCSP responder) has been observed to generate a Content-Length header with trailing whitespace. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-03-25 15:46:14 +00:00
Michael Brown	87465258ab	[netdevice] Notify upper-layer drivers when RX processing is (un)frozen Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-03-14 14:05:38 +00:00
Michael Brown	42bf3b9aa9	[http] Automatically retry request on a 503 Service Unavailable A web server may return a 503 Service Unavailable response along with a Retry-After header to direct the client to retry the request at a later time. The Retry-After header may be a number of seconds, or a full HTTP timestamp (e.g. "Fri, 7 Mar 2014 17:22:14 GMT"). We have no reasonable way of parsing a full HTTP timestamp; if the server chooses to use this format then we simply retry after a fixed 5-second delay. As per RFC 2616, in the absence of a Retry-After header we treat a status code of 503 Service Unavailable as being equivalent to 500 Internal Server Error, and immediately fail the request. Requested-by: Suresh Sundriyal <ssundriy@vmware.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-03-07 17:32:26 +00:00
Michael Brown	0d657b8e94	[http] Use a retry timer to trigger retried requests Use a retry timer to allow for the possibility of deferring a retried request. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-03-07 17:32:22 +00:00
Michael Brown	859664ea2a	[tcp] Update window even if ACK does not acknowledge new data iPXE currently ignores ACKs which do not acknowledge any new data. (In particular, it does not stop the retransmission timer; this is done to prevent an immediate retransmission if a duplicate ACK is received while the transmit queue is non-empty.) If a peer provides a window size of zero and later sends a duplicate ACK to update the window size, this update will therefore be ignored and iPXE will never be able to transmit data. Fix by updating the window size even for ACKs which do not acknowledge new data. Reported-by: Wissam Shoukair <wissams@mellanox.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-03-07 17:30:01 +00:00
Michael Brown	f17a30d547	[netdevice] Mark devices as open before calling open() method When opening a VLAN device, vlan_open() will call netdev_open() on the trunk device. This will result in a call to netdev_notify(), which will cause vlan_notify() to call vlan_sync() on the original VLAN device, which will see that the trunk device is now open but the VLAN device apparently isn't (since it has not yet been flagged as open by netdev_open()). The upshot is a second attempt to open the VLAN device, which will result in an erroneous second call to vlan_open(). This convoluted chain of events then terminates harmlessly since vlan_open() calls netdev_open() on the trunk device, which just returns immediately since the trunk device is by now flagged as being already open. Prevent this from happening by having netdev_open() flag the device as open prior to calling the device's open() method, and reflagging it as closed if the open() method fails. Originally-fixed-by: Wissam Shoukair <wissams@mellanox.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-03-05 15:25:08 +00:00
Michael Brown	e191298a1d	[tcp] Calculate correct MSS from peer address iPXE currently advertises a fixed MSS of 1460, which is correct only for IPv4 over Ethernet. For IPv6 over Ethernet, the value should be 1440 (allowing for the larger IPv6 header). For non-Ethernet link layers, the value should reflect the MTU of the underlying network device. Use tcpip_mtu() to calculate the transport-layer MTU associated with the peer address, and calculate the MSS to allow for an optionless TCP header as per RFC 6691. As a side benefit, we can now fail a connection immediately with a meaningful error message if we have no route to the destination address. Reported-by: Anton D. Kachalov <mouse@yandex-team.ru> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-03-04 13:23:29 +00:00
Michael Brown	6414b5ca03	[tcpip] Provide tcpip_mtu() to determine the maximum transmission unit Provide the function tcpip_mtu() to allow external code to determine the (transport-layer) maximum transmission unit for a given socket address. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-03-04 13:13:54 +00:00
Michael Brown	db67de6f31	[tcpip] Provide tcpip_netdev() to determine the transmitting network device Provide the function tcpip_netdev() to allow external code to determine the transmitting network device for a given socket address. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-03-04 13:02:58 +00:00
Michael Brown	11963c4f5f	[tcpip] Add IP statistics collection as per RFC 4293 Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-03-02 20:33:35 +00:00
Michael Brown	7667536527	[uri] Refactor URI parsing and formatting Add support for parsing of URIs containing literal IPv6 addresses (e.g. "http://[fe80::69ff:fe50:5845%25net0]/boot.ipxe"). Duplicate URIs by directly copying the relevant fields, rather than by formatting and reparsing a URI string. This relaxes the requirements on the URI formatting code and allows it to focus on generating human-readable URIs (e.g. by not escaping ':' characters within literal IPv6 addresses). As a side-effect, this allows relative URIs containing parameter lists (e.g. "../boot.php##params") to function as expected. Add validity check for FTP paths to ensure that only printable characters are accepted (since FTP is a human-readable line-based protocol with no support for character escaping). Construct TFTP next-server+filename URIs directly, rather than parsing a constructed "tftp://..." string, Add self-tests for URI functions. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-02-27 13:32:53 +00:00
Michael Brown	ced4f8d1d3	[dhcp] Copy exactly the required length when resizing DHCP options When resizing DHCP options, iPXE currently calculates the length to be copied by subtracting the destination pointer from the end of buffer pointer. This works and guarantees not to write beyond the end of the buffer, but may end up reading beyond the end of the buffer. Fix by calculating the required length exactly. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-02-26 16:44:05 +00:00
Michael Brown	ff341c1861	[dns] Update end-of-name pointer after processing CNAME record Commit `d4c0226` ("[dns] Support DNS search lists") introduced a regression when handling CNAME records resolving to names longer than the original name. The "end of name" offset stored in dns->offset was not updated to reflect the length of the new name, causing dns_question() to append the (empty) search suffix at an incorrect offset within the name buffer, resulting in a mangled DNS name. In the case of a CNAME record resolving to a name shorter than or equal in length to the original name, then the mangling would occur in an unused portion of the name buffer. In the common case of a name server returning the A (or AAAA) record along with the CNAME record, this would cause name resolution to succeed despite the mangling. (If the name server did not return the A or AAAA record along with the CNAME record, then the mangling would be revealed by the subsequent invalid query packet.) Reported-by: Nicolas Sylvain <nsylvain@gmail.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-02-26 16:04:34 +00:00
Michael Brown	d4c0226a6c	[dns] Support DNS search lists Update the DNS resolver to support DNS search lists (as provided by DHCP option 119, DHCPv6 option 24, or NDP option 31). Add validation code to ensure that parsing of DNS packets does not overrun the input, get stuck in infinite loops, or (worse) write beyond the end of allocated buffers. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2014-02-05 14:56:49 +00:00
Michael Brown	99c679696a	[ipv6] Expose NDP-provided settings (including the DNS server) Signed-off-by: Michael Brown <mcb30@ipxe.org>	2013-12-05 16:44:50 +00:00
Michael Brown	4a6c453b5b	[dhcpv6] Add DHCPv6 "filename" setting Signed-off-by: Michael Brown <mcb30@ipxe.org>	2013-12-05 15:12:50 +00:00
Michael Brown	f3e5df3162	[settings] Merge SETTING_IPv4 and SETTING_IPv6 Allow for equivalent IPv4 and IPv6 settings (which requires equivalent settings to be adjacent within the settings list). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2013-12-05 15:11:15 +00:00
Michael Brown	b0942534eb	[settings] Force settings into alphabetical order within sections Signed-off-by: Michael Brown <mcb30@ipxe.org>	2013-12-05 12:43:28 +00:00

... 2 3 4 5 6 ...

1288 Commits (aa49ce5b1dce3dfbf97bf67ef95524e4710c99f5)