Commit Graph

92 Commits (914bd993fcb347a24d65e64ba838cb34f72ab14d)

Author SHA1 Message Date
Alejandro Sirgo Rica 914bd993fc src: cleanup ogGetImageInfo
Rename ogGetImageInfo to get_image_info.
Move code from ogOperations.py to obtain image data into
get_image_info.
2024-05-30 17:22:23 +02:00
Alejandro Sirgo Rica 34007857f6 src: rename legacy.py into image.py
legacy.py contais mostly functions related to system images.
Rename the file to better represent the contents in it.
2024-05-30 17:22:23 +02:00
Alejandro Sirgo Rica 9a5e83ea1a src: stop using hardcoded paths to cache image directory
Use the constant OG_CACHE_IMAGE_PATH from cache.py to obtain the
location of the directory where images are stored.
This way the path can be changed from one single point.
2024-05-30 17:22:23 +02:00
Alejandro Sirgo Rica 60803fe0ed src: add cache info to the image/restore response
Add a 'cache' field into the json payload the client sends to
the server after a restore operation so the server can update
the new cache contents.

Resquest response structure:
{
    ...
    'cache': [
        {'name': 'windows.img', 'size': 2432370213, checksum: '5d4dcc677bc19f40a647d0002f4ade90'},
        {'name': 'linux.img', 'size': 243234534213, checksum: '3eb22f888f88a55ad954f55644e1192e'}
    ]
    ...
}
2024-05-30 17:22:23 +02:00
Alejandro Sirgo Rica 8de2b785a9 src: add POST cache/delete method
Add API REST method to delete cache contents.

Resquest payload structure:
{
    'images': ['windows.img', 'linux.img']
}

The client will try to delete as many images in cache as available
with names matching the list of filenames in the 'images' field.

Resquest response structure:
{
    'cache': [
        {'name': 'windows.img', 'size': 2432370213, checksum: '5d4dcc677bc19f40a647d0002f4ade90'},
        {'name': 'linux.img', 'size': 243234534213, checksum: '3eb22f888f88a55ad954f55644e1192e'}
    ]
}
2024-05-30 17:22:23 +02:00
Alejandro Sirgo Rica e2d48ba3a0 live: add cache contents to the /refresh payload
Add the list of images in the client's cache partition in the
payload sent to the server.
The information sent is a list of {image_name, img_size, checksum}
elements where img_size is the size of the respective image in bytes.

Resquest response structure:
{
    ...
    'cache': [
        {'name': 'windows.img', 'size': 2432370213, checksum: '5d4dcc677bc19f40a647d0002f4ade90'},
        {'name': 'linux.img', 'size': 243234534213, checksum: '3eb22f888f88a55ad954f55644e1192e'}
    ]
    ...
}
2024-05-30 17:22:18 +02:00
OpenGnSys Support Team 4626383cc4 live: remove unused return value in image_restore()
Never used what configureOs() returns, remove it.
2024-05-27 11:31:19 +02:00
OpenGnSys Support Team 9952c3cc85 live: incorrect reference to image checksum file in logs
checksum file name end by .img.full.sum, not .full.sum
2024-05-25 09:57:17 +02:00
OpenGnSys Support Team fa5b37b2a6 live: rename variable that stores json body in refresh()
Just a simple cleanup.
2024-05-21 11:51:29 +02:00
OpenGnSys Support Team d109e99dbe utils: rename cache_probe() to get_cache_dev_path()
This method reports the /dev path to cache partition, rename it.

Add explicit check if blkid is successful.

And add logging to report that device path to cache is not found.
2024-05-09 11:47:06 +02:00
OpenGnSys Support Team 9ffe1c81bf live: report LINUX-SWAP instead of SWAP
ogCP expects LINUX-SWAP to specify a swap filesystem.

Add a similar workaround to the one that is done for VFAT for symmetry between
inputs and outputs that circulate over the API.
2024-05-07 11:54:39 +02:00
OpenGnSys Support Team 1ca3639389 live: rewrite log in case tiptorrent client fails
Specify that image file cannot be found in cache because tiptorrent has failed,
otherwise it is confusing.
2024-05-06 19:10:48 +02:00
OpenGnSys Support Team 8ecd57552a live: restore partprobe before building filesystem
Otherwise mkfs silently fails because OS reports out-of-sync partition table.
2024-05-06 19:08:12 +02:00
OpenGnSys Support Team 2dd5105995 live: force flush to disk after partition table is written 2024-05-06 18:51:57 +02:00
OpenGnSys Support Team f42e2ba201 live: partprobe breaks with mounted partitions
partprobe requires that all disk partitions are unmounted.

partprobe needs to be called to report the OS that the partition table
has changed, otherwise ogclient reports incorrect partition information.

iterate over the partition list and mount cache after partprobe is
called.
2024-05-06 18:33:41 +02:00
OpenGnSys Support Team 170d7e1be9 live: umount all partitions before partition setup
If new partition layout is specified, unmount cache and any other partition
under /mnt.
2024-05-06 18:32:54 +02:00
Alejandro Sirgo Rica 8171ddd15f live: fix omited error report in tip_client_get
tip_client_get raises the proper error exceptions but the except
block in _restore_image_tiptorrent overwrites the reported error.
Move the raise statements in _restore_image_tiptorrent outside
of the except block.
2024-05-06 14:23:29 +02:00
OpenGnSys Support Team 84c2944bf3 live: revisit logging for partition setup, image create and restore 2024-04-23 10:55:37 +02:00
Alejandro Sirgo Rica e19437290d live: improve exception handling in image_create
Reduce the scope of the try except block that controls the case
of deleting the image backup in case of error. Now it only covers
the section of code after backup creation and up to image
verification. Check when the Exception is an OgError to raise
with added context.

Prevent the deletion of the target image in case of error before
the backup creation.

Bundle the backup creation on its own try except block to give
more feedback on a failed backup creation.

Enables a better error management allowing unhandled
exceptions to be reported properly.
2024-04-03 13:31:10 +02:00
Alejandro Sirgo Rica cbe7f8d49c src: use explicit exception types in except Exception blocks
Capture only the relevant exception types in each except block.
The capture of the Exception type means hiding information for
unhandled error cases, even for syntax errors in the codebase.
Using a more fine grained exception filtering improves error
traceability.
2024-04-03 13:31:10 +02:00
Alejandro Sirgo Rica dfde363aa6 src: log backtrace in unhandled error cases
Log an error message in known error cases and log a backtrace
otherwise.

Define a new error type OgError to be used in all the 'raise'
blocks to define the error message to log. The exception
propagates until it reaches send_internal_server_error() where
the exception type is checked. If the type is OgError we log
the exception message. Logs the backtrace for other types.

The initial error implementation printed a backtrace everytime
an error ocurred. The next iteration changed it to only print
a backtrace in a very particular case but ended up omiting too
much information such as syntax errors or unknown error context.
The actual implementation only logs the cases we already cover in
the codebase and logs a bracktrace in the others, enabling a
better debugging experience.
2024-04-03 13:31:10 +02:00
OpenGnSys Support Team 1aba9d0923 Revert "live: improve lzop and partclone error handling"
This reverts commit 57787dab54.

Read from stderr is blocking if no data is available, revert this patch since
ogClient hangs indefinitely in lzop invocations due to races in process
execution through Popen.
2024-04-01 13:44:17 +02:00
OpenGnSys Support Team a97fd4acad live: display info logging when restoring image starts
instead of using debug level, this is very useful to track the process.
2024-03-27 17:15:59 +01:00
Alejandro Sirgo Rica 25b00bfd69 live: use .ant image as main image after image creation error
Restore image file from .ant to original file name if new image
creation fails. Remove new imagen and move the .ant image file in
place of the original as previously an error meant a rename of the
image file without a revert to keep the image available.
2024-03-27 10:34:22 +01:00
Alejandro Sirgo Rica 55fadef718 utils: drop ogCopyEfiBootLoader script
Implement a Python equivalent of ogCopyEfiBootLoader as the
function copy_efi_bootloader. This function copies the contents of
the folder of the EFI loader in the ESP into a ogBoot folder at
the root of the partition target of an image creation.
copy_efi_bootloader is a Windows only functionality.
2024-03-26 13:32:58 +01:00
Alejandro Sirgo Rica 57787dab54 live: improve lzop and partclone error handling
Control non 0 returncode of the lzop and partclone subprocess
in image creation and restoration because this means that either
lzop or partclone has failed.
The implementation must cover cases such as not enough storage
space and log errors into /tmp/command.log and the log file of
the client handling the request.
Check the returncode of lzop and partclone subprocesses and
log the stderr of the process reporting non zero returncode.
2024-03-26 13:32:48 +01:00
Alejandro Sirgo Rica 16dcc9b25b live: improve logging in image_create
Log the whole context of the error when an exception happens.
The previous exception handling was hidding important information
about the cause of the error.
2024-03-26 13:23:04 +01:00
Alejandro Sirgo Rica 049b7a5a2b src: make exception messages more contextual and explicit
Provide more information in exception messages as those are the
source of the logging messages. Add information about paths, files
or configuration related to the operation associated to the
exception.
2024-03-21 10:29:57 +01:00
Alejandro Sirgo Rica 8741b2e272 src: change generic exception types to be more explicit
Replace exception types to be more explicit about the nature of
the error.
Improve the exception raising semantics by using the 'from' keyword,
this wraps an older exception into a new one so it is still considered
the same object.
2024-03-21 10:29:57 +01:00
Alejandro Sirgo Rica 2a4ce65a20 src: centralize error logging into send_internal_server_error
Use only the exception messages as the main resource for error
messages.
The previous error code had string duplication in the form of:
	logging.error('msg here')
	raise Exception('msg here')

That approach also has the downside of having log duplication as
it had the local logging.err() and a global logging.exception()
inside send_internal_server_error capturing the exception message.
The actual code only requires raising an exception with a proper
error message.
Improve exception messages to give more error context.
Log every AssertionError as a backtrace.
Use the 'raise Exception from e' syntax to modify the a previously
raised exception 'e' into an exception with aditional context or
different type. This also prevents the message that warns about
newer exceptions being launch after an initial exception.
2024-03-21 10:29:57 +01:00
Alejandro Sirgo Rica 8012562302 utils: implement BIOS boot for windows
Create ogboot.me and ogboot.secondboot as empty files and
ogboot.firstboot with the value "iniciado" in the root of
the BIOS Windows system partition.
The files must contain data for GRUB to be able to write content,
therefore these are created containing 3072 null bytes.
The Windows boot process is handled by the "pxe" profile.
There the files ogboot.me, ogboot.firstboot and ogboot.secondboot
are used as a state machine to chose between booting Windows and
ogLive.
The first Windows boot happens if ogboot.me and ogboot.firstboot
are identical, then "iniciado" is written in ogboot.firstboot.
We skip this stage as we create ogboot.firstboot with 'iniciado'.
The second Windows boot occurs if ogboot.me and ogboot.secondboot
are boot identical, then "iniciado" is written in ogboot.secondboot.
After the Windows boot ogLive is booted.
2024-03-21 10:29:21 +01:00
Alejandro Sirgo Rica 37600660f3 live: check if cache partition is available before calling tiptorrent
The image restore command must check if the cache partition is
available. Otherwise if the user forgets to create the cache
tiptorrent fails.
2024-03-21 10:29:06 +01:00
Alejandro Sirgo Rica 4d4171e459 utils: move all boot from OS functionality into boot.py
This change is a preparative for reimplementing the BIOS boot
in order to deprecate the legacy script. All the codepaths to
boot systems located at a partition are now called from the
boot_os_at function enabling an easier structure for the incoming
code.
2024-03-08 13:03:00 +01:00
Alejandro Sirgo Rica 7f18485eff utils: improve uefi detection mechanism
Checking the existence /sys/firmware/efi as it might appear
sometimes in BIOS installs if the BIOS configuration is not
proper. Checking for the EFI partition is the safest method
to veryfy the install type.
2024-03-08 12:43:10 +01:00
Jose M. Guisado 23b4b1feb6 live: drop IniciarSesion script when uefi booting
Replace IniciarSesion script in favor of native Python code when booting
a UEFI system. This applies when running the "session" command.

WIP: Only UEFI boots Windows systems. Raise NotImplementedError
exception trying to boot a Linux system using UEFI.
2024-03-04 11:33:10 +01:00
OpenGnSys Support Team e3bb01f5f1 live: improve logging with setup command
Improve logging when setting up partition, provide more hints on progress.

Fail in case partition layout is not supported.
2024-02-22 11:33:50 +01:00
OpenGnSys Support Team 92ef3d68aa live: call partprobe on the specific disk
otherwise partprobe does its best to find the disk, according to what I see
through strace.
2024-02-19 11:56:25 +01:00
OpenGnSys Support Team dbda6abd22 poweroff: always call poweroff_oglive and _reboot_oglive
Remove leftover fallback to directly call utilities to poweroff and reboot.
2024-02-19 10:07:27 +01:00
OpenGnSys Support Team 4109bb6ecc live: split logging to warn not to turn off client during image creation
just split this log message.
2024-02-15 16:58:51 +01:00
OpenGnSys Support Team 2da8b98fff fs: check if writing md5sum to full.sum file fails
writing to file might fail (permission denied, disk full), check for errors.
2024-02-15 16:58:51 +01:00
OpenGnSys Support Team 0fc7f8f33e src: ogChangeRepo returns zero on success and -1 on error
do not return the returncode, instead return an integer.

do not use

	except CalledProcessError as e:

it causes a another exception while handling exception.

Remount the original image repository.

it should be possible to simplify this further by:

- stacking mounts, no need to umount initial repo and mount it again
  when switching to the new repo, because remount back initial repo
  might fail (!)

- use check=False and simply check for x.returncode
2024-02-15 16:22:23 +01:00
OpenGnSys Support Team 44250d0334 live: remove mbuffer leftover in image restore command
Remove mbuffer, this is never used.

mbuffer has been never been used since ogClient supports native image restore.

Originally this was used like this:

	partclone [...] | mbuffer -q -M 40M | lzop [...]

supposely to speed up partclone in case the device where the read happens is
slowier than the device that is used for writes.

See mbuffer(1) manpage examples.

In any case, this needs benchmarking to really make sure this is helping.

Remove it until that ever happens.
2024-02-15 16:22:23 +01:00
OpenGnSys Support Team 6b1f20faf3 live: log message improvements for image creation and restore
Provide more context information for debugging issues with image creation and
restore.
2024-02-15 16:22:13 +01:00
Alejandro Sirgo Rica 478c4447be src: improve error check in image_create and image_restore
cover more error cases where exceptions need to be raised.
check return code in the invoked subprocess.

restoreImageCustom has been intentionally left behind, it
is unclear what this custom script returns on success and
error.
2024-02-14 12:28:28 +01:00
Alejandro Sirgo Rica c1529c5eec src: fix whitespace in ogOperations.py
make whitespace conherent with the rest of the file contents.
2024-02-14 11:09:54 +01:00
OpenGnSys Support Team 9beb55894d live: refine existing logging
- suggest to check permissions in samba folder
- fix typo, s/filesyste/filesystem/
2023-12-18 13:47:29 +01:00
OpenGnSys Support Team 32673cf337 live: adding logging to notify that image file already exists
Just informational, provide a notice that the file already exists.
2023-12-17 20:53:43 +01:00
OpenGnSys Support Team dff126cf40 live: ensure image file exists after partclone
check that there is a file and that is accessible
2023-12-17 11:27:28 +01:00
OpenGnSys Support Team 2bddf205d9 live: display filesystem and device path if image_create() fails
display filesystem and path to device.
2023-12-16 17:15:34 +01:00
OpenGnSys Support Team 5d19ff5fe9 live: validate rw access to image folder after remount
check that it is readable and writable
2023-12-16 17:14:02 +01:00