'How to extract a set of *.tar.gz.(letters) files?

I have downloaded a medical data set to use in machine learning and the files are like this:

dicom_v1.tar.gz.aa
dicom_v1.tar.gz.ab
dicom_v1.tar.gz.ac
dicom_v1.tar.gz.ad

I don't know how to extract these files. When I use WinRAR or 7-Zip, it doesn't work.

It is written in the attached ReadMe file:

To decompress this data set execute in a Unix command line:
cat dicom_archive.tar.* | tar -xzvf

How can I do this in Windows 10 as I am a beginner?



Solution 1:[1]

These files have been split into chunks for distribution, so you need to put them back together before you can see whether Winrar or 7-zip will be able to extract them.

Since you're using Windows, you probably want to the Powershell get-content command (which helpfully aliased to cat btw) - and since the files are chunks of a compressed archive you probably want to use the -raw argument too.

https://shellgeek.com/use-cat-equivalent-type-command-in-windows/ is a good page to read.

I would try something like this as a starting point:

PS C:\> get-content -raw dicom_v1.tar.gz.* > dicom_v1.tar.gz

According to https://pureinfotech.com/extract-tar-gz-files-windows-10/ you should be able to run tar with the z flag natively in Powershell to extract all the files from the dicom archive.

Sources

This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.

Source: Stack Overflow

Solution Source
Solution 1 James McPherson