12/16/2023

Compression definition

Data compression is a reduction in the number of bits needed to represent data. Compressing data can save storage capacity, speed up file transfer and decrease costs for storage hardware and network bandwidth.

How compression works

Compression is performed by a program that uses a formula or algorithm to determine how to shrink the size of the data. For instance, an algorithm may represent a string of bits - or 0s and 1s - with a smaller string of 0s and 1s by using a dictionary for the conversion between them. The formula may also insert a reference or pointer to a string of 0s and 1s that the program has already seen.

Text compression can be as simple as removing all unneeded characters, inserting a single repeat character to indicate a string of repeated characters and substituting a smaller bit string for a frequently occurring bit string. Data compression can reduce a text file by 50% or more of its original size.

For data transmission, compression can be performed on the data content or on the entire transmission unit, including header data. When information is sent or received via the internet, larger files - either singly or with others as part of an archive file - may be transmitted in a ZIP, GZIP or other compressed format.

Why is data compression important?

Data compression can dramatically decrease the amount of storage a file takes up. For example, with a 2:1 compression ratio, a 20 megabyte (MB) file takes up 10 MB of space. As a result of compression, administrators spend less money and less time on storage.

Compression optimizes backup storage performance and has recently shown up in primary storage data reduction. Compression will be an important method of data reduction as data continues to grow exponentially.

Virtually any type of file can be compressed, but it's important to follow best practices when choosing which ones to compress. For example, some files may already come compressed, so compressing those files would not have a significant impact.

Data compression methods: lossless and lossy compression

Compressing data can be a lossless or lossy process. Lossless compression enables the restoration of a file to its original state, without the loss of a single bit of data, when the file is uncompressed. Lossless compression is the typical approach with executables, as well as text and spreadsheet files, where the loss of words or numbers would change the information.

Lossy compression permanently eliminates bits of data that are redundant, unimportant or imperceptible. Lossy compression is useful with graphics, audio, video and images, where the removal of some data bits has little or no discernible effect on the representation of the content.

Graphic image compression can be lossy or lossless. Graphic image file formats are typically designed to compress information since the files tend to be large. JPEG is an image file format that supports lossy image compression. Formats such as GIF and PNG use lossless compression.

Compression is often compared to data deduplication, but the two techniques operate differently. Deduplication is a type of compression that looks for redundant chunks of data across a storage or file system and then replaces each duplicate chunk with a pointer to the original. Data compression algorithms reduce the size of the bit strings in a data stream; they are far smaller in scope and generally remember no more than the last megabyte or less of data.

File-level deduplication eliminates redundant files and replaces them with stubs pointing to the original file. Block-level deduplication identifies duplicate data at the subfile level. The system saves unique instances of each block, uses a hash algorithm to process them and generates a unique identifier to store them in an index. Deduplication typically looks for larger chunks of duplicate data than compression, and systems can deduplicate using fixed or variable-sized chunks.

Deduplication is most effective in environments that have a high degree of redundant data, such as virtual desktop infrastructure or storage backup systems. Data compression tends to be more effective than deduplication in reducing the size of unique information, such as images, audio, videos, databases and executable files. Many storage systems support both compression and deduplication.

Data compression and backup

Compression is often used for data that's not accessed much, as the process can be intensive and slow down systems. Administrators, though, can seamlessly integrate compression in their backup systems.

Backup is a redundant type of workload, as the process captures the same files frequently. An organization that performs full backups will often have close to the same data from backup to backup. There are major benefits to compressing data prior to backup.
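The "single repeat character" idea mentioned under how compression works is commonly known as run-length encoding. The sketch below is only an illustration of that one technique, not how any particular compression program works; the function names `rle_encode` and `rle_decode` are made up for this example. It also shows the lossless property: decoding returns exactly the original text.

```python
from itertools import groupby


def rle_encode(text: str) -> list[tuple[str, int]]:
    """Collapse each run of identical characters into a (character, count) pair."""
    return [(char, len(list(run))) for char, run in groupby(text)]


def rle_decode(pairs: list[tuple[str, int]]) -> str:
    """Expand (character, count) pairs back into the original string."""
    return "".join(char * count for char, count in pairs)


sample = "aaaabbbcca"
encoded = rle_encode(sample)
restored = rle_decode(encoded)

print(encoded)             # runs of repeated characters, in order
print(restored == sample)  # True when the round trip is lossless
```

Run-length encoding only pays off when the input contains long runs; on text with few repeated characters the list of pairs can be larger than the input, which is one reason real compressors combine several techniques.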
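To make the compression ratio concrete, here is a minimal sketch using Python's standard `zlib` module. The helper `compression_ratio` and both sample inputs are assumptions chosen for illustration. Random bytes stand in for a file that is already compressed, since neither leaves much redundancy for a lossless compressor to remove.

```python
import os
import zlib


def compression_ratio(data: bytes) -> float:
    """Return original size divided by compressed size (2.0 means a 2:1 ratio)."""
    compressed = zlib.compress(data)
    return len(data) / len(compressed)


# Highly repetitive text compresses well.
repetitive_text = b"the quick brown fox jumps over the lazy dog\n" * 1000
# Assumption: random bytes approximate data that has already been compressed.
random_bytes = os.urandom(44_000)

# Lossless round trip: decompressing returns every original byte.
assert zlib.decompress(zlib.compress(repetitive_text)) == repetitive_text

print(f"repetitive text: {compression_ratio(repetitive_text):.1f}:1")
print(f"random bytes:    {compression_ratio(random_bytes):.2f}:1")
```

The exact ratios depend on the input and the compressor settings, so the figures printed here should not be read as typical for real files.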
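Block-level deduplication, as described above, keeps one copy of each unique block and records a hash-based identifier in an index. The following is a simplified sketch under stated assumptions: fixed-size 4096-byte blocks, SHA-256 as the hash algorithm, and an in-memory dictionary as the index. The names `deduplicate`, `rebuild` and `recipe` are hypothetical, and production systems differ in many details.

```python
import hashlib


def deduplicate(data: bytes, block_size: int = 4096):
    """Split data into fixed-size blocks and store each unique block once.

    Returns (index, recipe): index maps a block's SHA-256 digest to its bytes,
    and recipe lists the digests in order so the data can be rebuilt.
    """
    index: dict[str, bytes] = {}
    recipe: list[str] = []
    for offset in range(0, len(data), block_size):
        block = data[offset:offset + block_size]
        digest = hashlib.sha256(block).hexdigest()
        index.setdefault(digest, block)  # keep only the first copy of a block
        recipe.append(digest)            # every block becomes a pointer (digest)
    return index, recipe


def rebuild(index: dict[str, bytes], recipe: list[str]) -> bytes:
    """Follow the recipe's pointers to reassemble the original data."""
    return b"".join(index[digest] for digest in recipe)


# Hypothetical backup stream: three identical blocks and one different block.
data = b"A" * 4096 * 3 + b"B" * 4096
index, recipe = deduplicate(data)

print(len(recipe), "blocks referenced,", len(index), "blocks stored")
assert rebuild(index, recipe) == data
```

Because the index stores two blocks while the recipe references four, the duplicated blocks cost only the size of their identifiers. This is why highly redundant workloads such as repeated full backups benefit most.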