This essay Data Compression has a total of 1543 words and 8 pages.
‘Data Compression’ just sounds complicated. Don’t be
afraid, compression is our good friend for many reasons. It saves hard drive
space. It makes data files to handle. It also cuts those immense file download
times from the Internet. Wouldn’t it be nice if we could compress all files
down to just a few bytes?
There is a limit to how much you can compress
a file. How random the file is, is the determining factor to how far it can
be compressed. If the file is completely random and no pattern can be found,
then the shortest representation of the file is the file it self. The actual
proof that proves this is at the end of my paper. The key to compressing a
file is to find some sort of exploitable pattern. Most of this paper will
be explaining those patterns that are commonly used.
Null suppression is
the most primitive form of data compression that I could find. Basically,
it says that if you have different fields that data is in (possibly a spread
sheet), and any of them have only zeros in them, then the program just eliminates
the data and goes straight from the empty data set to the next.
step up from null suppression is Run Length Encoding. Run length encoding
simply tells you how many of what you have in a row. It would change a set
of binary data like 0011100001} into what the computer reads as (2)zeros,
(3)ones, (4)zeros, 1. As you can see, it works on the same basic idea of finding
a series of 0’s (null suppression) and 1’s in this case too and abbreviating
Once the whole idea of data compression caught on, more people started
working on programs for it. From these people we got some new premises to
work with. Substitutional encoding is a big one. It was invented jointly
by two people: Abraham Lempel and Jakob Ziv. Most compression algorithms (big
word meaning roughly ‘program’) using substitutional encoding start with ‘LZ’
LZ-77 is a really neat compression in which the program
starts off just copying the source file over to the new target file, but when
it recognizes a phrase of data that it has previously written, it replaces
the second set of data in the target file with directions on how to get to
the first occurrence of it and copy it in the directions’ place. This is more
commonly called a sliding-window compression because the focus of the program
is always sliding all around the file.
LZ-78 is the compression that most
people have in their homes. Some of the more common ones are ZIP, LHA, ARJ,
ZOO, and GZIP. The main idea behind LZ-78 is a ‘dictionary’. Yet it works
quite a bit like the LZ-77. For every phrase it comes across, it indexes the
string by a number and writes it in a ‘dictionary’. When the program comes
across the same string, it uses the associated number in the ‘dictionary’ instead
of the string. The ‘dictionary’ is then written along side the compressed
file to be used in decoding.
There is a combined version of LZ-77 an
LZ-78. It is called LZFG. It only writes to the dictionary when it finds
the repeated phrase, not on every phrase. Then instead of LZFG replacing the
second set of data with directions on how to get to the first occurrence of
it, the program puts in the number reference for the dictionary’s translation.
Not only is it faster, but it compresses better because of the fact that it
doesn’t have as big of a dictionary attached.
Statistical encoding is another
one of the new compression concepts. It is an offshoot of the LZ family of
compressors; It uses basically the same style as LZFG, but instead of assigning
the numbers in order that the strings come out of the source file, statistical
compressors do some research. It calculates the number of times each string
is used and then ranks the string with the most number of uses at the top of
the hash table. The string with the least is ranked at the bottom. (A hash
table is where the rank is figured) The higher up a string is on this list,
the smaller of a reference number it gets to minimize the total bit usage.
This gives this compression just a slight edge on the others, but every little
bit helps. (ha ha -bit- )
Beware! There are a few compression programs
out there that claim wonderful compression ratios; ratios that beat the compression
limit for that file’s randomness. These programs
Topics Related to Data Compression
Archive formats, Data compression, Compress, LZ77 and LZ78, Hash function, Zip, Run-length encoding, LHA, Tar, Gzip, Lossless compression, Rzip
Essays Related to Data Compression
Overview of Video On Demand SystemsOverview of Video On Demand Systems Joseph Newcomer SCOPE INTRODUCTION THE INITIATIVE FOR WORLDWIDE MULTIMEDIA TELECONFERENCING AND VIDEO SERVER STANDARDS NEW BUSINESS IMPERATIVES STARTING WITH STANDARDS TWO STANDARDS, ONE GOAL STANDARDS FIRST SUMMARY CONTENT PREPARATION: REQUIREMENTS: CODECs/Compression Object Oriented Database Management Systems Encoding Verification SUMMARY VIDEO SERVER REQUIREMENTS LIMITATIONS PRODUCTS DISTRIBUTION NETWORK: LAN TYPES PROTOCOLS WAN TYPES SCOPE Video on demand
Reality TelevisionReality Television Today’s viewers are so crazy love reality program, not only due to the prize gaining, but also because reality TV allows viewers to fantasize about gaining status through automatic fame. Ordinary people can watch the shows, see people like themselves and imagine that they too could become celebrities by being on television. It does not matter as much that the contestants often are shown in an unfavorable light; the fact that millions of viewers are paying attention means that
Designers/Artists of the PastDesigners/Artists of the Past Assignment Two QUESTION: THE DESIGNERS/ARTISTS OF THE PAST HAVE SO MUCH TO SAY TO US, AS PRACTISING DESINGERS, TODAY. DO YOU AGREE OR DISAGREE WITH THIS STATEMENT? Art and design do coveys various types of communication, for informative purposes and for entertainment value. In order for a creator to produce something creative they must draw from their inner impulses, and the environmental and historical influences around them in order to communicate a language. Thro
Sony’s History - Through The Eyes of the Company’s Sony’s History - Through The Eyes of the Company’s Leaders Sony was founded in 1946 by Masaru Ibuka and Akio Morita. The two complemented each other with a unique blend of product innovation and marketing savvy, and formed a company that would eventually grow into a more than $60 billion global organization. In 1950, in post-war Japan, Ibuka and Morita created Sony’s first hardware device, a tape player/recorder called the G-TYPE recorder. Materials were in such high demand that the first tapes
Disk Operating SystemDisk Operating System User Interface Internet Disk Tools Links Bottom of Page Disk and System Utilities... Disk Defragmenter ScanDisk Backup Configuration Backup Emergency Recovery Utility Antivirus Programs MS-DOS in Windows 95 Why Learn DOS? The most important and often overlooked facet of protecting your data in this day and age of the graphical user interface is to learn the basics of the DOS command line. Although with Windows 95 you can happily go on forever in your daily computing with ab
The Y2K BugThe Y2K Bug Introduction The year 2000 problem could have been completely prevented had some early people envisioned the degree to which the microprocessor would change our lives. Surely, no one would have thought that in the early days of ENIAC that everything from your alarm clock to your car would be computerized. Even the IT managers of the 80\'s could not be blamed: The disk space savings from dropping the two digits of the date over 100 Million Records would represent almost 200 Megabytes!
Electronic CommerceElectronic Commerce Federal Government Electronic Commerce and - FACNET - Electronic Commerce (EC) is a concept whereby the acquisition of supplies and services is conducted via a paperless exchange of business information through computers. This new approach to procurement activity requires agreed upon formats and standards by which computers can communicate to each other and transfer information back and forth. This exchange of information is defined as Electronic Data Interchange, or EDI. The