Context Navigation

← Previous Change
Next Change →

decompress_bunzip2.c

Timestamp:

Nov 6, 2007, 11:01:53 AM (16 years ago)

Author:

Bruno Cornec

Message:

Better output for mindi-busybox revision
Remove dummy file created on NFS - report from Arnaud Tiger <arnaud.tiger_at_hp.com>
strace useful for debug
fix new versions for pb (2.0.0 for mindi and 1.7.2 for mindi-busybox)
fix build process for mindi-busybox + options used in that version (dd for label-partitions-as-necessary)
fix typo in label-partitions-as-necessary which doesn't seem to work
Update to busybox 1.7.2
perl is now required at restore time to support uuid swap partitions (and will be used for many other thigs

in the future for sure)

next mindi version will be 2.0.0 due to all the changes made in it (udev may break working distros)
small optimization in mindi on keyboard handling (one single find instead of multiple)
better interaction for USB device when launching mindi manually
attempt to automatically guess block disk size for ramdisk
fix typos in bkphw
Fix the remaining problem with UUID support for swap partitions
Updates mondoarchive man page for USB support
Adds preliminary Hardware support to mindi (Proliant SSSTK)
Tries to add udev support also for rhel4
Fix UUID support which was still broken.
Be conservative in test for the start-nfs script
Update config file for mindi-busybox for 1.7.2 migration
Try to run around a busybox bug (1.2.2 pb on inexistant links)
Add build content for mindi-busybox in pb
Remove distributions content for mindi-busybox
Fix a warning on inexistant raidtab
Solve problem on tmpfs in restore init (Problem of inexistant symlink and busybox)
Create MONDO_CACHE and use it everywhere + creation at start
Really never try to eject a USB device
Fix a issue with &> usage (replaced with 1> and 2>)
Adds magic file to depllist in order to have file working + ldd which helps for debugging issues
tty modes correct to avoid sh error messages
Use ext3 normally and not ext2 instead
USB device should be corrected after reading (take 1st part)
Adds a mount_USB_here function derived from mount_CDROM_here
usb detection place before /dev detection in device name at restore time
Fix when restoring from USB: media is asked in interactive mode
Adds USB support for mondorestore
mount_cdrom => mount_media
elilo.efi is now searched throughout /boot/efi and not in a fixed place as there is no standard
untar-and-softlink => untar (+ interface change)
suppress useless softlinks creation/removal in boot process
avoids udevd messages on groups
Increase # of disks to 99 as in mindi at restore time (should be a conf file parameter)
skip existing big file creation
seems to work correctly for USB mindi boot
Adds group and tty link to udev conf
Always load usb-torage (even 2.6) to initiate USB bus discovery
Better printing of messages
Attempt to fix a bug in supporting OpenSusE 10.3 kernel for initramfs (mindi may now use multiple regex for kernel initrd detection)
Links were not correctly done as non relative for modules in mindi
exclusion of modules denied now works
Also create modules in their ordinary place, so that classical modprobe works + copy modules.dep
Fix bugs for DENY_MODS handling
Add device /dev/console for udev
ide-generic should now really be excluded
Fix a bug in major number for tty
If udev then adds modprobe/insmod to rootfs
tty0 is also cretaed with udev
ide-generic put rather in DENY_MODS
udevd remove from deplist s handled in mindi directly
better default for mindi when using --usb
Handles dynamically linked busybox (in case we want to use it soon ;-)
Adds fixed devices to create for udev
ide-generic should not be part of the initrd when using libata v2
support a dynamically linked udev (case on Ubuntu 7.10 and Mandriva 2008.0 so should be quite generic) This will give incitation to move to dyn. linked binaries in the initrd which will help for other tasks (ia6 4)
Improvement in udev support (do not use cl options not available in busybox)
Udev in mindi
- auto creation of the right links at boot time with udev-links.conf(from Mandriva 2008.0)
- rework startup of udev as current makes kernel crash (from Mandriva 2008.0)
- add support for 64 bits udev
Try to render MyInsmod silent at boot time
Adds udev support (mandatory for newest distributions to avoid remapping of devices in a different way as on the original system)
We also need vaft format support for USB boot
Adds libusual support (Ubuntu 7.10 needs it for USB)
Improve Ubuntu/Debian keyboard detection and support
pbinit adapted to new pb (0.8.10). Filtering of docs done in it
Suppress some mondo warnings and errors on USB again
Tries to fix lack of files in deb mindi package
Verify should now work for USB devices
More log/mesages improvement for USB support
- Supress g_erase_tmpdir_and_scratchdir
Improve some log messages for USB support
Try to improve install in mindi to avoid issues with isolinux.cfg not installed vene if in the pkg :-(
Improve mindi-busybox build
In conformity with pb 0.8.9
Add support for Ubuntu 7.10 in build process
Add USB Key button to Menu UI (CD streamer removed)
Attempt to fix error messages on tmp/scratch files at the end by removing those dir at the latest possible.
Fix a bug linked to the size of the -E param which could be used (Arnaud Tiger/René Ribaud).
Integrate ~/.pbrc content into mondorescue.pb (required project-builder >= 0.8.7)
Put mondorescue in conformity with new pb filtering rules
Add USB support at restore time (no test done yet). New start-usb script PB varibale added where useful
Unmounting USB device before removal of temporary scratchdir
Stil refining USB copy back to mondo (one command was not executed)
No need to have the image subdor in the csratchdir when USB.
umount the USB partition before attempting to use it
Remove useless copy from mindi to mondo at end of USB handling

(risky merge, we are raising the limits of 2 diverging branches. The status of stable is not completely sure as such. Will need lots of tests, but it's not yet done :-()
(merge -r1692:1769 $SVN_M/branches/2.2.5)

File:

: 1 edited

branches/stable/mindi-busybox/archival/libunarchive/decompress_bunzip2.c (modified) (33 diffs)

Legend:

: Unmodified
: Added
: Removed

branches/stable/mindi-busybox/archival/libunarchive/decompress_bunzip2.c

-              r821
+              r1770
  */
-#include <setjmp.h>
-#include <stdio.h>
-#include <stdlib.h>
-#include <string.h>
-#include <unistd.h>
-#include <limits.h>
 #include "libbb.h"
 #include "unarchive.h"
 /* Constants for Huffman coding */
 #define MAX_GROUPS          6
 #define GROUP_SIZE          50      /* 64 would have been more efficient */
 #define MAX_HUFCODE_BITS    20      /* Longest Huffman code allowed */
 #define MAX_SYMBOLS         258     /* 256 literals + RUNA + RUNB */
 #define SYMBOL_RUNA         0
 #define SYMBOL_RUNB         1
+#define MAX_GROUPS          6
+#define GROUP_SIZE          50      /* 64 would have been more efficient */
+#define MAX_HUFCODE_BITS    20      /* Longest Huffman code allowed */
+#define MAX_SYMBOLS         258     /* 256 literals + RUNA + RUNB */
+#define SYMBOL_RUNA         0
+#define SYMBOL_RUNB         1
 /* Status return values */
 #define RETVAL_OK                       0
 #define RETVAL_LAST_BLOCK               (-1)
 #define RETVAL_NOT_BZIP_DATA            (-2)
 #define RETVAL_UNEXPECTED_INPUT_EOF     (-3)
 #define RETVAL_UNEXPECTED_OUTPUT_EOF    (-4)
 #define RETVAL_DATA_ERROR               (-5)
 #define RETVAL_OUT_OF_MEMORY            (-6)
 #define RETVAL_OBSOLETE_INPUT           (-7)
+#define RETVAL_OK                       0
+#define RETVAL_LAST_BLOCK               (-1)
+#define RETVAL_NOT_BZIP_DATA            (-2)
+#define RETVAL_UNEXPECTED_INPUT_EOF     (-3)
+#define RETVAL_UNEXPECTED_OUTPUT_EOF    (-4)
+#define RETVAL_DATA_ERROR               (-5)
+#define RETVAL_OUT_OF_MEMORY            (-6)
+#define RETVAL_OBSOLETE_INPUT           (-7)
 /* Other housekeeping constants */
 #define IOBUF_SIZE          4096
+#define IOBUF_SIZE          4096
 /* This is what we know about each Huffman coding group */
 struct group_data {
     /* We have an extra slot at the end of limit[] for a sentinal value. */
     int limit[MAX_HUFCODE_BITS+1],base[MAX_HUFCODE_BITS],permute[MAX_SYMBOLS];
+    int limit[MAX_HUFCODE_BITS+1], base[MAX_HUFCODE_BITS], permute[MAX_SYMBOLS];
     int minLen, maxLen;
 };
 …
    memory that persists between calls to bunzip */
 typedef struct {
+struct bunzip_data {
     /* State for interrupting output loop */
+    int writeCopies,writePos,writeRunCountdown,writeCount,writeCurrent;
+    int writeCopies, writePos, writeRunCountdown, writeCount, writeCurrent;
     /* I/O tracking data (file handles, buffers, positions, etc.) */
+    int in_fd,out_fd,inbufCount,inbufPos /*,outbufPos*/;
+    int in_fd, out_fd, inbufCount, inbufPos /*, outbufPos*/;
     unsigned char *inbuf /*,*outbuf*/;
     unsigned int inbufBitCount, inbufBits;
+    unsigned inbufBitCount, inbufBits;
     /* The CRC values stored in the block header and calculated from the data */
     uint32_t headerCRC, totalCRC, writeCRC;
+    uint32_t *crc32Table;
     /* Intermediate buffer and its size (in bytes) */
+    unsigned int *dbuf, dbufSize;
+    /* These things are a bit too big to go on the stack */
+    unsigned *dbuf, dbufSize;
+    /* For I/O error handling */
+    jmp_buf jmpbuf;
+    /* Big things go last (register-relative addressing can be larger for big offsets */
+    uint32_t crc32Table[256];
     unsigned char selectors[32768];         /* nSelectors=15 bits */
     struct group_data groups[MAX_GROUPS];   /* Huffman coding tables */
+    /* For I/O error handling */
+    jmp_buf jmpbuf;
+} bunzip_data;
+};
+/* typedef struct bunzip_data bunzip_data; -- done in .h file */
 /* Return the next nnn bits of input.  All reads from the compressed input
    are done through this function.  All reads are big endian */
 static unsigned int get_bits(bunzip_data *bd, char bits_wanted)
+static unsigned get_bits(bunzip_data *bd, char bits_wanted)
+{
     unsigned int bits=0;
+    unsigned bits = 0;
     /* If we need to get more data from the byte buffer, do so.  (Loop getting
        one byte at a time to enforce endianness and avoid unaligned access.) */
     while (bd->inbufBitCount<bits_wanted) {
+    while (bd->inbufBitCount < bits_wanted) {
         /* If we need to read more data from file into byte buffer, do so */
+        if(bd->inbufPos==bd->inbufCount) {
+            if((bd->inbufCount = read(bd->in_fd, bd->inbuf, IOBUF_SIZE)) <= 0)
+                longjmp(bd->jmpbuf,RETVAL_UNEXPECTED_INPUT_EOF);
+            bd->inbufPos=0;
+        if (bd->inbufPos == bd->inbufCount) {
+            /* if "no input fd" case: in_fd == -1, read fails, we jump */
+            bd->inbufCount = read(bd->in_fd, bd->inbuf, IOBUF_SIZE);
+            if (bd->inbufCount <= 0)
+                longjmp(bd->jmpbuf, RETVAL_UNEXPECTED_INPUT_EOF);
+            bd->inbufPos = 0;
+        }
         /* Avoid 32-bit overflow (dump bit buffer to top of output) */
         if(bd->inbufBitCount>=24) {
             bits=bd->inbufBits&((1<<bd->inbufBitCount)-1);
             bits_wanted-=bd->inbufBitCount;
             bits<<=bits_wanted;
             bd->inbufBitCount=0;
+        if (bd->inbufBitCount >= 24) {
+            bits = bd->inbufBits & ((1 << bd->inbufBitCount) - 1);
+            bits_wanted -= bd->inbufBitCount;
+            bits <<= bits_wanted;
+            bd->inbufBitCount = 0;
+        }
         /* Grab next 8 bits of input from buffer. */
         bd->inbufBits=(bd->inbufBits<<8)|bd->inbuf[bd->inbufPos++];
         bd->inbufBitCount+=8;
+        bd->inbufBits = (bd->inbufBits<<8) | bd->inbuf[bd->inbufPos++];
+        bd->inbufBitCount += 8;
+    }
     /* Calculate result */
     bd->inbufBitCount-=bits_wanted;
     bits|=(bd->inbufBits>>bd->inbufBitCount)&((1<<bits_wanted)-1);
+    bd->inbufBitCount -= bits_wanted;
+    bits |= (bd->inbufBits >> bd->inbufBitCount) & ((1 << bits_wanted) - 1);
     return bits;
 …
+{
     struct group_data *hufGroup;
     int dbufCount,nextSym,dbufSize,groupCount,*base,*limit,selector,
         i,j,k,t,runPos,symCount,symTotal,nSelectors,byteCount[256];
+    int dbufCount, nextSym, dbufSize, groupCount, *base, *limit, selector,
+        i, j, k, t, runPos, symCount, symTotal, nSelectors, byteCount[256];
     unsigned char uc, symToByte[256], mtfSymbol[256], *selectors;
     unsigned int *dbuf,origPtr;
     dbuf=bd->dbuf;
     dbufSize=bd->dbufSize;
     selectors=bd->selectors;
+    unsigned *dbuf, origPtr;
+    dbuf = bd->dbuf;
+    dbufSize = bd->dbufSize;
+    selectors = bd->selectors;
     /* Reset longjmp I/O error handling */
     i=setjmp(bd->jmpbuf);
     if(i) return i;
+    i = setjmp(bd->jmpbuf);
+    if (i) return i;
     /* Read in header signature and CRC, then validate signature.
        (last block signature means CRC is for whole file, return now) */
     i = get_bits(bd,24);
     j = get_bits(bd,24);
     bd->headerCRC=get_bits(bd,32);
+    i = get_bits(bd, 24);
+    j = get_bits(bd, 24);
+    bd->headerCRC = get_bits(bd, 32);
     if ((i == 0x177245) && (j == 0x385090)) return RETVAL_LAST_BLOCK;
     if ((i != 0x314159) || (j != 0x265359)) return RETVAL_NOT_BZIP_DATA;
 …
        it didn't actually work. */
+    if(get_bits(bd,1)) return RETVAL_OBSOLETE_INPUT;
+    if((origPtr=get_bits(bd,24)) > dbufSize) return RETVAL_DATA_ERROR;
+    if (get_bits(bd, 1)) return RETVAL_OBSOLETE_INPUT;
+    origPtr = get_bits(bd, 24);
+    if (origPtr > dbufSize) return RETVAL_DATA_ERROR;
     /* mapping table: if some byte values are never used (encoding things
 …
        back to the corresponding bytes. */
+    t=get_bits(bd, 16);
+    symTotal=0;
+    for (i=0;i<16;i++) {
+        if(t&(1<<(15-i))) {
+            k=get_bits(bd,16);
+            for(j=0;j<16;j++)
+                if(k&(1<<(15-j))) symToByte[symTotal++]=(16*i)+j;
+    t = get_bits(bd, 16);
+    symTotal = 0;
+    for (i = 0; i < 16; i++) {
+        if (t & (1 << (15-i))) {
+            k = get_bits(bd, 16);
+            for (j = 0; j < 16; j++)
+                if (k & (1 << (15-j)))
+                    symToByte[symTotal++] = (16*i) + j;
+        }
+    }
 …
     /* How many different Huffman coding groups does this block use? */
+    groupCount=get_bits(bd,3);
+    if (groupCount<2 || groupCount>MAX_GROUPS) return RETVAL_DATA_ERROR;
+    groupCount = get_bits(bd, 3);
+    if (groupCount < 2 || groupCount > MAX_GROUPS)
+        return RETVAL_DATA_ERROR;
     /* nSelectors: Every GROUP_SIZE many symbols we select a new Huffman coding
 …
        start of the list.) */
+    if(!(nSelectors=get_bits(bd, 15))) return RETVAL_DATA_ERROR;
+    for(i=0; i<groupCount; i++) mtfSymbol[i] = i;
+    for(i=0; i<nSelectors; i++) {
+    nSelectors = get_bits(bd, 15);
+    if (!nSelectors) return RETVAL_DATA_ERROR;
+    for (i = 0; i < groupCount; i++) mtfSymbol[i] = i;
+    for (i = 0; i < nSelectors; i++) {
         /* Get next value */
+        for(j=0;get_bits(bd,1);j++) if (j>=groupCount) return RETVAL_DATA_ERROR;
+        for (j = 0; get_bits(bd, 1); j++)
+            if (j>=groupCount) return RETVAL_DATA_ERROR;
         /* Decode MTF to get the next selector */
         uc = mtfSymbol[j];
         for(;j;j--) mtfSymbol[j] = mtfSymbol[j-1];
         mtfSymbol[0]=selectors[i]=uc;
+        for (;j;j--) mtfSymbol[j] = mtfSymbol[j-1];
+        mtfSymbol[0] = selectors[i] = uc;
+    }
 …
        literal symbols, plus two run symbols (RUNA, RUNB) */
     symCount=symTotal+2;
     for (j=0; j<groupCount; j++) {
         unsigned char length[MAX_SYMBOLS],temp[MAX_HUFCODE_BITS+1];
         int minLen, maxLen, pp;
+    symCount = symTotal + 2;
+    for (j = 0; j < groupCount; j++) {
+        unsigned char length[MAX_SYMBOLS], temp[MAX_HUFCODE_BITS+1];
+        int minLen, maxLen, pp;
         /* Read Huffman code lengths for each symbol.  They're stored in
 …
            length 0 becomes negative, so an unsigned inequality catches it.) */
         t=get_bits(bd, 5)-1;
+        t = get_bits(bd, 5) - 1;
         for (i = 0; i < symCount; i++) {
             for(;;) {
                 if (((unsigned)t) > (MAX_HUFCODE_BITS-1))
+            for (;;) {
+                if ((unsigned)t > (MAX_HUFCODE_BITS-1))
                     return RETVAL_DATA_ERROR;
 …
                    bits and unget the second if the first was 0. */
                 k = get_bits(bd,2);
+                k = get_bits(bd, 2);
                 if (k < 2) {
                     bd->inbufBitCount++;
 …
                 /* Add one if second bit 1, else subtract 1.  Avoids if/else */
                 t+=(((k+1)&2)-1);
+                t += (((k+1) & 2) - 1);
+            }
             /* Correct for the initial -1, to get the final symbol length */
             length[i]=t+1;
+            length[i] = t + 1;
+        }
         /* Find largest and smallest lengths in this group */
         minLen=maxLen=length[0];
         for(i = 1; i < symCount; i++) {
             if(length[i] > maxLen) maxLen = length[i];
             else if(length[i] < minLen) minLen = length[i];
+        minLen = maxLen = length[0];
+        for (i = 1; i < symCount; i++) {
+            if (length[i] > maxLen) maxLen = length[i];
+            else if (length[i] < minLen) minLen = length[i];
+        }
 …
          */
         hufGroup=bd->groups+j;
+        hufGroup = bd->groups + j;
         hufGroup->minLen = minLen;
         hufGroup->maxLen = maxLen;
 …
            entry.  We do this again when using them (during symbol decoding).*/
         base=hufGroup->base-1;
         limit=hufGroup->limit-1;
+        base = hufGroup->base - 1;
+        limit = hufGroup->limit - 1;
         /* Calculate permute[].  Concurently, initialize temp[] and limit[]. */
+        pp=0;
+        for(i=minLen;i<=maxLen;i++) {
+            temp[i]=limit[i]=0;
+            for(t=0;t<symCount;t++)
+                if(length[t]==i) hufGroup->permute[pp++] = t;
+        pp = 0;
+        for (i = minLen; i <= maxLen; i++) {
+            temp[i] = limit[i] = 0;
+            for (t = 0; t < symCount; t++)
+                if (length[t] == i)
+                    hufGroup->permute[pp++] = t;
+        }
         /* Count symbols coded for at each bit length */
         for (i=0;i<symCount;i++) temp[length[i]]++;
+        for (i = 0; i < symCount; i++) temp[length[i]]++;
         /* Calculate limit[] (the largest symbol-coding value at each bit
 …
          * limit minus the cumulative count of symbols coded for already). */
         pp=t=0;
         for (i=minLen; i<maxLen; i++) {
             pp+=temp[i];
+        pp = t = 0;
+        for (i = minLen; i < maxLen; i++) {
+            pp += temp[i];
             /* We read the largest possible symbol size and then unget bits
 …
                don't affect the value>limit[length] comparison. */
+            limit[i]= (pp << (maxLen - i)) - 1;
+            pp<<=1;
+            base[i+1]=pp-(t+=temp[i]);
+            limit[i] = (pp << (maxLen - i)) - 1;
+            pp <<= 1;
+            t += temp[i];
+            base[i+1] = pp - t;
+        }
         limit[maxLen+1] = INT_MAX; /* Sentinal value for reading next sym. */
         limit[maxLen]=pp+temp[maxLen]-1;
         base[minLen]=0;
+        limit[maxLen] = pp + temp[maxLen] - 1;
+        base[minLen] = 0;
+    }
 …
     /* Initialize symbol occurrence counters and symbol Move To Front table */
     for(i=0;i<256;i++) {
+    for (i = 0; i < 256; i++) {
         byteCount[i] = 0;
         mtfSymbol[i]=(unsigned char)i;
+        mtfSymbol[i] = (unsigned char)i;
+    }
     /* Loop through compressed symbols. */
     runPos=dbufCount=selector=0;
     for(;;) {
+    runPos = dbufCount = selector = 0;
+    for (;;) {
         /* fetch next Huffman coding group from list. */
         symCount=GROUP_SIZE-1;
         if(selector>=nSelectors) return RETVAL_DATA_ERROR;
         hufGroup=bd->groups+selectors[selector++];
         base=hufGroup->base-1;
         limit=hufGroup->limit-1;
 continue_this_group:
+        symCount = GROUP_SIZE - 1;
+        if (selector >= nSelectors) return RETVAL_DATA_ERROR;
+        hufGroup = bd->groups + selectors[selector++];
+        base = hufGroup->base - 1;
+        limit = hufGroup->limit - 1;
+ continue_this_group:
         /* Read next Huffman-coded symbol. */
 …
            inline (falling back to a call to get_bits if the buffer runs
            dry).  The following (up to got_huff_bits:) is equivalent to
            j=get_bits(bd,hufGroup->maxLen);
+           j = get_bits(bd, hufGroup->maxLen);
          */
         while (bd->inbufBitCount<hufGroup->maxLen) {
             if(bd->inbufPos==bd->inbufCount) {
                 j = get_bits(bd,hufGroup->maxLen);
+        while (bd->inbufBitCount < hufGroup->maxLen) {
+            if (bd->inbufPos == bd->inbufCount) {
+                j = get_bits(bd, hufGroup->maxLen);
                 goto got_huff_bits;
+            }
             bd->inbufBits=(bd->inbufBits<<8)|bd->inbuf[bd->inbufPos++];
             bd->inbufBitCount+=8;
+            bd->inbufBits = (bd->inbufBits << 8) | bd->inbuf[bd->inbufPos++];
+            bd->inbufBitCount += 8;
         };
         bd->inbufBitCount-=hufGroup->maxLen;
         j = (bd->inbufBits>>bd->inbufBitCount)&((1<<hufGroup->maxLen)-1);
 got_huff_bits:
+        bd->inbufBitCount -= hufGroup->maxLen;
+        j = (bd->inbufBits >> bd->inbufBitCount) & ((1 << hufGroup->maxLen) - 1);
+ got_huff_bits:
         /* Figure how how many bits are in next symbol and unget extras */
         i=hufGroup->minLen;
         while(j>limit[i]) ++i;
+        i = hufGroup->minLen;
+        while (j > limit[i]) ++i;
         bd->inbufBitCount += (hufGroup->maxLen - i);
         /* Huffman decode value to get nextSym (with bounds checking) */
+        if ((i > hufGroup->maxLen)
+            || (((unsigned)(j=(j>>(hufGroup->maxLen-i))-base[i]))
+                >= MAX_SYMBOLS))
+        if (i > hufGroup->maxLen)
+            return RETVAL_DATA_ERROR;
+        j = (j >> (hufGroup->maxLen - i)) - base[i];
+        if ((unsigned)j >= MAX_SYMBOLS)
             return RETVAL_DATA_ERROR;
         nextSym = hufGroup->permute[j];
 …
            how many times to repeat the last literal. */
         if (((unsigned)nextSym) <= SYMBOL_RUNB) { /* RUNA or RUNB */
+        if ((unsigned)nextSym <= SYMBOL_RUNB) { /* RUNA or RUNB */
             /* If this is the start of a new run, zero out counter */
             if(!runPos) {
+            if (!runPos) {
                 runPos = 1;
                 t = 0;
 …
             t += (runPos << nextSym); /* +runPos if RUNA; +2*runPos if RUNB */
             if(runPos < dbufSize) runPos <<= 1;
+            if (runPos < dbufSize) runPos <<= 1;
             goto end_of_huffman_loop;
+        }
 …
            literal used is the one at the head of the mtfSymbol array.) */
         if(runPos) {
             runPos=0;
             if(dbufCount+t>=dbufSize) return RETVAL_DATA_ERROR;
+        if (runPos) {
+            runPos = 0;
+            if (dbufCount + t >= dbufSize) return RETVAL_DATA_ERROR;
             uc = symToByte[mtfSymbol[0]];
             byteCount[uc] += t;
             while(t--) dbuf[dbufCount++]=uc;
+            while (t--) dbuf[dbufCount++] = uc;
+        }
         /* Is this the terminating symbol? */
         if(nextSym>symTotal) break;
+        if (nextSym > symTotal) break;
         /* At this point, nextSym indicates a new literal character.  Subtract
 …
 non-literal nextSym values equals -1.) */
         if(dbufCount>=dbufSize) return RETVAL_DATA_ERROR;
+        if (dbufCount >= dbufSize) return RETVAL_DATA_ERROR;
         i = nextSym - 1;
         uc = mtfSymbol[i];
 …
         } while (--i);
         mtfSymbol[0] = uc;
         uc=symToByte[uc];
+        uc = symToByte[uc];
         /* We have our literal byte.  Save it into dbuf. */
         byteCount[uc]++;
         dbuf[dbufCount++] = (unsigned int)uc;
+        dbuf[dbufCount++] = (unsigned)uc;
         /* Skip group initialization if we're not done with this group.  Done
          * this way to avoid compiler warning. */
 end_of_huffman_loop:
         if(symCount--) goto continue_this_group;
+ end_of_huffman_loop:
+        if (symCount--) goto continue_this_group;
+    }
     /* At this point, we've read all the Huffman-coded symbols (and repeated
        runs) for this block from the input stream, and decoded them into the
+       runs) for this block from the input stream, and decoded them into the
        intermediate buffer.  There are dbufCount many decoded bytes in dbuf[].
        Now undo the Burrows-Wheeler transform on dbuf.
 …
     /* Turn byteCount into cumulative occurrence counts of 0 to n-1. */
     j=0;
     for(i=0;i<256;i++) {
         k=j+byteCount[i];
+    j = 0;
+    for (i = 0; i < 256; i++) {
+        k = j + byteCount[i];
         byteCount[i] = j;
         j=k;
+        j = k;
+    }
     /* Figure out what order dbuf would be in if we sorted it. */
     for (i=0;i<dbufCount;i++) {
         uc=(unsigned char)(dbuf[i] & 0xff);
+    for (i = 0; i < dbufCount; i++) {
+        uc = (unsigned char)(dbuf[i] & 0xff);
         dbuf[byteCount[uc]] |= (i << 8);
         byteCount[uc]++;
 …
        it doesn't qualify as a run (hence writeRunCountdown=5). */
     if(dbufCount) {
         if(origPtr>=dbufCount) return RETVAL_DATA_ERROR;
         bd->writePos=dbuf[origPtr];
         bd->writeCurrent=(unsigned char)(bd->writePos&0xff);
         bd->writePos>>=8;
         bd->writeRunCountdown=5;
+    }
     bd->writeCount=dbufCount;
+    if (dbufCount) {
+        if (origPtr >= dbufCount) return RETVAL_DATA_ERROR;
+        bd->writePos = dbuf[origPtr];
+        bd->writeCurrent = (unsigned char)(bd->writePos & 0xff);
+        bd->writePos >>= 8;
+        bd->writeRunCountdown = 5;
+    }
+    bd->writeCount = dbufCount;
     return RETVAL_OK;
 …
 */
 static int read_bunzip(bunzip_data *bd, char *outbuf, int len)
+int read_bunzip(bunzip_data *bd, char *outbuf, int len)
+{
     const unsigned int *dbuf;
     int pos,current,previous,gotcount;
+    const unsigned *dbuf;
+    int pos, current, previous, gotcount;
     /* If last read was short due to end of file, return last block now */
     if(bd->writeCount<0) return bd->writeCount;
+    if (bd->writeCount < 0) return bd->writeCount;
     gotcount = 0;
     dbuf=bd->dbuf;
     pos=bd->writePos;
     current=bd->writeCurrent;
+    dbuf = bd->dbuf;
+    pos = bd->writePos;
+    current = bd->writeCurrent;
     /* We will always have pending decoded data to write into the output
 …
         /* Loop outputting bytes */
         for(;;) {
+        for (;;) {
             /* If the output buffer is full, snapshot state and return */
             if(gotcount >= len) {
                 bd->writePos=pos;
                 bd->writeCurrent=current;
+            if (gotcount >= len) {
+                bd->writePos  =pos;
+                bd->writeCurrent = current;
                 bd->writeCopies++;
                 return len;
 …
             outbuf[gotcount++] = current;
             bd->writeCRC=(((bd->writeCRC)<<8)
                           ^bd->crc32Table[((bd->writeCRC)>>24)^current]);
+            bd->writeCRC = (bd->writeCRC << 8)
+                          ^ bd->crc32Table[(bd->writeCRC >> 24) ^ current];
             /* Loop now if we're outputting multiple copies of this byte */
 …
                 continue;
+            }
 decode_next_byte:
+ decode_next_byte:
             if (!bd->writeCount--) break;
             /* Follow sequence vector to undo Burrows-Wheeler transform */
             previous=current;
             pos=dbuf[pos];
             current=pos&0xff;
             pos>>=8;
+            previous = current;
+            pos = dbuf[pos];
+            current = pos & 0xff;
+            pos >>= 8;
             /* After 3 consecutive copies of the same byte, the 4th is a repeat
 …
              * of counting up because testing for non-zero is faster */
+            if(--bd->writeRunCountdown) {
+                if(current!=previous) bd->writeRunCountdown=4;
+            if (--bd->writeRunCountdown) {
+                if (current != previous)
+                    bd->writeRunCountdown = 4;
             } else {
                 /* We have a repeated run, this byte indicates the count */
                 bd->writeCopies=current;
                 current=previous;
                 bd->writeRunCountdown=5;
+                bd->writeCopies = current;
+                current = previous;
+                bd->writeRunCountdown = 5;
                 /* Sometimes there are just 3 bytes (run length 0) */
                 if(!bd->writeCopies) goto decode_next_byte;
+                if (!bd->writeCopies) goto decode_next_byte;
                 /* Subtract the 1 copy we'd output anyway to get extras */
 …
         /* Decompression of this block completed successfully */
         bd->writeCRC=~bd->writeCRC;
         bd->totalCRC=((bd->totalCRC<<1) | (bd->totalCRC>>31)) ^ bd->writeCRC;
+        bd->writeCRC = ~bd->writeCRC;
+        bd->totalCRC = ((bd->totalCRC << 1) | (bd->totalCRC >> 31)) ^ bd->writeCRC;
         /* If this block had a CRC error, force file level CRC error. */
         if(bd->writeCRC!=bd->headerCRC) {
             bd->totalCRC=bd->headerCRC+1;
+        if (bd->writeCRC != bd->headerCRC) {
+            bd->totalCRC = bd->headerCRC+1;
             return RETVAL_LAST_BLOCK;
+        }
 …
     /* (previous is just a convenient unused temp variable here) */
     previous=get_next_block(bd);
     if(previous) {
         bd->writeCount=previous;
         return (previous!=RETVAL_LAST_BLOCK) ? previous : gotcount;
+    }
     bd->writeCRC=~0;
     pos=bd->writePos;
     current=bd->writeCurrent;
+    previous = get_next_block(bd);
+    if (previous) {
+        bd->writeCount = previous;
+        return (previous != RETVAL_LAST_BLOCK) ? previous : gotcount;
+    }
+    bd->writeCRC = ~0;
+    pos = bd->writePos;
+    current = bd->writeCurrent;
     goto decode_next_byte;
+}
 /* Allocate the structure, read file header.  If in_fd==-1, inbuf must contain
 …
    ignored, and data is read from file handle into temporary buffer. */
+static int start_bunzip(bunzip_data **bdp, int in_fd, unsigned char *inbuf,
+/* Because bunzip2 is used for help text unpacking, and because bb_show_usage()
+   should work for NOFORK applets too, we must be extremely careful to not leak
+   any allocations! */
+int start_bunzip(bunzip_data **bdp, int in_fd, const unsigned char *inbuf,
                         int len)
+{
     bunzip_data *bd;
+    unsigned int i;
+    const unsigned int BZh0=(((unsigned int)'B')<<24)+(((unsigned int)'Z')<<16)
+                            +(((unsigned int)'h')<<8)+(unsigned int)'0';
+    unsigned i;
+    enum {
+        BZh0 = ('B' << 24) + ('Z' << 16) + ('h' << 8) + '0'
+    };
     /* Figure out how much data to allocate */
     i=sizeof(bunzip_data);
     if(in_fd!=-1) i+=IOBUF_SIZE;
+    i = sizeof(bunzip_data);
+    if (in_fd != -1) i += IOBUF_SIZE;
     /* Allocate bunzip_data.  Most fields initialize to zero. */
     bd=*bdp=xzalloc(i);
+    bd = *bdp = xzalloc(i);
     /* Setup input buffer */
+    if(-1==(bd->in_fd=in_fd)) {
+        bd->inbuf=inbuf;
+        bd->inbufCount=len;
+    } else bd->inbuf=(unsigned char *)(bd+1);
+    bd->in_fd = in_fd;
+    if (-1 == in_fd) {
+        /* in this case, bd->inbuf is read-only */
+        bd->inbuf = (void*)inbuf; /* cast away const-ness */
+        bd->inbufCount = len;
+    } else
+        bd->inbuf = (unsigned char *)(bd + 1);
     /* Init the CRC32 table (big endian) */
     bd->crc32Table = bb_crc32_filltable(1);
+    crc32_filltable(bd->crc32Table, 1);
     /* Setup for I/O error handling via longjmp */
     i=setjmp(bd->jmpbuf);
     if(i) return i;
+    i = setjmp(bd->jmpbuf);
+    if (i) return i;
     /* Ensure that file starts with "BZh['1'-'9']." */
     i = get_bits(bd,32);
     if (((unsigned int)(i-BZh0-1)) >= 9) return RETVAL_NOT_BZIP_DATA;
+    i = get_bits(bd, 32);
+    if ((unsigned)(i - BZh0 - 1) >= 9) return RETVAL_NOT_BZIP_DATA;
     /* Fourth byte (ascii '1'-'9'), indicates block size in units of 100k of
        uncompressed data.  Allocate intermediate buffer for block. */
+    bd->dbufSize=100000*(i-BZh0);
+    bd->dbuf=xmalloc(bd->dbufSize * sizeof(int));
+    bd->dbufSize = 100000 * (i - BZh0);
+    /* Cannot use xmalloc - may leak bd in NOFORK case! */
+    bd->dbuf = malloc_or_warn(bd->dbufSize * sizeof(int));
+    if (!bd->dbuf) {
+        free(bd);
+        xfunc_die();
+    }
     return RETVAL_OK;
+}
+/* Example usage: decompress src_fd to dst_fd.  (Stops at end of bzip data,
+   not end of file.) */
+int uncompressStream(int src_fd, int dst_fd)
+void dealloc_bunzip(bunzip_data *bd)
+{
+    free(bd->dbuf);
+    free(bd);
+}
+/* Decompress src_fd to dst_fd.  Stops at end of bzip data, not end of file. */
+USE_DESKTOP(long long) int
+unpack_bz2_stream(int src_fd, int dst_fd)
+{
+    USE_DESKTOP(long long total_written = 0;)
     char *outbuf;
     bunzip_data *bd;
     int i;
+    outbuf=xmalloc(IOBUF_SIZE);
+    if(!(i=start_bunzip(&bd,src_fd,0,0))) {
+        for(;;) {
+            if((i=read_bunzip(bd,outbuf,IOBUF_SIZE)) <= 0) break;
+            if(i!=write(dst_fd,outbuf,i)) {
+                i=RETVAL_UNEXPECTED_OUTPUT_EOF;
+    outbuf = xmalloc(IOBUF_SIZE);
+    i = start_bunzip(&bd, src_fd, NULL, 0);
+    if (!i) {
+        for (;;) {
+            i = read_bunzip(bd, outbuf, IOBUF_SIZE);
+            if (i <= 0) break;
+            if (i != safe_write(dst_fd, outbuf, i)) {
+                i = RETVAL_UNEXPECTED_OUTPUT_EOF;
                 break;
+            }
+            USE_DESKTOP(total_written += i;)
+        }
+    }
 …
     /* Check CRC and release memory */
     if(i==RETVAL_LAST_BLOCK) {
         if (bd->headerCRC!=bd->totalCRC) {
             bb_error_msg("Data integrity error when decompressing.");
+    if (i == RETVAL_LAST_BLOCK) {
+        if (bd->headerCRC != bd->totalCRC) {
+            bb_error_msg("data integrity error when decompressing");
         } else {
             i=RETVAL_OK;
+        }
     } else if (i==RETVAL_UNEXPECTED_OUTPUT_EOF) {
         bb_error_msg("Compressed file ends unexpectedly");
+            i = RETVAL_OK;
+        }
+    } else if (i == RETVAL_UNEXPECTED_OUTPUT_EOF) {
+        bb_error_msg("compressed file ends unexpectedly");
     } else {
+        bb_error_msg("Decompression failed");
+    }
+    free(bd->dbuf);
+    free(bd);
+        bb_error_msg("decompression failed");
+    }
+    dealloc_bunzip(bd);
     free(outbuf);
     return i;
+    return i ? i : USE_DESKTOP(total_written) + 0;
+}
 #ifdef TESTING
+static char * const bunzip_errors[]={NULL,"Bad file checksum","Not bzip data",
+        "Unexpected input EOF","Unexpected output EOF","Data error",
+         "Out of memory","Obsolete (pre 0.9.5) bzip format not supported."};
+static char *const bunzip_errors[] = {
+    NULL, "Bad file checksum", "Not bzip data",
+    "Unexpected input EOF", "Unexpected output EOF", "Data error",
+    "Out of memory", "Obsolete (pre 0.9.5) bzip format not supported"
+};
 /* Dumb little test thing, decompress stdin to stdout */
 int main(int argc, char *argv[])
+int main(int argc, char **argv)
+{
     int i=uncompressStream(0,1);
+    int i = unpack_bz2_stream(0, 1);
     char c;
+    if(i) fprintf(stderr,"%s\n", bunzip_errors[-i]);
+    else if(read(0,&c,1)) fprintf(stderr,"Trailing garbage ignored\n");
+    if (i < 0)
+        fprintf(stderr,"%s\n", bunzip_errors[-i]);
+    else if (read(0, &c, 1))
+        fprintf(stderr,"Trailing garbage ignored\n");
     return -i;
+}

Note: See TracChangeset for help on using the changeset viewer.

Context Navigation

Changeset 1770 in MondoRescue for branches/stable/mindi-busybox/archival/libunarchive/decompress_bunzip2.c

Legend:

branches/stable/mindi-busybox/archival/libunarchive/decompress_bunzip2.c

Download in other formats: