Document corelist change in perldelta

[perl5.git] / pod / perlpacktut.pod
diff --git a/pod/perlpacktut.pod b/pod/perlpacktut.pod

index 7c52d64..b0b5bdf 100644 (file)
--- a/pod/perlpacktut.pod
+++ b/pod/perlpacktut.pod
@@ -73,20 +73,19 @@ remains.
  The inverse operation - packing byte contents from a string of hexadecimal
  digits - is just as easily written. For instance:
  
-   my $s = pack( 'H2' x 10, map { "3$_" } ( 0..9 ) );
+   my $s = pack( 'H2' x 10, 30..39 );
     print "$s\n";
  
  Since we feed a list of ten 2-digit hexadecimal strings to C<pack>, the
  pack template should contain ten pack codes. If this is run on a computer
  with ASCII character coding, it will print C<0123456789>.
  
-
  =head1 Packing Text
  
  Let's suppose you've got to read in a data file like this:
  
      Date      |Description                | Income|Expenditure
-    01/24/2001 Ahmed's Camel Emporium                  1147.99
+    01/24/2001 Zed's Camel Emporium                    1147.99
      01/28/2001 Flea spray                                24.99
      01/29/2001 Camel rides to tourists      235.00
  
@@ -201,7 +200,7 @@ how much we've spent:
  
  Oh, hmm. That didn't quite work. Let's see what happened:
  
-    01/24/2001 Ahmed's Camel Emporium                   1147.99
+    01/24/2001 Zed's Camel Emporium                     1147.99
      01/28/2001 Flea spray                                 24.99
      01/29/2001 Camel rides to tourists     1235.00
      03/23/2001Totals                     1235.001172.98
@@ -226,7 +225,7 @@ additional spaces to line up our fields, like this:
  but they don't translate to spaces in the output.) Here's what we got
  this time:
  
-    01/24/2001 Ahmed's Camel Emporium                   1147.99
+    01/24/2001 Zed's Camel Emporium                     1147.99
      01/28/2001 Flea spray                                 24.99
      01/29/2001 Camel rides to tourists     1235.00
      03/23/2001 Totals                      1235.00 1172.98
@@ -660,7 +659,7 @@ Usually you'll want to pack or unpack UTF-8 strings:
  
  Please note: in the general case, you're better off using
  Encode::decode_utf8 to decode a UTF-8 encoded byte string to a Perl
-unicode string, and Encode::encode_utf8 to encode a Perl unicode string
+Unicode string, and Encode::encode_utf8 to encode a Perl Unicode string
  to UTF-8 bytes. These functions provide means of handling invalid byte
  sequences and generally have a friendlier interface.
  
@@ -853,6 +852,51 @@ template for C<pack> and C<unpack> because C<pack> can't determine
  a repeat count for a C<()>-group.
  
  
+=head2 Intel HEX
+
+Intel HEX is a file format for representing binary data, mostly for
+programming various chips, as a text file. (See
+L<http://en.wikipedia.org/wiki/.hex> for a detailed description, and
+L<http://en.wikipedia.org/wiki/SREC_(file_format)> for the Motorola
+S-record format, which can be unravelled using the same technique.)
+Each line begins with a colon (':') and is followed by a sequence of
+hexadecimal characters, specifying a byte count I<n> (8 bit),
+an address (16 bit, big endian), a record type (8 bit), I<n> data bytes
+and a checksum (8 bit) computed as the least significant byte of the two's
+complement sum of the preceding bytes. Example: C<:0300300002337A1E>.
+
+The first step of processing such a line is the conversion, to binary,
+of the hexadecimal data, to obtain the four fields, while checking the
+checksum. No surprise here: we'll start with a simple C<pack> call to 
+convert everything to binary:
+
+   my $binrec = pack( 'H*', substr( $hexrec, 1 ) );
+
+The resulting byte sequence is most convenient for checking the checksum.
+Don't slow your program down with a for loop adding the C<ord> values
+of this string's bytes - the C<unpack> code C<%> is the thing to use
+for computing the 8-bit sum of all bytes, which must be equal to zero:
+
+   die unless unpack( "%8C*", $binrec ) == 0;
+
+Finally, let's get those four fields. By now, you shouldn't have any
+problems with the first three fields - but how can we use the byte count
+of the data in the first field as a length for the data field? Here
+the codes C<x> and C<X> come to the rescue, as they permit jumping
+back and forth in the string to unpack.
+
+   my( $addr, $type, $data ) = unpack( "x n C X4 C x3 /a", $bin ); 
+
+Code C<x> skips a byte, since we don't need the count yet. Code C<n> takes
+care of the 16-bit big-endian integer address, and C<C> unpacks the
+record type. Being at offset 4, where the data begins, we need the count.
+C<X4> brings us back to square one, which is the byte at offset 0.
+Now we pick up the count, and zoom forth to offset 4, where we are
+now fully furnished to extract the exact number of data bytes, leaving
+the trailing checksum byte alone.
+
+
+
  =head1 Packing and Unpacking C Structures
  
  In previous sections we have seen how to pack numbers and character