Type (or paste) some ASCII text in the box to try it out. An explanation will appear, but you can also read information on the USCII homepage to learn more.
While ASCII uses small arbitrary values (e.g. 65 for "A", 66 for "B"), USCII encodings use much larger values (e.g. 15621226033 for "A", 16400753439 for "B"). These unusual numbers were chosen because they mathematically contain bitmaps of the symbols they represent! Using a technique modeled after SETI's Arecibo Message, semiprimes are employed to suggest the two-dimensional decoding of these bit patterns.
USCII signals are able to carry meaning to a receiver independent of any table. However, transmitters agree to use standard bitmaps (resulting in standard encoded values). This way they can also be used like conventional character codes—for instance, to pick which character to draw from a vector font!
Success: USCII-5x7-ENGLISH-C0 data (including the string container, header, and footer) is bytes long
What did the encoding do?
We start by making a series of 5x7 bitmaps that represent the letters and control codes in your input:
Note that there are five solid characters at the head of the signal, and seven solid characters at the end. This is called the "meter" because it helps set up the rhythm of the 35-bit sections (as well as reinforcing the importance of the prime factors).
The next step is to wrap up the data in a kind of "container". When conventional ASCII strings are stored in computer memory, there needs to be a way to know where the string ends. Sometimes only the end is indicated, such as with a Null Terminator...adding one byte. Numerous other strategies exist.
However: in USCII the goal is semiotics—not frugality. To make a string in memory as conspicuous and robust a pattern as possible; we indicate the end and the beginning with patterns. The patterns used to accomplish this are designed to reinforce the significance of the prime factors, while minimizing any appearance of carrying actual message content.
To accomplish this containment, we start by interleaving the characters with five binary zero bits... which separates the meter blocks and calls more attention to the significance of the number 5. There is an added benefit to padding out the 35-bit semiprime series with 5-bits to reach 40: each character unit fits exactly in 40 bits (5 bytes), for easier processing on computers which segment data into "byte" multiples of 8 bits.
Next we set up some surrounding empty space, called the "silence". This is just a steady stream of 0 bits before the leading meter, and continuous 1 bits after the trailing meter. Without the silence included in the message itself, then two consecutive USCII strings in memory (for instance) would have meter bumping up against each other...potentially obscuring the meaning of the meter length. For meaningfulness in that case, we choose 7 "character-sized" 40-bit units of zero silence in the beginning, and 5 40-bit units of one silence in the end.
Due to the careful size choices, we can cleanly encode this in bytes...and form it into the hexadecimal for the conversion.
If the recipients of the message are mathematically-minded, they'll be cued into noticing the importance of 5 and 7, and thus notice the 35-bit runs. Since 35 is a semiprime number, it's only divisible by 5 and 7, hence it might lead them to decode it as a 5x7 bitmap. See if your friends with geek-cred can figure it out, or send them the decoder if you are feeling generous. :)