The Instrumentation Key Theory

Disclaimer:

I have no hard data to support the claims that I am presenting below. This document is merely the first step in designing the experiments needed to test these hypothesis. I'm creating this document to help clarify my views so that I can identify the parts that need to be tested. I'm using the tag 'I believe' to identify the places where (I believe) testing is needed.

Introduction

I am attempting to create a universal human computer interface that can be used by any person on earth without regard to the language that they speak.

A 'verbal' human computer interface would require multiple translation facilities for all of the major human languages. These translators would have to comprehend both irregular syntax and grammar, along with varying regional and personal accents. Instrumentation attempts to minimize this problem by using a common generic syntax and thereby reducing translation requirements to single words and simple conversational phrases.

Furthermore, a verbal interface could be insecure and annoying in crowded environments and confusing in noisy environments. With Instrumentation and a heads-up display (like Google Glass), you could carry on a secure, silent conversation with your hands (and keypads) hidden in your pockets (disclaimer: large pockets not included).

I believe that the chorded (or chording) keypad can be the most efficient type of keyboard.

In this design each finger has one and only one key and each key is either on or off (there is no third possibility). I believe that this creates the simplest digital interface (in any sense of the word 'digital').

I believe that Instrumentation defines an 'optimal' implementation of two, five key chorded keypads (one for each hand). I call it optimal because (I believe that) it addresses the largest practical space in the most efficient way.

Finally I believe that the Instrumentation glyph and the ontology that underlies it provides a very mnemonic taxonomy which ties the entire design together.

In the remainder of this document I shall attempt to justify these claims.

The Largest Practical Space

If all ten fingers (including the thumbs) were used to create a single binary index, the address space would only contain two to the tenth terms (or 2^10) or one kibi-term. This is much smaller than Instrumentation's address space.

FYI: In binary exponential notation, kibi-term = 2^10 = 1024 terms, mebi-term = 2^20 = 1048576 terms, etc. I will be using this convention where possible for the rest of this document.

The Instrumentation address space is created by using eight fingers to specify a binary number from zero to 255 (or #0 to #FF in hexadecimal). The two thumbs then specify a two bit exponent or 'layer' of zero to three. In hexadecimal, 255 is #FF. This means that the right hand forms the high order #F0 and the left hand forms the low order #0F for each layer. I believe that this recasts these hexadecimal numbers as "counting on the fingers".

When all four layers are specified (#FF #FF #FF #FF ), this creates an index for an address space of 256 to the fourth power or 2^32 or four gibi-terms.

If six fingers were used as the base and four fingers were used as the exponent, it would create an address space of 64 to the sixteenth or 2^96 or 7.9X10^28 terms. This is much larger than the proposed space, but it ignores the natural partitioning of the hands (two thumbs and eight fingers) and the user would be required to specify sixteen layers of zero to 64 to generate a complete address. In essence, this method awkwardly creates an address space that is so large that (I believe) few users would be able to master it.

A base of four fingers and an exponent of six fingers would also address 2^96 terms but it would require 64 layers of sixteen for a complete address. A base of two and an exponent of eight would address 2^32 terms but would require 256 layers of four. These schemes provide no additional benefit at greater cost.

There are no other comparable addressing schemes that use the fingers symmetrically, where the same number of fingers on each hand are used to create the base and the exponent. The following are the remaining asymmetric schemes.

if one thumb was use as an exponent we would have a base of 512 and an exponent of two, which would address 2^18 terms or 256 kibi-terms and would require two layers of 512. This might be workable, but I don't think the vocabulary would be large enough to contain all of the potential scientific and technical terms and functions needed by everyone everywhere for the foreseeable future.

if seven fingers were used as the base and three fingers were used as the exponent, the address space would be 127^8 terms or 2^56 terms or 64 pibi-terms. This also might be workable, but (I believe that) eight layers is too cumbersome, the lack of symmetry makes the chording awkward and the added amount of space is unneeded.

If the right hand was the base and the left hand was the exponent (or vice versa), the address space would be 32^32 terms or 2^160 terms. I believe that this is the largest possible address space, but we have long since passed the point of economy of scale and are deep into the land of diminishing returns.

Here are all of the previous schemes in order:

Base fingers	Exponent fingers	# of Layers	Address Space
10	0	1	2^10 terms
9	1	2	2^18 terms
8	2	4	2^32 terms (sweet spot)
7	3	8	2^56 terms
6	4	16	2^96 terms
5	5	32	2^160 terms
4	6	64	2^96 terms (same as 6 & 4 fingers)
2	8	256	2^32 terms (same as 8 & 2 fingers)

In summation, I believe that four layers of 256 creates a large but manageable address space in a naturalistic manner. I shall now attempt to show that Instrumentation uses this space in an efficient way.

The Most Efficient Way

The total Instrumentation address space of four gibi-terms is intrinsically partitioned into 256 Areas by the Specialization layer.

Each Area contains sixteen mebi-terms. Each Area is dedicated to a separate discipline which has it's own specialized vocabulary. Each Area is then partitioned into Divisions, which contain sixty-four kibi-terms and into Blocks, which contain 256 terms.

Since each layer is uniquely identified using the thumbs, it is not necessary to enter any layer that has a value of zero. Instrumentation takes advantage of this by putting (what I believe will be) the most needed features at the indexes that can be selected with the fewest chords.

Layer 1 - Creation

These are the Categorical terms

They should be the simplest singular nouns within each Category

They provide the mnemonic foundation for the Basic terms (below)

Layer 2 - Description

These are the Hypodescriptive terms
They can express meaning which might not be covered in an electronic message
They can simplify command and communication

They should be an abbreviated, simple or expressive set of terms

Layers 1 + 2

These are the Basic terms

They should be essential nouns, adjectives and adverbs

Complete conversational phrases are included as a convenience

Layer 3 - Articulation

These are the Syntactic terms

They should be essential relationships and 'connecting words'

They include verb cases which turn nouns, adjectives and adverbs into verbs
They are normally combined with (and therefore modify) a Basic term

Layers 1 + 2 + 3 (and layers 1 + 3 or layers 2 + 3)

This is the basic vocabulary

This should be an adequate vocabulary for most casual conversations

"Casual Conversation" is Area zero of the total address space

Layer 4 - Specialization

This is the top level of the Index

This should facilitate term discovery

Layers 4 + 3 + 2 (and possibly layers 4 + 3)

This will lock-in a Block of terms

So that only layer one need be entered until that block is unlocked
This should simplify usage of control oriented blocks

This is also part of the Index

Layers 4 + 3 + 2 + 1

These are Specialized terms
My goal is to minimize or mitigate the use of these "four layer terms"

They should only be needed for technical conversations

They include commands, keywords and links
These long indexes can be abbreviated using Special Layer Pointers

I've used the qualifier 'should' above to describe the actual vocabulary because I am sure that many of my current term choices are far from optimal. I hope that the vocabulary will improve as the language matures.

The Most Ergonomic Keypad Interface

If you hold your hands with the thumbs pointing upward, the little fingers correspond to the lowest elements (zero and four) in the glyph below, while the pointer fingers correspond to the highest elements (three and seven).

The 2C Glyph

If an element (a spoke or a lath) is present (blue or black), the user would press the corresponding key. If an element is absent (yellow or invisible), the user would release the corresponding key. The presence and absence of elements can be demonstrated using the game.

Since each finger has a single key, the typist's fingers never leave their 'home' keys. Since each key is either pressed or released, the typist needn't worry about the intermediate positions of the fingers. I believe that these features are the major contributors to the simplicity of the keypad interface.

It would be possible to include a second thumb button for power, but that button would only be touched once at the beginning of a session and once at the end of a session. I suspect that a separate power switch is unnecessary, but that is a minor issue in any case.

Once the finger positions have been memorized, they can also serve as a sign language (presumably with the 'signer' interlacing their fingers to correct the positions of the right and left hands from the viewer's point of view). If the keys on the keyboard provide haptic feedback, the keyboard itself can serve as a 'headless' reading device for the blind and others. 'Headless' refers to a system that has no visual display capability.

It is questionable as to whether the average person can learn to enter chords automatically, but (I believe that) it is roughly equivalent to playing chords on a piano and (I believe that) the notation (the glyph) is actually simpler to interpret than standard musical notation because of the direct relationship between the elements and the finger positions.

I assume that users would start in 'arpeggio mode' where each key would be pressed sequentially. As translation of glyph appearance (or 'aspect') to finger placement begins to become automatic, the user could progress to full fledged chording where all keys needed on a given layer are pressed at the same time.

Playing the Solitaire game has taught me to automatically see the combinations of elements as individual hexadecimal numbers. I believe that this will also help to teach people to think of chords as single things rather than combinations of fingers.

The Most Mnemonic Representation

I believe that the trigrams of the I Ching are a brilliant distillation of the states of the human condition similar to the Kübler-Ross model or the "Hierarchy of Needs" of Abraham Maslow. I believe that the trigrams of the I Ching can be used to create a symbolic ontology that can encompass all human experience. Most importantly, there is one trigram for each finger (otherwise I would have been forced to use something else).

A glyph has four quadrants and each quadrant contains two octants. An index can be mapped to the four layers of the glyph. The inner spokes are the two lowest order numbers (00 00 00 FF) and the outer laths are the two highest order numbers (FF 00 00 00).

A Split Glyph

The four octants on the right are the 'active' (or Yang) octants and correspond to the four fingers of the right hand. The four octants on the left are the 'passive' (or Yin) octants and correspond to the four fingers of the left hand.

The four upper octants are the 'more controlling' octants, while the four lower octants are the 'less controlling' octants.

The four polar octants (two top and two bottom) are more stable, while the four equitorial octants (two left and two right) are more transitional.

Each octant has four mnemonic types which are based on the trigrams of the I Ching (one type on each layer). These types are associated with the active/passive, more/less controlling and stable/transitional states to help users remember their locations.

I believe that these somatic, visual and semiotic cues will help users to remember the index associated with an individual term.

Afterword

I am not claiming that the grammatical structure I have designed for Instrumentation is the best fit for human expression within the limits that I have defined above. I have tried to make the language as simple and straightforward as possible to facilitate the learning curve, but there may be more universal, adaptable or expressive arrangements that I could have used.

For anyone who is still (pretending to be) interested, the following are my "rigidly defined areas of doubt and uncertainty".

Creation:

The 256 Categorical terms

do they cover all human concerns equitably?

Description:

Active and Passive nouns

is this taxonomic simplification misaligned?

Adjectives and Adverbs

would a generic 'modifier' for both verbs and nouns be better?

Plurals

should pluralization be moved to the articulation layer?

Adverbial and Adjectival Degree forms

is this level of specificity needed?

Conversational phrases

Should there be more or less than 3 kibi-terms?

The Hypodescriptive Sub-table names and contents

would other terms better optimize this valuable real estate?

Articulation:

Qualifiers, Operators, Specifiers, Modifiers and Actors (verbs)

are these the most global and needed relationships?

Verb Tense and Person

is there a better way to handle verbs?

Verb interaction with Active nouns, Passive nouns, Adjectives and Adverbs

can verb 'meaning' be made clearer?

Specialization:

The selection and distribution of the Areas

will this adequately cover the sum total of human knowledge now and forever?

Negation

is there a better way to handle this?

Everything else

seriously, what was I thinking?

back to the home page.