• Home
  • DARPA SyNAPSE
  • Business-minded
  • Compute Me
  • Brainplug
  • Biophys-Ed

We all need control (theory)

Tim Barnes | February 7, 2010

Top Gun taught us that the best and brightest pilots can perform some amazing aerobatics.  Nobody seems surprised that a good pilot, with some practice, can move seamlessly from the flight maneuvers used on a Boeing 747 to those featured in Blue Angels shows.  While computer autopilots have performed well in commercial aircraft for some time, however, getting an electronic computer to pull a plane successfully through an aerobatic maneuver is almost impossible, and is thus a relatively new field of research.

Though this information may surprise one at first, the example is but one of a string of revelations accompanying the journey of intelligence research.  Thirty years ago very few scientists expected to run into difficulties in building cameras that understood what they were ‘looking at’ similarly to a person, but the challenge remains.  The emerging theme from varied engineering projects is that biological intelligence trades off some performance abilities for adaptability to a wide range of difficult conditions.  Becoming a professional baseball pitcher may take a good fifteen to twenty years, but it doesn’t take long for someone to learn to play catch, throw trash away, flip a pancake, and so on.  One of the primary goals of neuromorphic technology is to fill in the spectrum between the mathematical but rigid perfection of traditional controllers (and models), and the messy, cheap and rugged heuristics of the animals.

Among the most likely reasons that vision, aerobatics, and many other complicated problems become difficult very quickly is the inherent nonlinearity of the relationship between the input and output of a control system.  Let’s say that someone wants to design a controller that regulates a car’s throttle and transfers power to the different wheels while driving.  The controller knows how fast the car is going, how hard the driver is turning the wheel, and the weather conditions.  The controller probably does a pretty good job of keeping the car from sliding when the road is dry and clean.  In the rain, however, the tires will be more prone to slipping, so the controller shouldn’t allow the throttle to drive the wheels as hard in case they break loose.  The rule ‘more rain → less throttle’ is a simple and linear relationship that could be easily trained into the controller .  During the first fifteen minutes of rain, however, the road is especially slick because rainwater brings oils to the surface of the asphalt, after which times it gets washed away.  In other words, the danger of slipping has a nonlinear relationship to the amount of water detected by the controller; it goes up for about fifteen minutes and then settles back down to something worse than a dry road.  This small difference might not be a problem for a conservative controller designed to keep a family safe, but what if its purpose is to maximize turning speed in a race?  If it were trained only on very dry and very rainy roads (which seems reasonable enough), then a linear controller couldn’t anticipate a good throttle position for the start of a rainstorm, and a small error in throttle could result in a huge difference for the driver!

Rain danger is an example of nonlinearity in a single input dimension, but the situation becomes more complicated with interacting conditions.  Turning quickly is safer on a cement road than uneven asphalt because the uneven surface allows for less traction; dry roads are safer than wet ones because of hydroplaning.  A wet asphalt road may be better than a wet cement road, however, because the uneven surface lifts the tire and allows water to escape from under it (unlikely, don’t test this).  Rather than two bad conditions simply adding together to become worse, the combination may actually be safer than either one alone.

(The situation is a contrived example of the XOR problem, which was shown to be impossible for a single layer perceptron by Minsky and Papert.  In a mathematical sense, all nonlinearities are expressed as multiplicative interacting terms; the previous single variable case arises from a variable’s interactions with itself.  There’s also an interesting connection between nonlinear interactions and the ability of people to categorize and structure the world hierarchically.  Hofstadter mentions that expert chess players, when performing a task of quickly memorizing and recalling chess boards, make recall errors with groups of pieces rather than the single piece errors that novices make.  The grouping process, the recognition that some arrangements of pieces affect the outcome of the game more than others, makes steps towards capturing the nonlinearity of such a complex system.)

A controller can either learn how the process it’s controlling (the plant) works, or it can learn the inverse plant.  Learning the inverse problem can be easier because the algorithm is straightforward to apply; the input to the network is the desired outcome and the output is the action for the controller to take.  This is exemplified in robotic models of inverse kinematics, where a controller must learn to rotate the joints of a limb to get the limb’s end to a particular place in space.  Undoing a process may be difficult, however, when the space of possible plant outcomes is much larger than the number of knobs the controller can twist; in that case learning the forward model may give better performance.  The DIRECT model is an example of how the human brain can act as a limb controller by learning both the forward and inverse plant for a wide range of motions.  An interesting hypothesis put forward here is that ‘babbling’, a childlike exploration of all kinds of movements to gain experience, is essential for fine tuning the network in preparation for precise movement.  For high performance movements, Marr and Albus independently both suggested that the cerebellum is a forward kinematic model of human movement which could be used to apply small corrective signals to those coming straight from the cortex to the muscles.  All of these are examples of neural networks that already exist as controllers in biological bodies, and their applicability both to specific high-performance and robust general-purpose tasks has only increased with continued research.

Categories
Biophys-Ed, Compute Me, DARPA SyNAPSE
Tags
controller, learning, neuromorphic technology
Comments rss
Comments rss
Trackback
Trackback

« Plastic synapses in a stable brain Understanding the Competition »

Leave a Reply

Click here to cancel reply.

Jump to

About Neurdon
About SyNAPSE
Contact
Contributors
Editors
Glossary
Neurdon Merch

Tags

adaline adaptive resonance theory arm processor artificial intelligence auditory cat brain cochlear implant consciousness continous firing neurons controller cortical column DARPA DARPA SyNAPSE Dharmendra Modha events Excitatory Postsynaptic Potentials FACETS flash memory global workspace theory Greg Snider hearing HP HRL IBM Inhibitory Postsynaptic Potentials iSLC it Izhikevich law and robotics learning Leon Chua markram MATLAB MATLAB code Melanie-Mitchell memristor memristors Minsky modha modular robotics money Moore's Law Narayan Srinivasa neural engineering neural prosthesis neuromorphic technology NSF object recognition poggio rat brain rate-based models Ray Kurzweil riesenhuber robot robotics robotic weapons sensory fusion serre software SPICE model spike-based models spiking neurons Stanley Williams stdp super computer supercomputer synaptic plasticity time as supervisor

Blogroll

  • CELEST
  • CNS Tech Lab
rss Comments rss valid xhtml 1.1 design by jide powered by Wordpress get firefox