Robot Environment Interaction
We need to define a few more terms that will be used throughout this notebook.
State
Environments are characterized by state. We will denote the state as $x$, and the state at time $t$ as $x_t$.
Environment Interaction
Perception is the process by which the robot uses its sensors to obtain information about the state of its environment. We denote the measurement data at time $t$ as $z_t$.
Control actions change the state of the world. Even if the robot does not perform any action itself, the state usually changes. For consistency, we will assume that the robot always executes a control action, even if it chooses not to move any of its motors. Control data carry information about the change of state in the environment. We will denote it as $u_t$. The variable $u_t$ will always correspond to the change of state in the time interval $(t-1, t]$.
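If a state is complete, then it is a sufficient summary of all that happened in the previous time steps. We can express this idea through conditional independence:
$$p(x_t \mid x_{0:t-1}, z_{1:t-1}, u_{1:t}) = p(x_t \mid x_{t-1}, u_t)$$
$$p(z_t \mid x_{0:t}, z_{1:t-1}, u_{1:t}) = p(z_t \mid x_t)$$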
In other words, if the state is complete, we can predict the current state $x_t$ from the previous state $x_{t-1}$ and the control action $u_t$ that influences it. Similarly, we can predict the measurement $z_t$ from the current state $x_t$ alone, because the measurement is merely a reflection of the current state. Knowledge of any other variables, such as past measurements, controls, or even past states, is irrelevant if $x_t$ is complete.
State Transition Probability
The state transition probability $p(x_t \mid x_{t-1}, u_t)$ specifies how the environment state evolves over time as a function of the robot controls $u_t$.
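For a small discrete world, the state transition probability can be written down as a lookup table. The sketch below assumes a hypothetical two-state environment (a door that is either `open` or `closed`) and two hypothetical controls (`push` and `nothing`); the state names, control names, and probability values are illustrative assumptions, not something defined in this notebook.

```python
# Hypothetical two-state world: a door that is either "open" or "closed".
# TRANSITION[u][x_prev][x_next] = p(x_t = x_next | x_{t-1} = x_prev, u_t = u)
# All numbers below are assumed for illustration.
TRANSITION = {
    "push": {
        "open":   {"open": 1.0, "closed": 0.0},
        "closed": {"open": 0.8, "closed": 0.2},
    },
    "nothing": {
        "open":   {"open": 1.0, "closed": 0.0},
        "closed": {"open": 0.0, "closed": 1.0},
    },
}


def transition_prob(x_next, x_prev, u):
    """Return p(x_t = x_next | x_{t-1} = x_prev, u_t = u)."""
    return TRANSITION[u][x_prev][x_next]


# For a fixed previous state and control, the table row is a probability
# distribution over the next state, so it must sum to 1.
for u, rows in TRANSITION.items():
    for x_prev, row in rows.items():
        assert abs(sum(row.values()) - 1.0) < 1e-12

p_open_after_push = transition_prob("open", "closed", "push")
```

Here `p_open_after_push` is the assumed probability that pushing a closed door results in an open door.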
Measurement Probability
The measurement probability $p(z_t \mid x_t)$ specifies the probabilistic law according to which measurements $z_t$ are generated from the environment state $x_t$.
The state at time $t$ is stochastically dependent on the state at time $t-1$ and the control $u_t$. The measurement $z_t$ depends stochastically on the state at time $t$. Such a temporal generative model is also known as a hidden Markov model (HMM).
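As one possible illustration of a measurement probability, the sketch below assumes a hypothetical one-dimensional range sensor whose reading equals the true position plus zero-mean Gaussian noise. The function name `measurement_pdf`, the noise level `sigma`, and the Gaussian form itself are assumptions made for this example only.

```python
import math


def measurement_pdf(z, x, sigma=0.1):
    """Density p(z_t = z | x_t = x) for an assumed Gaussian sensor model.

    z     : measured position
    x     : true position (the state)
    sigma : assumed standard deviation of the sensor noise
    """
    normalizer = 1.0 / (sigma * math.sqrt(2.0 * math.pi))
    return normalizer * math.exp(-0.5 * ((z - x) / sigma) ** 2)


# A reading close to the true state is more likely than a distant one.
likely = measurement_pdf(2.0, 2.0)
unlikely = measurement_pdf(2.3, 2.0)
```

Note that for a continuous measurement this is a density rather than a probability, so its value can exceed 1.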
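The generative model described above can be sketched as a short simulation: at each time step the next state is sampled from the state transition probability, and a measurement is then sampled from the measurement probability of that new state. The door world, the sensor readings `sense_open` and `sense_closed`, and all probability values are hypothetical choices for illustration.

```python
import random

# Assumed transition model: TRANSITION[u][x_prev][x_next] = p(x_t | x_{t-1}, u_t)
TRANSITION = {
    "push": {
        "open":   {"open": 1.0, "closed": 0.0},
        "closed": {"open": 0.8, "closed": 0.2},
    },
    "nothing": {
        "open":   {"open": 1.0, "closed": 0.0},
        "closed": {"open": 0.0, "closed": 1.0},
    },
}

# Assumed measurement model: MEASUREMENT[x][z] = p(z_t | x_t)
MEASUREMENT = {
    "open":   {"sense_open": 0.6, "sense_closed": 0.4},
    "closed": {"sense_open": 0.2, "sense_closed": 0.8},
}


def sample(dist, rng):
    """Draw one outcome from a {outcome: probability} dictionary."""
    outcomes = list(dist)
    weights = [dist[o] for o in outcomes]
    return rng.choices(outcomes, weights=weights, k=1)[0]


def simulate(x0, controls, rng):
    """Sample states x_1..x_T and measurements z_1..z_T given x_0 and u_1..u_T."""
    x = x0
    states, measurements = [], []
    for u in controls:
        x = sample(TRANSITION[u][x], rng)   # x_t depends on x_{t-1} and u_t
        z = sample(MEASUREMENT[x], rng)     # z_t depends only on x_t
        states.append(x)
        measurements.append(z)
    return states, measurements


rng = random.Random(0)
controls = ["nothing", "push", "push", "nothing"]
states, measurements = simulate("closed", controls, rng)
```

The robot only ever observes `measurements` and `controls`; the sequence `states` is hidden, which is what makes the model a hidden Markov model.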
Belief
A belief reflects the robot's internal knowledge about the state of the environment. A belief distribution assigns a probability to each possible hypothesis with regard to the true state. Belief distributions are posterior probabilities over state variables conditioned on the available data:
$$bel(x_t) = p(x_t \mid z_{1:t}, u_{1:t})$$
The belief $bel(x_t)$ incorporates the current measurement $z_t$. Sometimes we wish to make a prediction before incorporating $z_t$, so that it can be compared with the actual measurement. This prediction is denoted as follows:
$$\overline{bel}(x_t) = p(x_t \mid z_{1:t-1}, u_{1:t})$$
Calculating $bel(x_t)$ from $\overline{bel}(x_t)$ is called correction or measurement update.
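The prediction and correction steps can be sketched for a discrete state space. The formulas used here are the standard discrete Bayes filter equations, which are not spelled out in the text above: the prediction sums the transition probability over the previous belief, and the correction multiplies by the measurement probability and normalizes. The door world, the readings, and all probability values are hypothetical.

```python
# Assumed transition model: TRANSITION[u][x_prev][x_next] = p(x_t | x_{t-1}, u_t)
TRANSITION = {
    "push": {
        "open":   {"open": 1.0, "closed": 0.0},
        "closed": {"open": 0.8, "closed": 0.2},
    },
    "nothing": {
        "open":   {"open": 1.0, "closed": 0.0},
        "closed": {"open": 0.0, "closed": 1.0},
    },
}

# Assumed measurement model: MEASUREMENT[x][z] = p(z_t | x_t)
MEASUREMENT = {
    "open":   {"sense_open": 0.6, "sense_closed": 0.4},
    "closed": {"sense_open": 0.2, "sense_closed": 0.8},
}


def predict(bel, u):
    """Prediction: bel_bar(x_t) = sum_{x_{t-1}} p(x_t | x_{t-1}, u_t) bel(x_{t-1})."""
    return {
        x_next: sum(TRANSITION[u][x_prev][x_next] * bel[x_prev] for x_prev in bel)
        for x_next in bel
    }


def correct(bel_bar, z):
    """Correction: bel(x_t) = eta * p(z_t | x_t) * bel_bar(x_t)."""
    unnormalized = {x: MEASUREMENT[x][z] * p for x, p in bel_bar.items()}
    eta = 1.0 / sum(unnormalized.values())  # normalizer so the belief sums to 1
    return {x: eta * p for x, p in unnormalized.items()}


# Start from a uniform prior belief over the two states.
bel0 = {"open": 0.5, "closed": 0.5}

# Step 1: the robot does nothing, then senses an open door.
bel1 = correct(predict(bel0, "nothing"), "sense_open")

# Step 2: the robot pushes, then senses an open door again.
bel2 = correct(predict(bel1, "push"), "sense_open")
```

With these assumed numbers, the belief that the door is open rises after each step, since both the push and the repeated `sense_open` readings support that hypothesis.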