User:Thepigdog/Relational model of imperative programming
The relational model of imperative programming [1] [2] describes how time and state may be managed in relational programming, and how imperative programming may be represented within relational programming. The goals are,
- Represent of an imperative program within a relational program.
- Logical transformations of imperative programs.
- Time based interaction with other systems and services in relational programs.
This allows code refactoring of an imperative program into relational program while preserving its functionality.
A relational program must interact with its environment. To do this in a sound manner the program must be able to obtain the current time without pre-determining the execution order.
Introduction
[edit]Relational programming languages are based on the objective mathematical model where the world being described is separate from the method of description, and time is a dimension. Relational programs gain from this the freedom to be executed in any order. However this places some constraints on the interaction of a program with its environment.
The state of an imperative program can be turned into a mathematical object by parameterizing it with time. The state at a particular time is a well defined object with a single value. The current state may be obtained using the current time.
A relational program may be written which regards time as a dimension, provided it has access to the current time. The current time may be accessed from a list of times, where each time is current at the time it is read from the list.
Current time
[edit]Allowing a relational program access to the current time, without pre-determining the order of execution is difficult. The natural way to represent obtaining the current time is to ask a clock. The clock returns the time and, because it is stateless, returns the next clock. The clock implements a list of times.
However, as well as being inconvenient, passing the clock around unnecessarily pre-determines the order of execution. An example is when logging messages as a record of what the program as done. Log messages need to be tagged with the time, which requires access to the clock. But access to the clock requires that the next clock to be passed on to the next use, which pre-determines the order of execution. But in fact the requirement was to log the order of execution, not determine it.
Branching clocks solve this problem.
Working with legacy programs
[edit]Imperative programs are characterized by having implicit state. An imperative program describes a series of changes to the state, executed in a defined order.
Most programs are written in imperative programming languages. In order to successfully transform large imperative programs, a flexible system is needed that allows incremental changes to be made. Large scale changes to a big imperative program may cause bugs which are hard to track down and correct. Small changes may be introduced and any bugs found may be attributed to individual changes.
Roles
[edit]A role is an implicit parameter to a function, identified by a name, whose value is requested from its calling context. If the calling context, within another function definition, does not provide the value, then the role becomes a request of any call to the function. The request is passed up the calling hierarchy until the value is provided.
The purpose of roles is to provide an implicit mechanism for constructing parameter parsing through a hierarchy of multiple function calls without explicitly defining the actual or the formal parameter. The role requests the value and the parameter parsing to transport the value is constructed automatically by meta programming.
This has advantages,
- The calling sequence is constructed automatically, eliminating bugs caused by manual construction.
- The reader of the program is not distracted by parameters that are not needed locally in the function.
- A change to a program may introduce the need for a value in a function, without adding all the parameter passing needed to deliver the value.
Role pre-processor
[edit]A role pre-processor for a language processes source text and identifies two meta functions,
- provide - Identifies the value for later use.
- request - Requests the creation of all the parameter passing needed to get the value here.
The pre-processor implements a request by adding an extra formal parameter to a function for each value requested in it. The parameter will be used to pass in the requested value. Then for each function call, add the actual parameter and implicitly regard that value as requested in its containing function.
Recursively implement requests in that way until a provide meta function is found for the same name.
The pre-processor could obtain the type of the variable from the provided variable. Alternately the provide meta function could give the type.
Example
[edit]In this example, someParameter is provided in the main function, but is needed in MyDetailedLowLevelCalculation.
Making someParameter a global variable would make it harder to later parameterize any of the functions to be called with different values.
Another alternative would be to create a class as a container for all my methods. someParameter could be a member variable which is populated in the constructor. But in a more complex example these methods may already be across different classes. I may end up parsing the value between classes, to get it where I want it.
The most direct and flexible approach is to parse the value as a parameter, but as well as being distracting for the reader, and adding additional parameters to each call, the developer has to do more work, all of which may need to be removed later if the requirements change.
In this example a role-preprocessor adds the extra parameters. The text is show after the pre-processor has been run with added text surrounded by comments as in,
/* Role processor add */ someParameter, logger /* end add */
In the code ... represents code omitted for the sake of brevity. The example code is,
#include ...
#define provide(x, t) // Role processing directive does not do anything here
#define request(x)
class ObjectToBeProcessed;
void MyProcessing();
void MyProcessObject();
double MyDetailedLowLevelCalculation(double v);
void main(int argc, char **argv)
{
Logger logger;
double someParameter = atof(argv[0]);
...;
provide(someParameter, double); // I have the value someParameter, if anyone wants it.
provide(logger, Logger &); // Also I am providing the logger.
MyProcessing( /* Role processor add */ someParameter, logger /* end add */);
}
void MyProcessing( /* Role processor add */ double someParameter, Logger &logger /* end add */)
{
...
{
ObjectToBeProcessed *o = ...;
MyProcessObject(/* Role processor add */ someParameter, logger, /* end add */ o);
}
...
}
void MyProcessObject( /* Role processor add*/ double someParameter, Logger &logger /* end add */)
{
...
double myValue = ...;
double value = MyDetailedLowLevelCalculation( /* Role processor add */ someParameter, /* end add */ myValue);
// Add logging here, for debugging purposes. I might take it out later.
// The logger would normally be a global variable, but again this reduces flexibility.
request(logger);
logger << "MyProcessObject: result = " << to_string(value);
...
}
double MyDetailedLowLevelCalculation( /* Role processor add */ double someParameter, /* end add */ double v)
{
// Oops, now I need some parameter.
request(someParameter); // Please construct all the parameter passing to get the value here.
// See the code added by the role processor.
return v * someParameter; // Some calculation
}
A role pre-processor would be hard to implement in a language without meta processing functionality, because the pre-processor would need to parse and understand at least some of the syntax of the language. Identifying the type of the variable would be challenging.
However a role pre-processor may be written as a few rules in a language supporting meta-programming.
Roles in lambda calculus
[edit]Named lambda expression is producer.
Request is
Axioms
Alpha rename applies to and the named .
The following example demonstrates a request. Here the request gets the value w from .
- - beta reduce
Notes.
- Reduction all ways reduce to a single value. Proof?
- However the same lamda term can represent different values.
Branching clocks
[edit]A branching clock allows 2 clocks to be obtained from a clock. The two clocks are called next and branch.
When the time is required in a function call, a clock is branched off while the main sequence of clocks is linked up to the next function.
Stability of the branching clock
[edit]In the mathematical model every call to a function with the same parameters must return the same value. So,
must always return the same result. To achieve this each path in the tree is identified by a bit string, with 0 for next and 1 for branch. When a time is given the time is recorded in a table against the path. When a time is requested again the time recorded against the path is given.
The time table can then be recorded to allow the re-running of the program with the same clock. This allows the reproduction of the exact functionality, which is useful for testing and debugging.
The time request
[edit]State and imperative programming
[edit]On the surface imperative programs are very different from functional programs. However monads provide a way of introducing imperative commands into a functional language, by hiding the state in the monad object.
But monads do not give a completely natural way of representing imperative programming. They do not provide a general model for translating an imperative program into a functional language.
Transformations on imperative programs
[edit]Mathematical expressions may be transformed using mathematical rules to give equivalent expressions. The same ability is needed for imperative programs. A mathematical model for an imperative program allows a change in the execution order of the program, without changing the overall effect of the program.
The goal is to integrate imperative statements into mathematics, so that an imperative program may be represented with minimal translation as mathematics.
A function definition in an imperative language is translated into a function definition in mathematics. A program is represented by a conjunction of function definitions.
The mathematics may then be transformed in various ways, using rules that guarantee no change in the meaning of the program.
State holder
[edit]Using the relational evaluation model, a simple model of state transfer may be given.[1] A statement in an imperative program has,
- Input state i
- Input parameters p
- return value x
- Output state o
It is natural in a functional to consider the input state as another parameter. But using the relational evaluation model this is not necessary. the relation model allows the output to be used in determining the input. For example in,
x is determined to be 3, when it is only input to the function.
A state holder is represented by a .
- s - state identifier.
- i - input.
- x - value.
- o - output.
The state identifier will be used to break up monolith state into into individual state variables. Monolithic state is represented by a single program state including all variables.
Because a state holder takes its input from its output, it does not behave in the same way as other mathematical objects. But with its own set of laws it may be included in a mathematical framework.
The advantage of using a state holder is that state may be transferred through any parameter, providing a full model of imperative programming. The state transfer is completely hidden, and there is a direct translation between imperative programs and state holder programs. State only needs to be transferred where needed.
In comparison monads provide only a linear transfer through primary parameters, but not through every parameter.
A disadvantage is that translating an imperative program to a state holder form is that the type system must be used to determine the rules to apply to state holders. Normal mathematical rules such as the definition of equality and the rule of substitution must be modified for state holders.
State identifiers
[edit]The usual approach to modelling imperative programs is to consider them to have a single state. State variables access this state to retrieve the value of the variable in that state. However this approach specifies a single execution order on the program. For example,
Reversal of the execution order does not effect the meaning of the program. This is obvious if x and y are modeled separately as two different state variables, but not obvious if there is a single monolithic state. The monolithic state is modified and accessed in both the statements, therefore the must be executed before .
However if x and y are two separate states then and are independent statements that each access there own state variable and the order of execution does not effect the result.
The way in which states interact is defined using the following rules,
State holder rules
[edit]The use of state holders depends on the use of type signatures to determine what rules to apply. Different rules apply to expressions of types that are state holders. The rules to apply depend on the signatures of the functions and operators. For simplicity two basic type classifications are made,
- Non state holder - n
- State holder - s
For example the type signature,
indicates that the equality operator takes two state holders as parameters, and returns a state holder.
Equality and function definition
[edit]Equality is defined by different rules depending on its type signature.
Signature | Rule |
---|---|
Equality defined by, or where | |
|
Normal mathematical definition of equality, which implies the substitution rule, |
In imperative languages there will be function definitions which are equivalent to mathematical equality, that returns a Boolean. Calls to the function are replaced with the body of the function with formal parameters replaced by actual parameters.
Substitution of parameters
[edit]Substitution of formal for actual parameters is allowed if the types match;
- state holder formal parameter with state holder actual parameter.
- non state holder formal parameter with non state holder actual parameter.
Substitution of lambda expressions follows the same rule. The type of the lambda variable must match the type of the expression being substituted.
In an imperative language, function parameters are state holders. Older languages like C++ do not allow a state holder to be directly used as a parameter. Only a named function pointer may be used for a state holder. More recent languages like Java and Javascript allow anonymous functions, which may be represented as state holders.
Assignment statement
[edit]The definition of the assignment statement requires the use of the use–mention distinction. In the following definitions is the use the mention .
A C style assignment operator may be defined using monolithic state as.
Using state identifiers, the assignment statement may be defined as,
Use-mention distinction
[edit]The use–mention distinction may be illustrated by the statement,
- Jane said "Your hair looks scruffy.".
In this statement I have mentioned what Jane said. I can turn the mention into a use by the statement.
- I agree with Jane.
Then this is the same as me saying,
- Your hair looks scruffy.
Any expression denotes (is a name for) a value. A statement as a usage and denotes one of the values true or false. A statement as a mention has as its value the statement itself, not yet evaluated.
Meta programming uses the to mark a statement as a mention. is a mention of the variable x. It is a piece of code that has not yet been evaluated. evaluates the meta code to give the use.
Note that a use cannot be turned into a mention, because a use already denotes a value.
For example in the statement,
The definition of this as a state holder is
where is a mention of the variable z. So,
Then letting
In the following definitions,
- is a meta variable.
- is the use of the variable denoted by the meta variable x.
A C style assignment operator may be defined using monolithic state as.
Using state identifiers, the assignment statement may be defined as,
Value lookup
[edit]Using a monolithic state, for a state s,
where is the expression y with free occurrences of x replaced by i[x]. Free and bound variables are concepts from lambda calculus. A lambda abstraction binds it's variable. Also a state holder binds it's state identifier.
For a state variable x,
where is the expression y with free occurrences of x replaced by i.
Where there is condition evaluation of parameters (if, and, or) an extra rule is required,
where s may be a state holder. Using Currying this rule can be shown to imply,
for example,
Statement composition
[edit]The semicolon ; is used in many languages to separate statements, which are executed sequentially. Here it is regarded as an operator.
State import rule,
Rules for the composition of polylithic states,
Function application
[edit]Function application of state holders are defined by the following rules,
Using the state identifier rules, the last rule implies,
State import rule,
State transfer rule,
Definition of ,
Lambda abstraction
[edit]Lambda abstraction of state holders is defined by,
Conditional evaluation and execution control
[edit]Conditional evaluation is a form of function application where some parameters are ignored if the value is not needed. Where the value being omitted is a state holder, the transfer of state through the parameter is omitted in some languages, giving control of the execution path.
In this case these functions are treated differently and the function application rules are not applied.
If statement
[edit]An if statement may be regarded as a function, with 3 parameters.
- Condition
- True case
- False case
The condition function may be state-full or stateless. Stateless is defined by,
where a and b may be state holders, or other types.
State import rule. This rule is implied by the state import rule defined in function application.
State transfer rule. This rule overrides general state transfer rule in function application.
Statement append rule.
And expression
[edit]In some imperative languages such as C the second condition is only executed if required to compute the result. This is defined by,
Or expression
[edit]Similarly for an or exression || in C the second condition is only executed if required to compute the result. This is defined by,
Loops
[edit]While loops and other control structures may be implemented as functions that take state holders as parameters. The while statement is defined by,
If b and c are not a state holders this statement either does nothing or expands recursively forever. If b or c are state holders, the value returned by c does not need to be the same each time, even though the expression is the same.
Local state variables
[edit]Return values
[edit]The natural structure is that the body of the function is an expression. The return value of the function is the value of the expression. The function call is then equal to the body of the function, with formal parameters replaced by actual parameters. The return value may be stateless, or it may be a state holder.
However, many imperative languages like C ignore the value returned from the expression and rely on the return statement to capture a value from a state. An extra structure called the return frame is needed to implement this structure.
Return frame
[edit]The return frame accepts the return value. The value may be stateless, or a state holder.
provided s does not contain any return statements.
Return statement
[edit]The return statement is defined by,
The return statement does not execute statements after the return, other than destroy.
if u is a single statement, other than a destroy statement.
Pointers arrays and references
[edit]Examples
[edit]Factorial function
[edit]The factorial function may be defined using a while loop by,
long fact(long n)
{
long r = 1;
while (0 < n)
{
r = r * n;
n = n - 1;
}
return r;
}
A direct translation of this imperative code to relational logic using state holders is,
Remove return frame
[edit]
- .......... See local state variables
- .......... See return frame
- .......... See return frame
- .......... See return frame
Remove assignment statements
[edit]
- .......... See assignment statement
- .......... See statement composition
Expand while loop
[edit]
- .......... See loops.
Simplify if statement
[edit]
- .......... See if statement
- .......... See if statement
- .......... See statement composition ????
- .......... See if statement
- .......... See statement composition
Destroy values
[edit]
- ......... See local state variables
Consider the sub expression,
The initial value for r is which is a multiplied by other factors to get the result. So if this value is divided by and the whole expression is multiplied by , the result will be the same,
Earlier at the end of removing assignment statements,
so the sub-expression is equal to,
and,
Comparison to other representations
[edit]The state holder may be applied to function application where the parameters are state holders. Other approaches such as monads implement state under composition of statements, but not application. This limits there ability to fully represent imperative programs, and the ability to integrate imperative programming into mathematics.
See also
[edit]References
[edit]- ^ a b Mccarthy, J. (1962). "Towards a Mathematical Science of Computation". In IFIP Congress. Cite error: The named reference "relstate" was defined multiple times with different content (see the help page).
- ^ Berghammer, Rudolf; Moller, Bernhard; Struth, Georg (7 April 2008). Relations and Kleene Algebra in Computer Science: 10th International Conference on Relational Methods in Computer Science, and 5th International Conference on Applications of Kleene Algebra, RelMiCS/AKA 2008, Frauenwörth, Germany. Springer. ISBN 978-3-642-04639-1.