Natural Modeling Process
This page is a work in progress.
“All men by nature desire to know.”
Studies on Mathematical Modeling of Modeling of Modeling
There may lie ahead a possibility to abstract and generalize the essence of transformer architectures and diffusion models to fundamental mathematical structures, where they would be amenable to behaviors aking to analytical mechanics, but for learning, knowledge, and understanding – and also for design, automation, and control.
Minimally understanding data
leads to an objective function of sorts as an action principle to be extremalized
Dagger
Studying the gradients of the simplest version of the above (
reveals some algebraically (and thus also statistically) stable attractors present in this minimal adaptive system.
The stationary points (
where
Note that the stationary points (with matrix inverses) do no need to be calculated explicitly – the system can be running, learning, and adapting all the time, as the relevant stationary point is solved implicitly by the distributed dynamics. Compare the formulas also to the Lagrange dual method.
The above covariance matrix structures resembling quadratically optimal principal component regression (see also control theory, optimal control, linear system, and LTI system) result intrinsically using
Using time constant
It is also interesting to consider and study gradients of exponentials, logarithms, traces, and determinants, as for complex valued square matrices, they are related,
where
Even more ambitiously, studying the exponentials and their derivatives in relation to Lie groups and Lie algebras could lead to succinct formulations for
I have found the above structures quite interesting, being minimal examples of system-oriented, meaning-forming/semiotic, functional, and distributed dynamical processes that learn to survive (i.e. keep their sanity in their ecological niches) and even flourish by their very nature, and have experimented with some simulated visualizations of the organic or “lifelike” behaviors present – similar to these recordings of mine from about 15 years ago, where the columns of the model matrix
These kind of structures have been studied a decade or two ago by prof. Hyötyniemi (link in Finnish), who in recent years has been convinced that the grand time should be analyzed in the frequency domain to make progress in understanding various systems relevant to “all men” that “by nature” “desire to know” – that may turn out to have deep cybernetic roots and reasons for their inquiries and aspirations in the first place.
Motivational thoughts on modeling (62 pages, early 2023) is also available.