Design philosophy

Interactive versus scripted analysis

AVNI has some great interactive plotting abilities that can help you explore your data, and there are a few GUI-like interactive plotting commands (like browsing through the raw data and clicking to mark bad channels, or click-and-dragging to annotate bad temporal spans). But in general it is not possible to use AVNI to mouse-click your way to a finished, publishable analysis. AVNI works best when you assemble your analysis pipeline into one or more Python scripts. On the plus side, your scripts act as a record of everything you did in your analysis, making it easy to tweak your analysis later and/or share it with others (including your future self).

Integration with the scientific Python stack

AVNI also integrates well with other standard scientific Python libraries. For example, AVNI objects store their underlying data in NumPy arrays, making it easy to apply custom algorithms or pass your data into one of scikit-learn’s machine learning pipelines. AVNI’s 2-D plotting functions also return matplotlib Figure objects, so you can customize your AVNI plots using any of matplotlib’s or AVNI’s plotting commands. The intent is that AVNI will get most geoscientists 90% of the way to their desired analysis goal, and other packages can get them over the finish line.
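As a rough illustration of why NumPy-backed storage is convenient, the sketch below applies a custom reduction to a plain array. The array is a made-up stand-in for data taken from an AVNI object. Its shape and values, and the helper rms_per_depth, are assumptions for this example and not part of AVNI.

```python
import numpy as np

# Hypothetical stand-in for the array held by an AVNI object.
# The (n_depths, n_points) layout and the values are invented for illustration.
rng = np.random.default_rng(seed=0)
values = rng.normal(loc=0.0, scale=1.0, size=(4, 6))


def rms_per_depth(data: np.ndarray) -> np.ndarray:
    """Return the root-mean-square amplitude along the last axis."""
    return np.sqrt(np.mean(data ** 2, axis=-1))


rms = rms_per_depth(values)
print(rms.shape)  # (4,): one RMS value per row
```

Any function that accepts a NumPy array can be used the same way, which is what makes it straightforward to hand the data to other libraries.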

Submodule-based organization

A useful-to-know organizing principle is that AVNI objects and functions are separated into submodules. This can help you discover related functions if you’re using an editor that supports tab-completion. For example, you can type avni.tools.<TAB> to see all the functions in the tools submodule; similarly for model functions (avni.models), functions for reading and writing data (avni.io), mapping (avni.mapping), etc. This also helps save keystrokes — instead of:

import avni
avni.tools.bases.eval_vbspl(...)
avni.tools.bases.eval_splrem(...)

you can import submodules directly, and use just the submodule name to access its functions:

from avni.tools import bases
bases.eval_vbspl(...)
bases.eval_splrem(...)
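The two import styles refer to the same module object, so switching between them changes only how much you type. The sketch below shows this with the standard library's os.path rather than AVNI, so that it runs without AVNI installed. The file names are placeholders.

```python
import os.path
from os import path

# Both spellings call the same function on the same module object.
full = os.path.join("models", "example.txt")
short = path.join("models", "example.txt")

print(path is os.path)  # True
print(full == short)    # True
```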

(Mostly) unified API

Whenever possible, we’ve tried to provide a unified API for the different data classes. For example, the Reference1D and Model3D classes both have a plot() method that can typically be called with no parameters specified and still yield an informative plot of the data. Similarly, both provide methods such as copy() with similar or identical method signatures.

In-place operation

Because geoscience datasets can be quite large, AVNI tries very hard to avoid making unnecessary copies of your data behind the scenes. To further improve memory efficiency, many object methods operate in-place (and silently return their object to allow method chaining). In-place operation may lead you to use the copy() method frequently during interactive, exploratory analysis, so that you can try out different preprocessing approaches or parameter settings without having to re-load the data each time. It can also be a big memory-saver when applying a finished script to different models or datasets.
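The pattern of in-place methods that return their object can be sketched with a small toy class. Profile, scale() and shift() are hypothetical names invented for this example and are not AVNI classes or methods. Only the general idea (modify in place, return self, call copy() to protect the original) comes from the text above.

```python
import copy


class Profile:
    """Hypothetical container used only to illustrate the pattern."""

    def __init__(self, values):
        self.values = list(values)

    def scale(self, factor):
        # Modify the stored values in place, then return self for chaining.
        for i, v in enumerate(self.values):
            self.values[i] = v * factor
        return self

    def shift(self, offset):
        # Same pattern: in-place update, return self.
        for i, v in enumerate(self.values):
            self.values[i] = v + offset
        return self

    def copy(self):
        # Independent deep copy, so later edits do not touch the original.
        return copy.deepcopy(self)


original = Profile([1.0, 2.0, 3.0])

# Exploratory use: work on a copy so the original stays untouched.
trial = original.copy().scale(2.0).shift(1.0)
print(trial.values)     # [3.0, 5.0, 7.0]
print(original.values)  # [1.0, 2.0, 3.0]

# Scripted use: operate in place and avoid the extra copy.
same = original.scale(10.0)
print(same is original)  # True
```

Skipping copy() in the last step is what saves memory: no second set of values is ever allocated.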