Data Abstraction#

The core data primitive in PyARPES is the xarray.DataArray. However, adding additional scientific functionality is needed since xarray provides only very general functionality. The approach that we take is described in some detail in the xarray documentation at extending xarray, which allows putting additional functionality on all arrays and datasets on particular, registered attributes.

In PyARPES we use a few of these:

.S attribute: functionality associated with spectra (physics here)
.G attribute: general abstract functionality that could reasonably be a part of xarray core
.F attribute: functionality associated with curve fitting

Caveat: In general these accessors can and do behave slightly differently between datasets and arrays, depending on what makes contextual sense.

This section will describe just some of the functionality provided by the .S attribute, while the following section will describe some of the functionality on .G and the section on curve fitting describes much of what is available through .F.

Much more can be learned about them by viewing the definitions in arpes.xarray_extensions.

Data selection#

`select_around` and `select_around_data`#

As an alternative to interpolating, you can integrate in a small rectangular or ellipsoidal region around a point using .S.select_around. You can also do this for a sequence of points using .S.select_around_data.

These functions can be run in either summing or averaging mode using either mode='sum' or mode='mean' respectively. Using the radius parameter you can specify the integration radius in pixels (int) or in unitful (float) values for all (pass a single value) or for specific (dict) axes.

select_around_data operates in the same way, except that instead of passing a single point, select_around_data expects a dictionary or Dataset mapping axis names to iterable collections of coordinates.

As a concrete example, let’s consider the example_data.temperature_dependence dataset with axes (eV, phi, T) consisting of cuts at different temperatures. Suppose we wish to obtain EDCs at the Fermi momentum for each value of the temperature.

First we will load the data, and combine datasets to get a full temperature dependence.

[1]:

import arpes
from arpes.io import example_data
from matplotlib import pyplot as plt

temp_dep = example_data.temperature_dependence
near_ef = temp_dep.sel(eV=slice(-0.05, 0.05), phi=slice(-0.2, None)).sum("eV").spectrum
near_ef.S.plot()

Activating auto-logging. Current session state plus future input saved.
Filename       : logs/unnamed_2026-03-24_23-28-50.log
Mode           : backup
Output logging : False
Raw input log  : False
Timestamping   : False
State          : active

../_images/notebooks_custom-dot-s-functionality_2_14.png

Exercises#

Change the phi range of the selection to see how the fit responds. Can we deal with the asymmetric background this way?
Inspect the first fit with phis.results[0].item(). What can you tell about the fit?
Select a region of the temperature dependent data away from the band. Perform a broadcast fit for the Fermi edge using arpes.fits.fit_models.AffineBroadenedFD. Does the edge position shift at all? Does the edge width change at all? Look at the previous exercise to determine which parameters to look at.

Data Abstraction#

Data selection#

`select_around` and `select_around_data`#

Finding \(\phi_F\)/\(k_F\)#

Argmax#

Curve fitting#

Exercises#

This Page

Data Abstraction#

Data selection#

select_around and select_around_data#

Finding \(\phi_F\)/\(k_F\)#

Argmax#

Curve fitting#

Exercises#

This Page

`select_around` and `select_around_data`#