Use the new algorithm to run the batch processing

Create the calculator

All the data processing is done by the Calculator. The input and output data are stored in its attributes.

from crystalmapping.utils import Calculator

calculator = Calculator()

Load the experiment data

In this example, we load the data from a databroker catalog. You can also use your own data source as long as it provides a DataArray of the exposure images; a minimal sketch of this is shown after the frames_arr assignment below.

from databroker import catalog

list(catalog)
['test_data_in_database',
 'analysis',
 'bt_safN_306132',
 'pdf',
 'saf_307381',
 'xpd']
db = catalog["xpd"]
UID = '257b5581-ca78-4309-9c50-b4d65d80152a'
run = db[UID]
run
BlueskyRun
  uid='257b5581-ca78-4309-9c50-b4d65d80152a'
  exit_status='success'
  2021-03-19 22:48:19.253 -- 2021-03-19 23:13:41.753
  Streams:
    * primary
data = run.primary.to_dask()
data
<xarray.Dataset>
Dimensions:              (dim_0: 1, dim_1: 3888, dim_2: 3072, time: 1001)
Coordinates:
  * time                 (time) float64 1.616e+09 1.616e+09 ... 1.616e+09
Dimensions without coordinates: dim_0, dim_1, dim_2
Data variables:
    dexela_stats1_total  (time) float64 dask.array<chunksize=(1,), meta=np.ndarray>
    dexela_image         (time, dim_0, dim_1, dim_2) float64 dask.array<chunksize=(1, 1, 3888, 3072), meta=np.ndarray>
    mPhi                 (time) float64 dask.array<chunksize=(1,), meta=np.ndarray>
    mPhi_user_setpoint   (time) float64 dask.array<chunksize=(1,), meta=np.ndarray>

Here, we assign the image data to the frames_arr attribute, taking every tenth frame to keep this example fast.

calculator.frames_arr = data["dexela_image"][::10]
calculator.frames_arr
<xarray.DataArray 'dexela_image' (time: 101, dim_0: 1, dim_1: 3888, dim_2: 3072)>
dask.array<getitem, shape=(101, 1, 3888, 3072), dtype=float64, chunksize=(1, 1, 3888, 3072), chunktype=numpy.ndarray>
Coordinates:
  * time     (time) float64 1.616e+09 1.616e+09 ... 1.616e+09 1.616e+09
Dimensions without coordinates: dim_0, dim_1, dim_2
Attributes:
    object:   dexela
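
If your images are not in a databroker catalog, you can build an equivalent DataArray yourself. Below is a minimal sketch, assuming the frames are TIFF files readable with tifffile; the file names and frame count are hypothetical.

import numpy as np
import xarray as xr
import tifffile  # assumed image reader; any reader returning 2D arrays works

# Stack the exposures into a (time, dim_0, dim_1, dim_2) array, matching the
# layout of the databroker data above (dim_0 is a singleton axis).
frames = np.stack([tifffile.imread(f"frame_{i:04d}.tiff") for i in range(101)])
calculator.frames_arr = xr.DataArray(
    frames[:, np.newaxis, :, :],
    dims=["time", "dim_0", "dim_1", "dim_2"],
)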

We also need the metadata of the grid scan, especially the shape of the grid. If it is not provided, the calculation can still be done, but the coordinates of the grain map will be unknown.

# get the start-document metadata of the run
metadata = dict(run.metadata["start"])
# Because I sliced the data, I need to update the metadata.
metadata["shape"] = [101]
metadata["extents"] = ([-0.5, 0.499],)
calculator.metadata = metadata
calculator.metadata
{'time': 1616208499.2537348,
 'uid': '257b5581-ca78-4309-9c50-b4d65d80152a',
 'versions': {'ophyd': '1.3.3', 'bluesky': '1.6.7'},
 'scan_id': 45,
 'proposal_id': '307690',
 'plan_type': 'generator',
 'plan_name': 'rel_grid_scan',
 'detectors': ['dexela'],
 'motors': ['mPhi'],
 'num_points': 1001,
 'num_intervals': 1000,
 'plan_args': {'detectors': ["XPDDDexelaDetector(prefix='XF:28IDD-ES:2{Det:DEX}', name='dexela', read_attrs=['stats1', 'stats1.total', 'tiff'], configuration_attrs=['cam', 'cam.acquire_period', 'cam.acquire_time', 'cam.image_mode', 'cam.trigger_mode', 'stats1', 'stats1.configuration_names', 'stats1.port_name', 'stats1.asyn_pipeline_config', 'stats1.blocking_callbacks', 'stats1.enable', 'stats1.nd_array_port', 'stats1.plugin_type', 'stats1.bgd_width', 'stats1.centroid_threshold', 'stats1.compute_centroid', 'stats1.compute_histogram', 'stats1.compute_profiles', 'stats1.compute_statistics', 'stats1.hist_max', 'stats1.hist_min', 'stats1.hist_size', 'stats1.profile_cursor', 'stats1.profile_size', 'stats1.ts_num_points', 'tiff', 'detector_type'])"],
  'args': ["EpicsMotor(prefix='XF:28IDD-ES:2{Stg:Stack-Ax:Phi}Mtr', name='mPhi', settle_time=0.0, timeout=None, read_attrs=['user_readback', 'user_setpoint'], configuration_attrs=['user_offset', 'user_offset_dir', 'velocity', 'acceleration', 'motor_egu'])",
   -0.5,
   0.5,
   1001],
  'per_step': 'None'},
 'hints': {'gridding': 'rectilinear', 'dimensions': [[['mPhi'], 'primary']]},
 'shape': [101],
 'extents': ([-0.5, 0.499],),
 'snaking': [False],
 'plan_pattern': 'outer_product',
 'plan_pattern_args': {'args': ["EpicsMotor(prefix='XF:28IDD-ES:2{Stg:Stack-Ax:Phi}Mtr', name='mPhi', settle_time=0.0, timeout=None, read_attrs=['user_readback', 'user_setpoint'], configuration_attrs=['user_offset', 'user_offset_dir', 'velocity', 'acceleration', 'motor_egu'])",
   -0.5,
   0.5,
   1001]},
 'plan_pattern_module': 'bluesky.plan_patterns',
 'task': 'a single point rocking curve',
 'sample': 'PARADIM-2',
 'beam': 'slit'}
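
For reference, in a two-dimensional grid scan the shape and extents carry one entry per motor. A hypothetical example (the values are illustrative, not from this run):

# hypothetical 11 x 21 grid over a 2 x 4 (motor units) area
metadata["shape"] = [11, 21]
metadata["extents"] = ([-1.0, 1.0], [-2.0, 2.0])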

We can also supply the experimental geometry so that the calculator can compute the Q values of the peaks. This step is optional.

from pyFAI.azimuthalIntegrator import AzimuthalIntegrator

calculator.ai = AzimuthalIntegrator(dist=200, wavelength=0.186, detector="dexela2923", poni1=1536, poni2=1944)
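
If the geometry has already been calibrated with pyFAI (for example, using pyFAI-calib2), you can load the resulting PONI file instead of constructing the integrator by hand. A sketch, with a hypothetical file name:

import pyFAI

# load a calibrated geometry from a PONI file (hypothetical path)
calculator.ai = pyFAI.load("geometry.poni")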

Process the data

The simplest way to use the calculator is the auto_process method. It takes three required parameters, whose meanings are described in the docstring.

help(calculator.auto_process)
Help on method auto_process in module crystalmapping.utils:

auto_process(num_wins: int, hw_wins: int, diameter: int, index_filter: slice = None, *args, **kwargs) -> None method of crystalmapping.utils.Calculator instance
    Automatically process the data in the standard protocol.

    Parameters
    ----------
    num_wins : int
        The number of windows.
    hw_wins : int
        The half width of the windows in pixels.
    diameter : int
        The diameter of the kernel to use in peak finding in pixels. It must be an odd integer.
    index_filter : slice
        The index slice of the data to use in the calculation of the dark and light image.
    args :
        The position arguments of the peak finding function trackpy.locate.
    kwargs :
        The keyword arguments of the peak finding function trackpy.locate.

    Returns
    -------
    None. The calculation results are saved in attributes.

Here, we process the data. The new algorithm makes two passes over the data, so there are two progress bars. The first shows the progress of the calculation of the light and dark images, and the second shows the progress of the calculation of the crystal maps.

calculator.auto_process(num_wins=4, hw_wins=25, diameter=41)
100%|██████████| 100/100 [00:37<00:00,  2.65it/s]
100%|██████████| 101/101 [00:28<00:00,  3.53it/s]
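
The optional parameters can be used to tune the processing: index_filter restricts which frames enter the dark and light image calculation, and extra keyword arguments are forwarded to trackpy.locate. A sketch with illustrative values:

# use only the first 50 frames for the dark/light images and require a
# minimum integrated brightness for peaks (trackpy.locate's minmass)
calculator.auto_process(num_wins=4, hw_wins=25, diameter=41,
                        index_filter=slice(0, 50), minmass=100)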

Visualize the data

All of the final, intermediate, and raw data can be visualized. The visualization methods start with “show”. Here are two examples.

First, we show the windows on the dark-subtracted light image.

calculator.show_windows(vmax=500, size=8);
[figure: windows overlaid on the dark-subtracted light image]

Then, we show the final rocking curve plot, which contains the one-dimensional crystal maps.

calculator.show_intensity();
[figure: rocking curves (one-dimensional crystal maps)]

Save the data

The data can be converted to an xarray Dataset, which can be saved in multiple formats.

ds = calculator.to_dataset()
ds
<xarray.Dataset>
Dimensions:    (dim_0: 101, grain: 4, pixel_x: 3072, pixel_y: 3888)
Coordinates:
  * grain      (grain) int64 3 2 1 0
  * dim_0      (dim_0) float64 -0.5 -0.49 -0.48 -0.47 ... 0.479 0.489 0.499
Dimensions without coordinates: pixel_x, pixel_y
Data variables:
    dark       (pixel_y, pixel_x) float64 300.0 303.0 300.0 ... 297.0 311.0
    light      (pixel_y, pixel_x) float64 339.0 339.0 336.0 ... 332.0 341.0
    intensity  (grain, dim_0) float64 65.69 83.61 92.62 ... 17.92 18.29 17.96
    y          (grain) int64 3809 3334 2712 1595
    dy         (grain) int64 25 25 25 25
    x          (grain) int64 200 1437 1890 109
    dx         (grain) int64 25 25 25 25
    Q          (grain) float64 4.38e-08 4.38e-08 4.38e-08 4.38e-08
Attributes: (12/22)
    time:                 1616208499.2537348
    uid:                  257b5581-ca78-4309-9c50-b4d65d80152a
    versions:             {'ophyd': '1.3.3', 'bluesky': '1.6.7'}
    scan_id:              45
    proposal_id:          307690
    plan_type:            generator
    ...                   ...
    plan_pattern:         outer_product
    plan_pattern_args:    {'args': ["EpicsMotor(prefix='XF:28IDD-ES:2{Stg:Sta...
    plan_pattern_module:  bluesky.plan_patterns
    task:                 a single point rocking curve
    sample:               PARADIM-2
    beam:                 slit

Here, we save it in NetCDF format. Before it is saved, the attrs need to be cleared, because the nested dictionaries in the metadata cannot be serialized to NetCDF.

ds.attrs = {}
ds.to_netcdf("data/example.nc")
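
Other xarray backends work the same way. For example, a sketch of saving to Zarr, assuming the zarr package is installed:

# Zarr stores the dataset as a chunked directory store
ds.to_zarr("data/example.zarr")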

Load the dataset and visualize it

The data can be loaded and visualized again after the data processing session is over.

import xarray as xr

ds = xr.load_dataset("data/example.nc")
calculator.auto_visualize(ds);
[figure: auto-visualization of the loaded dataset]