Using diffpy.morph in Python
On top of the command-line (CLI) usage described in the quickstart tutorial,
diffpy.morph
also supports Python integration.
All functionality supported on the CLI is also available for Python.
This page is intended for those acquainted with the basic morphs
described in the aforementioned quickstart tutorial who want to use diffpy.morph
in their
Python scripts.
For those looking to use the Python-specific morph MorphFuncxy
(described below)
with commonly used diffraction software like PDFgetx3
and PyFai are directed to the
funcxy tutorials.
Python Morphing Functions
In the quickstart tutorial, you were asked to try a combined scale, stretch, and smear morph on the files darkSub_rh20_C_01.gr and darkSub_rh20_C_44.gr using the command-line command
diffpy.morph --scale=0.8 --smear=-0.08 --stretch=0.005 --rmin=1.5 --rmax=30 darkSub_rh20_C_01.gr darkSub_rh20_C_44.grTo do the same on Python, we must first create a new Python script in the same directory as the data files darkSub_rh20_C_01.gr and darkSub_rh20_C_44.gr.
Then, in that script, import
from diffpy.morph.morphpy import morph
Finally, we run the
morph
functionmorph_info, morph_table = morph("darkSub_rh20_C_01.gr", "darkSub_rh20_C_44.gr", scale=0.8, smear=-0.08, stretch=0.005, rmin=1.5, rmax=30)
The
morph
function takes in two file names (or paths). You can also provide various parameters for morphing (see the Full Parameter List below).If, let’s say, the file darkSub_rh20_C_01.gr is in a subdirectory subdir/darkSub_rh20_C_01.gr, you should replace
"darkSub_rh20_C_01.gr"
in the above example with"subdir/darkSub_rh20_C_01.gr"
.The
morph
function returns a dictionarymorph_info
and a numpy arraymorph_table
.
morph_info
contains all morphs as keys (e.g."scale"
,"stretch"
,"smear"
) with the optimized morphing parameters found bydiffpy.morph
as values.morph_info
also contains the Rw and Pearson correlation coefficients found post-morphing. Try printingprint(morph_info)
and compare the values stored in this dictionary to those given by the CLI output!
morph_table
is a two-column array of the morphed function interpolated onto the grid of the target function (e.g. in our example, it returns the contents of darkSub_rh20_C_01.gr after the morphs are applied interpolated onto the grid of darkSub_rh20_C_44.gr).Notice that most parameters you are able to use are the same as the options provided in the command-line interface version of
diffpy.morph
. For example, the--apply
option becomes theapply=True
parameter.With that, you have already mastered the basics of using
diffpy.morph
on Python!Note that instead of passing two files to
diffpy.morph
, you might instead want to directly pass arrays. For example, rather than passing darkSub_rh20_C_01.gr, I may want to pass a two-column array namedds_rh20_c_01_array
containing the data table contents of the file darkSub_rh20_C_01.gr. In this case, we have a separate functionfrom diffpy.morph.morphpy import morph_arraysAssuming we have loaded the data in darkSub_rh20_C_01.gr into
ds_rh20_c_01_array
and darkSub_rh20_C_44.gr intods_rh20_c_44_array
, we can apply the same morph as step 3 by runningmorph_info, morph_table = morph_arrays(ds_rh20_c_01_array, ds_rh20_c_44_array, scale=0.8, smear=-0.08, stretch=0.5, rmin=1.5, rmax=30)Notice that the two-column format of the input to
morph_arrays
is the same as the output ofmorph
andmorph_arrays
. It is VERY IMPORTANT that the data is in two-column format rather than the traditional two-row format. This is to reflect the file formats conventionally used to store PDFs. Again, try printingprint(morph_info)
and compare!For a full list of parameters used by (both)
morph
andmorph_arrays
, see the Full Parameter List section below.
Full Parameter List
General Parameters
- save: str or path
Save the morphed function to a the file passed to save. Use ‘-’ for stdout.
- get_diff: bool
Return the difference function (morphed function minus target function) instead of the morphed function (default). When save is enabled, the difference function is saved instead of the morphed function.
- verbose: bool
Print additional header details to saved files. These include details about the morph inputs and outputs.
- rmin: float
Minimum r-value (abscissa) to use for function comparisons.
- rmax: float
Maximum r-value (abscissa) to use for function comparisons.
- tolerance: float
Specify least squares refiner tolerance when optimizing for morph parameters. Default: 10e-8.
- pearson: bool
The refiner instead maximizes agreement in the Pearson function (default behavior is to minimize the residual). Note that this is insensitive to scale.
- addpearson: bool
Maximize agreement in the Pearson function as well as minimizing the residual.
Manipulations
These parameters select the manipulations that are to be applied to the function. The passed values will be refined unless specifically excluded with the apply or exclude parameters.
- apply: bool
Apply morphs but do not refine.
- exclude: list of str
Exclude a manipulations from refinement by name (e.g. exclude=[“scale”, “stretch”] excludes the scale and stretch morphs).
- scale: float
Apply scale factor.
This multiplies the function ordinate by scale.
- stretch: float
Stretch function grid by a fraction stretch.
This multiplies the function grid by 1+stretch.
- squeeze: list of float
Squeeze function grid given a polynomial p(x) = squeeze[0]+squeeze[1]*x+…+squeeze[n]*x^n.
n is dependent on the number of values in the user-inputted comma-separated list. The morph transforms the function grid from x to x+p(x). When this parameter is given, hshift is disabled. When n>1, stretch is disabled.
- smear: float
Smear the peaks with a Gaussian of width smear.
This is done by convolving the function with a Gaussian with standard deviation smear. If both smear and smear_pdf are used, only smear_pdf will be applied.
- smear_pdf: float
Convert PDF to RDF. Then, smear peaks with a Gaussian of width smear_pdf. Convert back to PDF. If both smear and smear_pdf are used, only smear_pdf will be applied.
- slope: float
Slope of the baseline used in converting from PDF to RDF.
This is used with the option smear_pdf. The slope will be estimated if not provided.
- hshift: float
Shift the function horizontally by hshift to the right.
- vshift: float
Shift the function vertically by vshift upward.
- qdamp: float
Dampen PDF by a factor qdamp.
- radius: float
Apply characteristic function of sphere with radius given by parameter radius.
If pradius is also specified, instead apply characteristic function of spheroid with equatorial radius radius and polar radius pradius.
- pradius: float
Apply characteristic function of spheroid with equatorial radius given by above parameter radius and polar radius pradius.
If only pradius is specified, instead apply characteristic function of sphere with radius pradius.
- iradius: float
Apply inverse characteristic function of sphere with radius iradius.
If ipradius is also specified, instead apply inverse characteristic function of spheroid with equatorial radius iradius and polar radius ipradius.
- ipradius: float
Apply inverse characteristic function of spheroid with equatorial radius iradius and polar radius ipradius.
If only ipradius is specified, instead apply inverse characteristic function of sphere with radius ipradius.
- funcy: tuple (function, dict)
Apply a function to the y-axis of the (two-column) data.
This morph applies the function funcy[0] with parameters given in funcy[1]. The function funcy[0] take in as parameters both the abscissa and ordinate (i.e. take in at least two inputs with as many additional parameters as needed). The y-axis values of the data are then replaced by the return value of funcy[0].
For example, let’s start with a two-column table with abscissa x and ordinate y. let us say we want to apply the function
def linear(x, y, a, b, c): return a * x + b * y + c
This example function above takes in both the abscissa and ordinate on top of three additional parameters a, b, and c. To use the funcy parameter with parameter values a=1.0, b=2.0, and c=3.0, we would pass
funcy=(linear, {"a": 1.0, "b": 2.0, "c": 3.0})
. For an explicit example, see the Python-Specific Morphs section below.- funcx: tuple (function, dict)
Apply a function to the x-axis of the (two-column) data.
This morph works fundamentally differently from the other grid morphs (e.g. stretch and squeeze) as it directly modifies the grid of the morph function. The other morphs maintain the original grid and apply the morphs by interpolating the function ***.
This morph applies the function funcx[0] with parameters given in funcx[1]. The function funcx[0] take in as parameters both the abscissa and ordinate (i.e. take in at least two inputs with as many additional parameters as needed). The x-axis values of the data are then replaced by the return value of funcx[0]. Note that diffpy.morph requires the x-axis be monotonic increasing (i.e. for i < j, x[i] < x[j]): as such, if funcx[0] is not a monotonic increasing function of the provided x-axis data, the error
x must be a strictly increasing sequence
will be thrown.For example, let’s start with a two-column table with abscissa x and ordinate y. let us say we want to apply the function
def exponential(x, y, amp, decay): return abs(amp) * (1 - 2**(-decay * x))
This example function above takes in both the abscissa and ordinate on top of three additional parameters amp and decay. (Even though the ordinate is not used in the function, it is still required that the function take in both acscissa and ordinate.) To use the funcx parameter with parameter values amp=1.0 and decay=2.0, we would pass
funcx=(exponential, {"amp": 1.0, "decay:: 2.0})
. For an explicit example, see the Python-Specific Morphs section below.- funcxy: tuple (function, dict)
Apply a function the (two-column) data.
This morph applies the function funcxy[0] with parameters given in funcxy[1]. The function funcxy[0] take in as parameters both the abscissa and ordinate (i.e. take in at least two inputs with as many additional parameters as needed). The two columns of the data are then replaced by the two return values of funcxy[0].
For example, let’s start with a two-column table with abscissa x and ordinate y. let us say we want to apply the function
def shift(x, y, hshift, vshift): return x + hshift, y + vshift
This example function above takes in both the abscissa and ordinate on top of two additional parameters hshift and vshift. To use the funcy parameter with parameter values hshift=1.0 and vshift=2.0, we would pass
funcy=(shift, {"hshift": 1.0, "vshift": 1.0})
. For an example use-case, see the Python-Specific Morphs section below.
Python-Specific Morphs
Some morphs in diffpy.morph
are supported only in Python. Here, we detail
how they are used and how to call them.
MorphFunc: Applying custom functions
In these tutorial, we walk through how to use the MorphFunc
morphs
(MorphFuncy
, MorphFuncx
, MorphFuncxy
)
with some example transformations.
Unlike other morphs that can be run from the command line,
MorphFunc
moprhs require a Python function and is therefore
intended to be used through Python scripting.
MorphFuncy:
The MorphFuncy
morph allows users to apply a custom Python function
to the y-axis values of a dataset, enabling flexible and user-defined
transformations.
Let’s try out this morph!
Import the necessary modules into your Python script:
from diffpy.morph.morphpy import morph_arrays import numpy as npDefine a custom Python function to apply a transformation to the data. The function must take
x
andy
(1D arrays of the same length) along with named parameters, and return a transformedy
array of the same length. For this example, we will use a simple linear transformation that scales the input and applies an offset:def linear_function(x, y, scale, offset): return (scale * x) * y + offsetIn this example, we use a sine function for the morph data and generate the target data by applying the linear transformation with known scale and offset to it:
x_morph = np.linspace(0, 10, 101) y_morph = np.sin(x_morph) x_target = x_morph.copy() y_target = np.sin(x_target) * 20 * x_target + 0.8Setup and run the morph using the
morph_arrays(...)
.morph_arrays
expects the morph and target data as 2D arrays in two-column format[[x0, y0], [x1, y1], ...]
. This will apply the user-defined function and refine the parameters to best align the morph data with the target data. This includes both the transformation parameters (our initial guess) and the transformation function itself:morph_params, morph_table = morph_arrays(np.array([x_morph, y_morph]).T, np.array([x_target, y_target]).T, funcy=(linear_function,{'scale': 1.2, 'offset': 0.1}))Extract the fitted parameters from the result:
fitted_params = morph_params["funcy"] print(f"Fitted scale: {fitted_params['scale']}") print(f"Fitted offset: {fitted_params['offset']}")
As you can see, the fitted scale and offset values match the ones used
to generate the target (scale=20 & offset=0.8). This example shows how
MorphFuncy
can be used to fit and apply custom transformations. Now
it’s your turn to experiment with other custom functions that may be useful
for analyzing your data.
MorphFuncx:
The MorphFuncx
morph allows users to apply a custom Python function
to the x-axis values of a dataset, similar to the MorphFuncy
morph.
One caveat to this morph is that the x-axis values must remain monotonic
increasing, so it is possible to run into errors when applying this morph.
For example, if your initial grid is [-1, 0, 1]
, and your function is
lambda x, y: x**2
, the grid after the function is applied will be
[1, 0, 1]
, which is no longer monotonic increasing.
In this case, the error x must be a strictly increasing sequence
will be thrown.
Let’s try out this morph!
Import the necessary modules into your Python script:
from diffpy.morph.morphpy import morph_arrays import numpy as npDefine a custom Python function to apply a transformation to the data. The function must take
x
andy
(1D arrays of the same length) along with named parameters, and return a transformedx
array of the same length. Recall that this function must maintain the monotonic increasing nature of thex
array.For this example, we will use a simple exponential function transformation that greatly modifies the input:
def exp_function(x, y, scale, rate): return np.abs(scale) * np.exp(np.abs(rate) * x)Notice that, though the function only uses the
x
input, the function signature takes in bothx
andy
.Like in the previous example, we will use a sine function for the morph data and generate the target data by applying the decay transfomration with a known scale and rate:
x_morph = np.linspace(0, 10, 1001) y_morph = np.sin(x_morph) x_target = x_target = 20 * np.exp(0.8 * x_morph) y_target = y_morph.copy()Setup and run the morph using the
morph_arrays(...)
.morph_arrays
expects the morph and target data as 2D arrays in two-column format[[x0, y0], [x1, y1], ...]
. This will apply the user-defined function and refine the parameters to best align the morph data with the target data. This includes both the transformation parameters (our initial guess) and the transformation function itself:morph_params, morph_table = morph_arrays(np.array([x_morph, y_morph]).T, np.array([x_target, y_target]).T, funcx=(decay_function, {'scale': 1.2, 'rate': 1.0}))Extract the fitted parameters from the result:
fitted_params = morph_params["funcx"] print(f"Fitted scale: {fitted_params['scale']}") print(f"Fitted rate: {fitted_params['rate']}")
Again, we should see that the fitted scale and offset values match the ones used to generate the target (scale=20 & rate=0.8).
For fun, you can plot the original function to the morphed function to see how much the
MorphFuncxy:
The MorphFuncxy
morph allows users to apply a custom Python function
to a dataset that modifies both the x
and y
column values.
This is equivalent to applying a MorphFuncx
and MorphFuncy
simultaneously.
This morph is useful when you want to apply operations that modify both
the grid and function value.
Examples of using MorphFuncxy
with PyFai
azimuthal integration
and PDFgetx3
PDF calculation are included here.
For this tutorial, we will go through two examples. One simple one
involving shifting a function in the x
and y
directions, and
another involving a Fourier transform.
Let’s start by taking a simple
sine
function.import numpy as np morph_x = np.linspace(0, 10, 101) morph_y = np.sin(morph_x) morph_table = np.array([morph_x, morph_y]).TThen, let our target function be that same
sine
function shifted to the right by0.3
and up by0.7
.target_x = morph_x + 0.3 target_y = morph_y + 0.7 target_table = np.array([target_x, target_y]).TWhile we could use the
hshift
andvshift
morphs, this would require us to refine over two separate morph operations. We can instead perform these morphs simultaneously by defining a function:def shift(x, y, hshift, vshift): return x + hshift, y + vshiftNow, let’s try finding the optimal shift parameters using the
MorphFuncxy
morph. We can try an initial guess ofhshift=0.0
andvshift=0.0
.from diffpy.morph.morphpy import morph_arrays initial_guesses = {"hshift": 0.0, "vshift": 0.0} info, table = morph_arrays(morph_table, target_table, funcxy=(shift, initial_guesses))Finally, to see the refined
hshift
andvshift
parameters, we extract them frominfo
.print(f"Refined hshift: {info["funcxy"]["hshift"]}") print(f"Refined vshift: {info["funcxy"]["vshift"]}")
Now for an example involving a Fourier transform.
Let’s say you measured a signal of the form \(f(x)=\exp\{\cos(\pi x)\}\). Unfortunately, your measurement was taken against a noisy sinusoidal background of the form \(n(x)=A\sin(Bx)\), where
A
,B
are unknown. For our example, let’s say (unknown to us) thatA=2
andB=1.7
.import numpy as np n = 201 dx = 0.01 measured_x = np.linspace(0, 2, n) def signal(x): return np.exp(np.cos(np.pi * x)) def noise(x, A, B): return A * np.sin(B * x) measured_f = signal(measured_x) + noise(measured_x, 2, 1.7) morph_table = np.array([measured_x, measured_f]).TYour colleague remembers they previously computed the Fourier transform of the function and has sent that to you.
# We only consider the region where the grid is positive for simplicity target_x = np.fft.fftfreq(n, dx)[:n//2] target_f = np.real(np.fft.fft(signal(measured_x))[:n//2]) target_table = np.array([target_x, target_f]).TWe can now write a noise subtraction function that takes in our measured signal and guesses for parameters
A
,B
, and computes the Fourier transform post-noise-subtraction.def noise_subtracted_ft(x, y, A, B): n = 201 dx = 0.01 background_subtracted_y = y - noise(x, A, B) ft_x = np.fft.fftfreq(n, dx)[:n//2] ft_f = np.real(np.fft.fft(background_subtracted_y)[:n//2]) return ft_x, ft_fFinally, we can provide initial guesses of
A=0
andB=1
to theMorphFuncxy
morph and see what refined values we get.from diffpy.morph.morphpy import morph_arrays initial_guesses = {"A": 0, "B": 1} info, table = morph_arrays(morph_table, target_table, funcxy=(background_subtracted_ft, initial_guesses))Print these values to see if they match with the true values of of
A=2.0
andB=1.7
!print(f"Refined A: {info["funcxy"]["A"]}") print(f"Refined B: {info["funcxy"]["B"]}")
You can also use this morph to help find optimal parameters
(e.g. rpoly
, qmin
, qmax
, bgscale
) for computing
PDFs of materials with known structures.
One does this by setting the MorphFuncxy
function to a PDF
computing function such as
PDFgetx3.
The input (morphed) 1D function should be the 1D diffraction data
one wishes to compute the PDF of and the target 1D function
can be the PDF of a target material with similar geometry.
More information about this will be released in the diffpy.morph
manuscript, and we plan to integrate this feature automatically into
PDFgetx3
soon.