Using diffpy.morph in Python
On top of the command-line (CLI) usage described in the quickstart tutorial,
diffpy.morph also supports Python integration.
All functionality supported on the CLI is also available for Python.
This page is intended for those acquainted with the basic morphs
described in the aforementioned quickstart tutorial who want to use diffpy.morph in their
Python scripts.
For those looking to use the Python-specific morph MorphFuncxy (described below)
with commonly used diffraction software like PDFgetx3
and PyFai are directed to the
funcxy tutorials.
Python Morphing Functions
In the quickstart tutorial, you were asked to try a combined scale, stretch, and smear morph on the files darkSub_rh20_C_01.gr and darkSub_rh20_C_44.gr using the command-line command
diffpy.morph --scale=0.8 --smear=-0.08 --stretch=0.005 --rmin=1.5 --rmax=30 darkSub_rh20_C_01.gr darkSub_rh20_C_44.grTo do the same on Python, we must first create a new Python script in the same directory as the data files darkSub_rh20_C_01.gr and darkSub_rh20_C_44.gr.
Then, in that script, import
from diffpy.morph.morphpy import morph
Finally, we run the
morphfunctionmorph_info, morph_table = morph("darkSub_rh20_C_01.gr", "darkSub_rh20_C_44.gr", scale=0.8, smear=-0.08, stretch=0.005, rmin=1.5, rmax=30)
The
morphfunction takes in two file names (or paths). You can also provide various parameters for morphing (see the Full Parameter List below).If, let’s say, the file darkSub_rh20_C_01.gr is in a subdirectory subdir/darkSub_rh20_C_01.gr, you should replace
"darkSub_rh20_C_01.gr"in the above example with"subdir/darkSub_rh20_C_01.gr".The
morphfunction returns a dictionarymorph_infoand a numpy arraymorph_table.
morph_infocontains all morphs as keys (e.g."scale","stretch","smear") with the optimized morphing parameters found bydiffpy.morphas values.morph_infoalso contains the Rw and Pearson correlation coefficients found post-morphing. Try printingprint(morph_info)and compare the values stored in this dictionary to those given by the CLI output!
morph_tableis a two-column array of the morphed function interpolated onto the grid of the target function (e.g. in our example, it returns the contents of darkSub_rh20_C_01.gr after the morphs are applied interpolated onto the grid of darkSub_rh20_C_44.gr).Notice that most parameters you are able to use are the same as the options provided in the command-line interface version of
diffpy.morph. For example, the--applyoption becomes theapply=Trueparameter.With that, you have already mastered the basics of using
diffpy.morphon Python!Note that instead of passing two files to
diffpy.morph, you might instead want to directly pass arrays. For example, rather than passing darkSub_rh20_C_01.gr, I may want to pass a two-column array namedds_rh20_c_01_arraycontaining the data table contents of the file darkSub_rh20_C_01.gr. In this case, we have a separate functionfrom diffpy.morph.morphpy import morph_arraysAssuming we have loaded the data in darkSub_rh20_C_01.gr into
ds_rh20_c_01_arrayand darkSub_rh20_C_44.gr intods_rh20_c_44_array, we can apply the same morph as step 3 by runningmorph_info, morph_table = morph_arrays(ds_rh20_c_01_array, ds_rh20_c_44_array, scale=0.8, smear=-0.08, stretch=0.5, rmin=1.5, rmax=30)Notice that the two-column format of the input to
morph_arraysis the same as the output ofmorphandmorph_arrays. It is VERY IMPORTANT that the data is in two-column format rather than the traditional two-row format. This is to reflect the file formats conventionally used to store PDFs. Again, try printingprint(morph_info)and compare!For a full list of parameters used by (both)
morphandmorph_arrays, see the Full Parameter List section below.
Full Parameter List
General Parameters
- save: str or path
 Save the morphed function to a the file passed to save. Use ‘-’ for stdout.
- get_diff: bool
 Return the difference function (morphed function minus target function) instead of the morphed function (default). When save is enabled, the difference function is saved instead of the morphed function.
- verbose: bool
 Print additional header details to saved files. These include details about the morph inputs and outputs.
- rmin: float
 Minimum r-value (abscissa) to use for function comparisons.
- rmax: float
 Maximum r-value (abscissa) to use for function comparisons.
- tolerance: float
 Specify least squares refiner tolerance when optimizing for morph parameters. Default: 10e-8.
- pearson: bool
 The refiner instead maximizes agreement in the Pearson function (default behavior is to minimize the residual). Note that this is insensitive to scale.
- addpearson: bool
 Maximize agreement in the Pearson function as well as minimizing the residual.
Manipulations
These parameters select the manipulations that are to be applied to the function. The passed values will be refined unless specifically excluded with the apply or exclude parameters.
- apply: bool
 Apply morphs but do not refine.
- exclude: list of str
 Exclude a manipulations from refinement by name (e.g. exclude=[“scale”, “stretch”] excludes the scale and stretch morphs).
- scale: float
 Apply scale factor.
This multiplies the function ordinate by scale.
- stretch: float
 Stretch function grid by a fraction stretch.
This multiplies the function grid by 1+stretch.
- squeeze: list of float
 Squeeze function grid given a polynomial p(x) = squeeze[0]+squeeze[1]*x+…+squeeze[n]*x^n.
n is dependent on the number of values in the user-inputted comma-separated list. The morph transforms the function grid from x to x+p(x). When this parameter is given, hshift is disabled. When n>1, stretch is disabled.
- smear: float
 Smear the peaks with a Gaussian of width smear.
This is done by convolving the function with a Gaussian with standard deviation smear. If both smear and smear_pdf are used, only smear_pdf will be applied.
- smear_pdf: float
 Convert PDF to RDF. Then, smear peaks with a Gaussian of width smear_pdf. Convert back to PDF. If both smear and smear_pdf are used, only smear_pdf will be applied.
- slope: float
 Slope of the baseline used in converting from PDF to RDF.
This is used with the option smear_pdf. The slope will be estimated if not provided.
- hshift: float
 Shift the function horizontally by hshift to the right.
- vshift: float
 Shift the function vertically by vshift upward.
- qdamp: float
 Dampen PDF by a factor qdamp.
- radius: float
 Apply characteristic function of sphere with radius given by parameter radius.
If pradius is also specified, instead apply characteristic function of spheroid with equatorial radius radius and polar radius pradius.
- pradius: float
 Apply characteristic function of spheroid with equatorial radius given by above parameter radius and polar radius pradius.
If only pradius is specified, instead apply characteristic function of sphere with radius pradius.
- iradius: float
 Apply inverse characteristic function of sphere with radius iradius.
If ipradius is also specified, instead apply inverse characteristic function of spheroid with equatorial radius iradius and polar radius ipradius.
- ipradius: float
 Apply inverse characteristic function of spheroid with equatorial radius iradius and polar radius ipradius.
If only ipradius is specified, instead apply inverse characteristic function of sphere with radius ipradius.
- funcy: tuple (function, dict)
 Apply a function to the y-axis of the (two-column) data.
This morph applies the function funcy[0] with parameters given in funcy[1]. The function funcy[0] take in as parameters both the abscissa and ordinate (i.e. take in at least two inputs with as many additional parameters as needed). The y-axis values of the data are then replaced by the return value of funcy[0].
For example, let’s start with a two-column table with abscissa x and ordinate y. let us say we want to apply the function
def linear(x, y, a, b, c): return a * x + b * y + c
This example function above takes in both the abscissa and ordinate on top of three additional parameters a, b, and c. To use the funcy parameter with parameter values a=1.0, b=2.0, and c=3.0, we would pass
funcy=(linear, {"a": 1.0, "b": 2.0, "c": 3.0}). For an explicit example, see the Python-Specific Morphs section below.- funcx: tuple (function, dict)
 Apply a function to the x-axis of the (two-column) data.
This morph works fundamentally differently from the other grid morphs (e.g. stretch and squeeze) as it directly modifies the grid of the morph function. The other morphs maintain the original grid and apply the morphs by interpolating the function ***.
This morph applies the function funcx[0] with parameters given in funcx[1]. The function funcx[0] take in as parameters both the abscissa and ordinate (i.e. take in at least two inputs with as many additional parameters as needed). The x-axis values of the data are then replaced by the return value of funcx[0]. Note that diffpy.morph requires the x-axis be monotonic increasing (i.e. for i < j, x[i] < x[j]): as such, if funcx[0] is not a monotonic increasing function of the provided x-axis data, the error
x must be a strictly increasing sequencewill be thrown.For example, let’s start with a two-column table with abscissa x and ordinate y. let us say we want to apply the function
def exponential(x, y, amp, decay): return abs(amp) * (1 - 2**(-decay * x))
This example function above takes in both the abscissa and ordinate on top of three additional parameters amp and decay. (Even though the ordinate is not used in the function, it is still required that the function take in both acscissa and ordinate.) To use the funcx parameter with parameter values amp=1.0 and decay=2.0, we would pass
funcx=(exponential, {"amp": 1.0, "decay:: 2.0}). For an explicit example, see the Python-Specific Morphs section below.- funcxy: tuple (function, dict)
 Apply a function the (two-column) data.
This morph applies the function funcxy[0] with parameters given in funcxy[1]. The function funcxy[0] take in as parameters both the abscissa and ordinate (i.e. take in at least two inputs with as many additional parameters as needed). The two columns of the data are then replaced by the two return values of funcxy[0].
For example, let’s start with a two-column table with abscissa x and ordinate y. let us say we want to apply the function
def shift(x, y, hshift, vshift): return x + hshift, y + vshift
This example function above takes in both the abscissa and ordinate on top of two additional parameters hshift and vshift. To use the funcy parameter with parameter values hshift=1.0 and vshift=2.0, we would pass
funcy=(shift, {"hshift": 1.0, "vshift": 1.0}). For an example use-case, see the Python-Specific Morphs section below.
Python-Specific Morphs
Some morphs in diffpy.morph are supported only in Python. Here, we detail
how they are used and how to call them.
MorphFunc: Applying custom functions
In these tutorial, we walk through how to use the MorphFunc morphs
(MorphFuncy, MorphFuncx, MorphFuncxy)
with some example transformations.
Unlike other morphs that can be run from the command line,
MorphFunc moprhs require a Python function and is therefore
intended to be used through Python scripting.
MorphFuncy:
The MorphFuncy morph allows users to apply a custom Python function
to the y-axis values of a dataset, enabling flexible and user-defined
transformations.
Let’s try out this morph!
Import the necessary modules into your Python script:
from diffpy.morph.morphpy import morph_arrays import numpy as npDefine a custom Python function to apply a transformation to the data. The function must take
xandy(1D arrays of the same length) along with named parameters, and return a transformedyarray of the same length. For this example, we will use a simple linear transformation that scales the input and applies an offset:def linear_function(x, y, scale, offset): return (scale * x) * y + offsetIn this example, we use a sine function for the morph data and generate the target data by applying the linear transformation with known scale and offset to it:
x_morph = np.linspace(0, 10, 101) y_morph = np.sin(x_morph) x_target = x_morph.copy() y_target = np.sin(x_target) * 20 * x_target + 0.8Setup and run the morph using the
morph_arrays(...).morph_arraysexpects the morph and target data as 2D arrays in two-column format[[x0, y0], [x1, y1], ...]. This will apply the user-defined function and refine the parameters to best align the morph data with the target data. This includes both the transformation parameters (our initial guess) and the transformation function itself:morph_params, morph_table = morph_arrays(np.array([x_morph, y_morph]).T, np.array([x_target, y_target]).T, funcy=(linear_function,{'scale': 1.2, 'offset': 0.1}))Extract the fitted parameters from the result:
fitted_params = morph_params["funcy"] print(f"Fitted scale: {fitted_params['scale']}") print(f"Fitted offset: {fitted_params['offset']}")
As you can see, the fitted scale and offset values match the ones used
to generate the target (scale=20 & offset=0.8). This example shows how
MorphFuncy can be used to fit and apply custom transformations. Now
it’s your turn to experiment with other custom functions that may be useful
for analyzing your data.
MorphFuncx:
The MorphFuncx morph allows users to apply a custom Python function
to the x-axis values of a dataset, similar to the MorphFuncy morph.
One caveat to this morph is that the x-axis values must remain monotonic
increasing, so it is possible to run into errors when applying this morph.
For example, if your initial grid is [-1, 0, 1], and your function is
lambda x, y: x**2, the grid after the function is applied will be
[1, 0, 1], which is no longer monotonic increasing.
In this case, the error x must be a strictly increasing sequence
will be thrown.
Let’s try out this morph!
Import the necessary modules into your Python script:
from diffpy.morph.morphpy import morph_arrays import numpy as npDefine a custom Python function to apply a transformation to the data. The function must take
xandy(1D arrays of the same length) along with named parameters, and return a transformedxarray of the same length. Recall that this function must maintain the monotonic increasing nature of thexarray.For this example, we will use a simple exponential function transformation that greatly modifies the input:
def exp_function(x, y, scale, rate): return np.abs(scale) * np.exp(np.abs(rate) * x)Notice that, though the function only uses the
xinput, the function signature takes in bothxandy.Like in the previous example, we will use a sine function for the morph data and generate the target data by applying the decay transfomration with a known scale and rate:
x_morph = np.linspace(0, 10, 1001) y_morph = np.sin(x_morph) x_target = x_target = 20 * np.exp(0.8 * x_morph) y_target = y_morph.copy()Setup and run the morph using the
morph_arrays(...).morph_arraysexpects the morph and target data as 2D arrays in two-column format[[x0, y0], [x1, y1], ...]. This will apply the user-defined function and refine the parameters to best align the morph data with the target data. This includes both the transformation parameters (our initial guess) and the transformation function itself:morph_params, morph_table = morph_arrays(np.array([x_morph, y_morph]).T, np.array([x_target, y_target]).T, funcx=(decay_function, {'scale': 1.2, 'rate': 1.0}))Extract the fitted parameters from the result:
fitted_params = morph_params["funcx"] print(f"Fitted scale: {fitted_params['scale']}") print(f"Fitted rate: {fitted_params['rate']}")
Again, we should see that the fitted scale and offset values match the ones used to generate the target (scale=20 & rate=0.8).
For fun, you can plot the original function to the morphed function to see how much the
MorphFuncxy:
The MorphFuncxy morph allows users to apply a custom Python function
to a dataset that modifies both the x and y column values.
This is equivalent to applying a MorphFuncx and MorphFuncy
simultaneously.
This morph is useful when you want to apply operations that modify both
the grid and function value.
Examples of using MorphFuncxy with PyFai azimuthal integration
and PDFgetx3 PDF calculation are included here.
For this tutorial, we will go through two examples. One simple one
involving shifting a function in the x and y directions, and
another involving a Fourier transform.
Let’s start by taking a simple
sinefunction.import numpy as np morph_x = np.linspace(0, 10, 101) morph_y = np.sin(morph_x) morph_table = np.array([morph_x, morph_y]).TThen, let our target function be that same
sinefunction shifted to the right by0.3and up by0.7.target_x = morph_x + 0.3 target_y = morph_y + 0.7 target_table = np.array([target_x, target_y]).TWhile we could use the
hshiftandvshiftmorphs, this would require us to refine over two separate morph operations. We can instead perform these morphs simultaneously by defining a function:def shift(x, y, hshift, vshift): return x + hshift, y + vshiftNow, let’s try finding the optimal shift parameters using the
MorphFuncxymorph. We can try an initial guess ofhshift=0.0andvshift=0.0.from diffpy.morph.morphpy import morph_arrays initial_guesses = {"hshift": 0.0, "vshift": 0.0} info, table = morph_arrays(morph_table, target_table, funcxy=(shift, initial_guesses))Finally, to see the refined
hshiftandvshiftparameters, we extract them frominfo.print(f"Refined hshift: {info["funcxy"]["hshift"]}") print(f"Refined vshift: {info["funcxy"]["vshift"]}")
Now for an example involving a Fourier transform.
Let’s say you measured a signal of the form \(f(x)=\exp\{\cos(\pi x)\}\). Unfortunately, your measurement was taken against a noisy sinusoidal background of the form \(n(x)=A\sin(Bx)\), where
A,Bare unknown. For our example, let’s say (unknown to us) thatA=2andB=1.7.import numpy as np n = 201 dx = 0.01 measured_x = np.linspace(0, 2, n) def signal(x): return np.exp(np.cos(np.pi * x)) def noise(x, A, B): return A * np.sin(B * x) measured_f = signal(measured_x) + noise(measured_x, 2, 1.7) morph_table = np.array([measured_x, measured_f]).TYour colleague remembers they previously computed the Fourier transform of the function and has sent that to you.
# We only consider the region where the grid is positive for simplicity target_x = np.fft.fftfreq(n, dx)[:n//2] target_f = np.real(np.fft.fft(signal(measured_x))[:n//2]) target_table = np.array([target_x, target_f]).TWe can now write a noise subtraction function that takes in our measured signal and guesses for parameters
A,B, and computes the Fourier transform post-noise-subtraction.def noise_subtracted_ft(x, y, A, B): n = 201 dx = 0.01 background_subtracted_y = y - noise(x, A, B) ft_x = np.fft.fftfreq(n, dx)[:n//2] ft_f = np.real(np.fft.fft(background_subtracted_y)[:n//2]) return ft_x, ft_fFinally, we can provide initial guesses of
A=0andB=1to theMorphFuncxymorph and see what refined values we get.from diffpy.morph.morphpy import morph_arrays initial_guesses = {"A": 0, "B": 1} info, table = morph_arrays(morph_table, target_table, funcxy=(background_subtracted_ft, initial_guesses))Print these values to see if they match with the true values of of
A=2.0andB=1.7!print(f"Refined A: {info["funcxy"]["A"]}") print(f"Refined B: {info["funcxy"]["B"]}")
You can also use this morph to help find optimal parameters
(e.g. rpoly, qmin, qmax, bgscale) for computing
PDFs of materials with known structures.
One does this by setting the MorphFuncxy function to a PDF
computing function such as
PDFgetx3.
The input (morphed) 1D function should be the 1D diffraction data
one wishes to compute the PDF of and the target 1D function
can be the PDF of a target material with similar geometry.
More information about this will be released in the diffpy.morph
manuscript, and we plan to integrate this feature automatically into
PDFgetx3 soon.