CSP Estimators: The Frequency-Smoothing Method

The non-blind spectral-correlation estimator called the FSM is favored when one wishes to have fine control over frequency resolution and can tolerate long FFTs.

In this post I describe a basic estimator for the spectral correlation function (SCF): the frequency-smoothing method (FSM). The FSM is a way to estimate the SCF for a single value of cycle frequency. Recall from the basic theory of the cyclic autocorrelation and SCF that the SCF is obtained by infinite-time averaging of the cyclic periodogram or by infinitesimal-resolution frequency averaging of the cyclic periodogram. The FSM is merely a finite-time/finite-resolution approximation to the SCF definition.

One place the FSM can be found is in (My Papers [6]), where I introduce time-smoothed and frequency-smoothed higher-order cyclic periodograms as estimators of the cyclic polyspectrum. When the cyclic polyspectrum order is set to $n = 2$ , the cyclic polyspectrum becomes the spectral correlation function, so the FSM discussed in this post is just a special case of the more general estimator in [6, Section VI.B].

Let’s start by reviewing the FSM for power-spectrum measurement. That is, let’s begin by looking at an estimator for the power spectral density (PSD). We’ll use discrete time and discrete frequencies in this post, in contrast to My Papers [6], because almost everybody who might read this post will be interested in sampled data. Moreover, discrete frequencies lead to discrete cycle frequencies in the method, which causes a significant complication in practice. We’ll highlight that complication, and show how it can be mitigated.

The Frequency-Smoothed Periodogram

The power spectrum estimate is obtained by smoothing the periodogram. The discrete Fourier transform (DFT) of $x(t)$ is defined by

$X(f) = \displaystyle \sum_{t=0}^{N-1} x(t) e^{-i2\pi f t}, \hfill (1)$

where $f = n/N$ for $n = 0,1, \ldots, N-1$ . The periodogram is the normalized squared magnitude of the DFT,

$I(f) = \displaystyle\frac{1}{N} \left| X(f) \right|^2. \hfill (2)$

The periodogram is not a particularly good spectrum estimator when the data is random; it is quite useful when the data is nonrandom and contains periodic components. The maximum-likelihood estimator for the frequency of a tone in white Gaussian noise is the location of the maximum of the Fourier transform magnitude over all frequencies $f$ , which is usefully approximated by the discrete-time/discrete-frequency periodogram. For random data (like communication signals), the periodogram is erratic and does not converge to the true PSD even as $N\rightarrow\infty$ .

The PSD can be accurately estimated by smoothing the periodogram,

$\hat{S}(f) = g(f) \otimes I(f), \hfill (3)$

where $g(f)$ is some unit-area pulse-like function, such as a rectangle. The smoothing implied by this convolution allows the estimate to converge to a biased (distorted) estimate of the true PSD. If $N$ is large and the width of $g(f)$ is small relative to the fluctuations over frequency in the true PSD, then the bias is low and the estimator converges to very nearly the true PSD. More explicitly, the FSM for PSD estimation is

$\hat{S}(f) = g(f) \otimes I(f) \hfill (4)$

$= \displaystyle \sum_\nu g(f-\nu) I(\nu) \hfill (5)$

$= \displaystyle\frac{1}{N} \displaystyle \sum_\nu g(f-\nu) \left| X(\nu) \right|^2 \hfill (6)$

$= \displaystyle\frac{1}{N} \displaystyle \sum_{n=0}^{N-1} g(f - n/N) \left| X(n/N) \right|^2, \hfill (7)$

for $f = m/N$ . So to implement this FSM, we need only use an FFT algorithm and a general convolution algorithm capable of handling complex inputs. An FSM-based PSD estimate for our rectangular-pulse BPSK signal is shown here:

fsm_psd — Figure 1. Power spectral density estimate for a rectangular-pulse BPSK signal obtained using the frequency-smoothing method and a cycle frequency of zero.

This estimate is formed from a data record with length $32,768$ samples and a rectangular smoothing window $g(f)$ with width $164$ frequency bins, which is about $0.005$ (normalized) Hz.

The frequency-smoothed periodogram method of spectrum estimation is referred to as the Daniell method after P. J. Daniell (The Literature [R42]).

The Frequency-Smoothed Cyclic Periodogram

For the spectral correlation function, we modify the FSM for the PSD
by replacing the periodogram with the cyclic periodogram, which is defined
by

$I^\alpha(f) = \displaystyle\frac{1}{N} X(f + \alpha/2) X^*(f - \alpha/2), \hfill (8)$

where $\alpha$ is a cycle frequency of interest. Since $X(f)$ is a discrete-frequency function, the arguments $f \pm \alpha/2$ are valid only if $\pm \alpha/2 = k/N$ . That is, the discrete-frequency function $X(f)$ can be shifted to the left and right only by multiples of the discrete frequency $1/N$ . It may happen that the true cycle frequency does not satisfy this constraint. In practice, the closest discrete shifts are chosen for $\pm \alpha/2$ . When the two frequencies do not correspond to multiples of $1/N$ , then zero-padding the data prior to FSM estimation is advisable; this will allow a closer match to discrete frequencies. In general, the $+\alpha/2$ and $-\alpha/2$ shifts don’t have to be negatives of each other. The idea is to choose the shifts $a_1$ and $a_2$ such that $\left||\alpha| - (|a_1/N |+ |a_2/N|)\right|$ is minimized (My Papers [16]).

To estimate the SCF, then, we convolve the cyclic periodogram with a smoothing window as before,

$\hat{S}^\alpha (f) = g(f) \otimes I^\alpha(f) \hfill (9)$

$= \displaystyle\frac{1}{N} \displaystyle \sum_{n=0}^{N-1} g(f - n/N) X(n/N + a_1/N) X^*(n/N - a_2/N) \hfill (10)$

for $f = m/N$ . All that is required to implement this algorithm is an FFT, complex multiplication, and a general convolution algorithm (such as MATLAB’s conv.m or filter.m). The conjugate SCF is estimated by frequency smoothing the conjugate cyclic periodogram given by

$I_*^\alpha(f) = \displaystyle\frac{1}{N} X(f + \alpha/2) X(\alpha/2-f). \hfill (11)$

Applying this algorithm to our rectangular-pulse BPSK signal, we obtain the following estimates for the non-conjugate SCF:

fsm_nonconj_scfs — Figure 2. FSM-based non-conjugate spectral correlation function estimates for a rectangular-pulse BPSK signal with bit rate 1/T0 = 0.1 and carrier frequency offset of fc = 0.05.

The conjugate SCF estimates are:

fsm_conj_scfs — Figure 3. FSM-based conjugate spectral correlation function estimates for a rectangular-pulse BPSK signal with bit rate 1/T0 = 0.1 and carrier frequency offset of fc = 0.05. Note that the estimate for 2fc+1/T0 is nearly obscured by that for 2fc-1/T0, which is expected because the two theoretical features are identical in magnitude.

To obtain these estimates, we used a data-record length of $32,768$ samples and a rectangular $g(f)$ with width $164$ frequency points, or about $0.005$ Hz. Recall the sampling frequency is set to unity here, the signal’s bit rate is $1/T_0 = 0.1$ , and the carrier frequency is $f_c = 0.05$ . The $32,768$ -sample data record is zero-padded by a factor of two prior to forming the cyclic periodogram.

The estimates presented in Figures 2 and 3 closely match the known exact formulas for the non-conjugate and conjugate spectral correlation functions for rectangular-pulse BPSK, as illustrated in the second-order verification post.

Choice for the Frequency-Smoothing Window $g(f)$

We used the rectangular pulse $g(f)$ in our estimator examples for a good reason: computational cost. For arbitrary $g(f)$ , a general-purpose convolution routine can be used to find the SCF estimate, but the convolutions can be expensive. Computational cost can be reduced by selecting only a subset of all possible output frequencies, and computing the convolution just for those particular spectral frequencies.

Arbitrary output frequencies can be accommodated, however, by using a rectangular smoothing window $g(f)$ because the convolution can be efficiently performed by a running sum over the cyclic periodogram. The estimates for successive spectral frequencies can be obtained by subtracting the cyclic-periodogram value that is no longer covered by the support of $g(f)$ (left edge of the window) and adding the one that is newly included in the support (right edge of the window).

In the final analysis, the FSM offers good control over spectral resolution, but requires an FFT operation on the entire data record. When the signal of interest has low signal-to-interference-and-noise ratio (SINR), the required data-record length can be very large. For non-real-time CSP applications running on modern general-purpose computers, this isn’t usually a problem. But when attempting to transition CSP to hardware, long complex-valued data vectors are problematic. For such cases, we can turn to the time-smoothing method (TSM) of SCF estimation, which we discuss in another post.

Author: Chad Spooner

I'm a signal processing researcher specializing in cyclostationary signal processing (CSP) for communication signals. I hope to use this blog to help others with their cyclo-projects and to learn more about how CSP is being used and extended worldwide. View all posts by Chad Spooner

59 thoughts on “CSP Estimators: The Frequency-Smoothing Method”

AN says:

March 31, 2017 at 10:17 am

Hey! This is a great blog! Learning a lot here. Can you please provide MATLAB codes to generate these plots?
Thanks!

Reply
1. Chad Spooner says:
  
  March 31, 2017 at 1:25 pm
  
  Thanks AN. The CSP Blog is a self-help blog. I don’t give out much code. Let me know if you get stuck, though, trying to write your own. That’s the only way you’ll really understand CSP.
  
  Reply
Chen says:

August 13, 2019 at 7:49 am

Hi Chad
I think you chose complex-valued waveform of BPSK signal to estimate the PSD and cyclic spectrum, right?
And how can I judge the simulation results of the FSM priodogram and cyclic priodogram of BPSK signal correct or not?
The value range of n in Equation (10) can not reach 0 and N. The reason is that for certain cycle frequency, “n+a1=0” , right?

Reply
1. Chad Spooner says:
  
  August 13, 2019 at 10:10 am
  
  Yes, the BPSK signal I use throughout the CSP Blog is complex-valued.
  
  And how can I judge the simulation results of the FSM priodogram and cyclic priodogram of BPSK signal correct or not?
  
  Do you mean how can you judge whether my plots are correct? Or your plots? My plots are consistent with theory in the cases where I know the theory, such as PSK and QAM signals. So you can use my plots in the BPSK posts (here and here) or the gallery posts (here and here) to compare with yours.
  
  The value range of n in Equation (10) can not reach 0 and N. The reason is that for certain cycle frequency, “n+a1=0” , right?
  
  Yes, when $\alpha \neq 0$ in the non-conjugate SCF, you cannot use all the indices in the sum over $n$ unless you are willing to view the FFT output $X$ as a periodic function (with period $N$ ). But generally you don’t want to do that, and the edges of the cyclic periodogram will contain zeros. The larger $\alpha$ becomes, the more zeros you’ll see at the ends (this also corresponds to the diamond shape of the support for the SCF).
  
  Reply
  1. Chen says:
    
    August 13, 2019 at 8:05 pm
    
    But why the estimation performance of FSM method decreased as I increased the length of data record? The simulation parameters I chose were the same as yours.
    
    Reply
    1. Chad Spooner says:
      
      August 14, 2019 at 8:28 am
      
      It isn’t possible to substantively answer this question. Not enough information!
      
      Reply
Chen says:

September 3, 2019 at 5:31 am

Hi Chad
According to the equation (25) in Gardner’s paper “Measurement of Spectral Correlation”, I think the result of equation (3) should be divided by the frequency bins number of smoothing window. Do as what I advised, the FSM-based PSD matches well with the theoretical PSD for BPSK signal.
What about your opinion?

Reply
1. Chad Spooner says:
  
  September 3, 2019 at 6:23 am
  
  My (3) is a bit more general than Gardner’s (25), which smooths using a unit-area rectangle. My $g(f)$ can be any pulse-like unit-area window function, but I do use a rectangle a lot too. The factor you say is missing is actually embedded in the definition of $g(f)$ : It has unit area. Therefore if it is a rectangle, it has height equal to the reciprocal of the width. I’ve added “unit area” to the post to clarify.
  
  Reply
  1. Mimi says:
    
    February 5, 2020 at 10:32 am
    
    The Figure 1 in this post is similar to what your code make_rect_bpsk.m. But you did not use the convolution after the FFT, but the plot looks same.
    
    So my question is related to this: To estimate the SCF, we should take the FFT of non conjugate CAF and then we need to convolve this with a smoothing window g(f). Does it give any extra benefit of using the g(f)? What is a good size of g(f)?
    
    Is the FFT of non conjugate CAF called the cyclic Periodogram?
    
    Reply
    1. Chad Spooner says:
      
      February 5, 2020 at 10:54 am
      
      The Figure 1 in this post is similar to what your code make_rect_bpsk.m. But you did not use the convolution after the FFT, but the plot looks same.
      
      I don’t quite understand this. The plot in Figure 1 looks the same as what?
      
      So my question is related to this: To estimate the SCF, we should take the FFT of non conjugate CAF and then we need to convolve this with a smoothing window g(f).
      
      No, to estimate the SCF, take the FFT of the data, shift it up and down by $\alpha/2$ , multiply, divide by the length of the data, then convolve the result with a smoothing window. See the posts on the frequency-smoothing and time-smoothing methods of spectral correlation estimation for details and examples.
      
      Does it give any extra benefit of using the g(f)?
      
      Yes. The main point of the Periodogram post is that if you don’t average (“smooth”) the periodogram in either time or frequency, you will have a highly erratic function of frequency. Note also that the reliability (variance) of a spectral correlation estimate is inversely proportional to the product of the data-block length and the frequency resolution. For the frequency-smoothing method, the frequency resolution is the width of $g(f)$ . So, to generate a reliable (low variance) estimate, you want to use large data-block lengths and large (coarse) frequency resolution (wide $g(f)$ ).
      
      What is a good size of g(f)?
      
      A good width for $g(f)$ is typically 1-5% of the sampling rate.
      
      Is the FFT of non conjugate CAF called the cyclic Periodogram?
      
      No. The cyclic periodogram is defined in (3) and (5) in the Periodogram post. The Fourier transform of the non-conjugate CAF is the non-conjugate spectral correlation function.
      
      Reply
      1. Mimi says:
        
        February 6, 2020 at 2:59 am
        
        Thanks for the reply. I meant, the Figure 1 in this post is similar to what your code make_rect_bpsk.m. generates.
      2. Mimi says:
        
        February 6, 2020 at 3:18 am
        
        Thanks for the reply. I need some more clarification. You said “To estimate the SCF, take the FFT of the data, shift it up and down by \alpha/2, multiply, divide by the length of the data”.
        
        1. Shift it up and down by alpha/2: Is it not giving back the same matrix if I shift it up by alpha/2 and then down by the same amount. Also is this shifting the same what fftshift does? If not same, then what this shift up and down by alpha/2 is doing here?
        
        2. Then you said “multiply”: what do I multiply with what? I understood the divide by the length of the data.
        
        3. Is the cyclic periodogram calculated as: FT[x(t)*exp(j*2*pi*alpha/2*t)] * conj{FT[x(t)*exp(-j*2*pi*alpha/2*t)]}
      3. Chad Spooner says:
        
        February 6, 2020 at 10:22 am
        
        1. Shift it up and down by alpha/2: Is it not giving back the same matrix if I shift it up by alpha/2 and then down by the same amount.
        
        Well, you shift the FFT up by $\alpha/2$ to get $X(f-\alpha/2)$ and shift it down by $\alpha/2$ to get $X(f+\alpha/2)$ . Those are the two quantities you need to multiply together to get the cyclic periodogram.
        
        Also is this shifting the same what fftshift does?
        
        No. MATLAB’s fftshift.m swaps the second half of a vector for the first half: fftshift([1 2 3 4]) = [3 4 1 2]. This is identical to circshift([1 2 3 4], 2). The shifting we need to do in CSP depends on the value of the cycle frequency $\alpha$ .
        
        If not same, then what this shift up and down by alpha/2 is doing here?
        
        The mathematics of CSP says that you can estimate the spectral correlation function by averaging the cyclic periodogram, which is the product of two frequency-shifted Fourier transforms. More physically, the spectral correlation function is the temporal correlation between two narrowband spectral components of a signal, after those components have been isolated and frequency-shifted to zero frequency. And those two frequency components have frequencies specified by $f_1 = f + \alpha/2$ and $f_2 = f - \alpha/2$ in the usual parameterization.
        
        2. Then you said “multiply”: what do I multiply with what? I understood the divide by the length of the data.
        
        See answer to Question 1 above.
        
        3. Is the cyclic periodogram calculated as: FT[x(t)*exp(j*2*pi*alpha/2*t)] * conj{FT[x(t)*exp(-j*2*pi*alpha/2*t)]}
        
        I’m not sure about the plus and minus signs in the exp() functions, and you’ll need to divide by length(x(t)), but, yes, this is one way to obtain the cyclic periodogram. I typically don’t use this method, because if you’re careful, you can do the frequency shifting in the frequency domain using only things like circshift.m, which is much cheaper than creating the complex sine waves and multiplying them by the signal x(t) and taking two (not one) Fourier transforms.
Chen says:

September 3, 2019 at 5:51 am

I find an interesting phenomenon. To meet the condition that the approximated value, obtained by integrating PSD values over frequency and multiplying that sum by the frequency increment, must be equal to the estimate of power. Therefore, different PSD estimating methods have different frequency increment and different PSD value. For example, periodogram based PSD values are much greater than that based on TSM and FSM method.

Reply
1. Chen says:
  
  September 3, 2019 at 5:52 am
  
  How do you explain that phenomenon?
  
  Reply
  1. Chad Spooner says:
    
    September 3, 2019 at 7:07 am
    
    Therefore, different PSD estimating methods have different frequency increment and different PSD value.
    
    Yes, the different methods can have different frequency-bin increments as well as different spectral resolutions. This means that for any particular given frequency $f$ , the PSD estimates from the various methods will likely have different values. But for properly set up measurements (the measurement spectral resolution is well-matched to the data’s spectral characteristics, the data-block length is sufficient to allow averaging away erratic components), the values will be quite similar, as you can see from my comparisons between the FSM and TSM outputs.
    
    And the sum of the PSD estimate values, multiplied by the frequency increment, should be very close for each method.
    
    For example, periodogram based PSD values are much greater than that based on TSM and FSM method.
    
    Yes, the periodogram is highly erratic, and is a terrible estimator of the PSD, so there will be periodogram values that are very much greater than the true PSD value and periodogram values that are very much less than the true PSD value–that’s why we need the averaging.
    
    Reply
Chen says:

September 11, 2019 at 8:04 am

Convolving the periodogram of x(t) with g(f) would obtain PSD estimate with length of N+164-1. So what is the frequency increment between the adjacent frequency bins of that PSD estimates? Is the frequency increment fs/(N+164-1)?

Reply
1. Chad Spooner says:
  
  September 11, 2019 at 11:20 am
  
  This is a convolution question, not a CSP question! The spacing between the samples does not change as a result of convolution.
  
  Reply
Smoon says:

September 27, 2019 at 7:13 am

The posts are very helpful for the beginners like me. It would be better if you supply more detailed unit labels on the figures. I think the unit of the nomalized frequency is [cycle/sample], and of the PSD is [dB/(rad/sample)]. I am not sure whether the units I added here is correct? Or maybe it is a traditional label manner in the CPS society.

Reply
1. Chad Spooner says:
  
  October 4, 2019 at 8:27 am
  
  Thanks for visiting the CSP Blog and leaving a comment Smoon! I appreciate it.
  
  I agree with you about normalized frequency–strictly speaking I should label axes with [Cycles/Sample] or the like. But when the actual physical sampling rate really is $1.0$ , then the unit is also Cycles/Second, or Hz. Mostly I’m following convention, which is to use ‘Hz’ for both physical frequency and normalized frequency. To translate any frequency axis from normalized frequency to physical frequency, just multiply by the actual sampling rate.
  
  And yes, the physical unit of a power spectrum is Watts/Hz. Normally expressed in decibels, but often in decibels relative to some basic power level, such as one Watt (dBW) or 1 milli Watt (dBm). In almost all of my posts, I’m unconcerned with the actual power level of the data–even when I capture data using my lab equipment I don’t keep track of the relationship between the integers I obtain from a sampler and the power level of the electromagnetic wave incident on the attached antenna. So I rather lazily just use decibels and the relationship to actual physical power is suppressed.
  
  What is the ‘CPS society’?
  
  Reply
Clint says:

July 9, 2020 at 10:11 am

This feels like a dumb question, but I’m completely failing to re-create your plots for the conjugate SCF. (Using what should be identical to your textbook BPSK signal example.)

I have another function that calls the function below and convolves a smoothing window with the result. The results match your FSM plots quite well for the non-conjugate SCF, but not for the conjugate version.

I’ve tried computing the SCF for all possible values of alpha for the signal under consideration, and none of the results match your plot for the conjugate SCF; in fact, they’re all pretty noisy and remain below 0dB.

I’ve also implemented the time-smoothing method, and I can replicate your plots for the non-conjugate SCF, but again not for the conjugate SCF. (I still have a bug somewhere such that my phase compensation factor only works when alpha*N is an integer, but I’m working on fixing my frequency-smoothed conjugate SCF before going back to that.)

Below is my python function computing the cyclic periodogram; my understanding is that the only difference is whether X2 should have the conjugate operation applied (yes for the non-conjugate SCF, no for the conjugate SCF).

def cyclic_periodogram(x, alpha=0, conjugate=False):
“””Compute the cyclic periodogram from the given signal x

I^A(f) = 1/N * X(f + alpha/2) * conj(X(f – alpha/2))
where X(f) is the Fourier transform of x(n) and x(n) has N total samples

Assumes that alpha is in normalized frequency; i.e. alpha = alpha_absolute/sampling_frequency

Note that the conjugate cyclic periodogram does not include the conjugation operation in its definition:
I^A_*(f) = 1/N * X(f + alpha/2) * X(f – alpha/2)
“””

X = numpy.fft.fftshift(numpy.fft.fft(x))

# shift X left by # of bins corresponding to alpha/2 and another copy right by the same amount
# eventually can slightly improve the resolution by allowing slightly different left/right shifts
offset = int(numpy.round(alpha/2 * len(X)))
X1 = numpy.roll(X, offset)
X2 = numpy.roll(X, -offset)

if not conjugate:
X2 = numpy.conj(X2)
print(f’Computing non-conjugate cyclic periodogram for alpha = {alpha}, offset = {offset}’)
else:
print(f’Computing conjugate cyclic periodogram for alpha = {alpha}, offset = {offset}’)

return 1/len(X) * X1 * X2

Can you see something I’m missing here? Except for the conjugation of X2 (and the values of alpha where the SCF is meaningful), all my code is identical for the non-conjugate and conjugate SCF, and it matches your plots perfectly for the non-conjugate case.

Thanks for any help!

Reply
1. Clint says:
  
  July 9, 2020 at 10:12 am
  
  Ugh, it got rid of all the indentation, making it much less readable. I’ll clarify if needed…
  
  Reply
2. Clint says:
  
  July 9, 2020 at 12:57 pm
  
  To clarify one point, – when I tried to estimate the conjugate SCF for “all possible values of alpha” I did reduce the total length from 32,768 to 4,096 samples. My understanding is that that should just increase the variance of the resulting SCF estimate, but shouldn’t fundamentally change the behavior.
  
  Also, there’s raw LaTeX in this post, as well as the post for the TSM.
  
  Reply
  1. Chad Spooner says:
    
    July 9, 2020 at 3:06 pm
    
    Thanks very much for pointing out the raw latex! Fixed it.
    
    Yes, reducing the amount of processed data does increase the variance of the estimate, and also impacts the all-important cycle-frequency resolution parameter. Which means that one may also experience cycle leakage when the block length is too small–energy from a nearby slice of the spectral correlation function can leak into the slice you are focusing on. So, variance and leakage are both increased in general.
    
    Reply
3. Chad Spooner says:
  
  July 9, 2020 at 3:03 pm
  
  Hey Clint! Sorry about the loss of the indentation when you posted your code. When I comment on a comment, I have several markup tool buttons available, including “code,” which allows posting of code that should be formatted reasonably by WordPress. Do you see that when you comment?
  
  One major problem you are having is your statement:
  
  my understanding is that the only difference is whether X2 should have the conjugate operation applied (yes for the non-conjugate SCF, no for the conjugate SCF).
  
  That is not the only difference; see Equation (11) in the FSM post and Equation (8) in the SCF post. Instead of the non-conjugate cyclic periodogram
  
  $\frac{1}{T} X_T(t, f+\alpha/2) X_T^*(f-\alpha/2)$
  
  you remove the conjugate and negate the argument of the second factor
  
  $\frac{1}{T} X_T(t, f+\alpha/2) X_T(\alpha/2 -f )$
  
  Try implementing that and re-running the estimates.
  
  Reply
Clint says:

July 10, 2020 at 1:35 pm

Thanks for clarifying, Chad, in spite of my apparent inability to read equations.

I don’t see any formatting/markup options when commenting; just raw text.

I don’t have it working yet, although I have something implemented that at least corrects for the problem you mentioned, as well as limiting the domain of the SCF by padding with zeros beyond f = +/-0.5 (my previous code treated the X(t, f) as being infinitely periodic in frequency, though I came across this comment – https://cyclostationary.blog/2018/06/01/csp-estimators-the-fft-accumulation-method/#comment-2131 – which makes me think that you had a discussion of that with plots of the SCF domain elsewhere, though I can’t remember where at this point. Is there a way to view a list of all blog posts, instead of either searching (and hoping I stumble upon the right term) or viewing the archives a month at a time? (Those are the only options I’ve found, other than just going through articles sequentially.)

Reply
1. Chad Spooner says:
  
  July 10, 2020 at 3:26 pm
  
  I haven’t yet published a post on the spectral correlation “principal domain,” but I may have a plot or two showing the diamond or part of it. Perhaps in the SSCA post.
  
  I’ve created a new page for the CSP Blog. Look at the top of the home page for “All CSP Blog Posts” and click that to get a simple list, in chronological order, of all posts. Is that what you were looking for? Once I hear back from you about whether or not that is what you desired, and perhaps suggestions for improvement, I’ll do a little post announcing it. You aren’t the first to suggest the navigation on the site is cumbersome.
  
  Thanks Clint!
  
  Reply
  1. Clint says:
    
    July 13, 2020 at 12:11 pm
    
    Thanks for the new page, Chad! It’s definitely helpful, and a good complement to the posts you already had with beginner links and category links.
    
    The only additional suggestion I might have would be a list of all tags, but that would be somewhat redundant with the list of categories, and thus much less useful.
    
    Reply
2. Clint says:
  
  July 14, 2020 at 12:19 pm
  
  Got to work on this some more today. Finally got it working!
  
  Turns out the remaining problem was an off-by-one error in the indexing for the conjugate SCF case. Now I just need to look at it in more detail to be sure of _why_ the indexing needs to be the way it is.
  
  Thanks again for the help Chad!
  
  Reply
  1. Clint says:
    
    July 14, 2020 at 12:34 pm
    
    In case it helps others trying to implement this, I’ll describe what I was seeing before I found my bug.
    
    My problematic curves were generally the same shape as Chad’s, but 10-15dB lower and they had fairly high variance/noise. After thinking about cycle leakage, I wondered if my offset was close but not quite right, which turned out to be the case.
    
    Consistent with Chad’s comments above about cycle leakage and block length, the level of my curves went down as I increased the total length of random data that I was processing.
    
    Reply
Ada says:

August 10, 2020 at 5:30 am

Hi, thank you for creating this blog. Can you confirm that the parameter m (appearing below eq. 7 and eq. 10) is the width of the rectangle used for smoothing. Thanks.

Reply
1. Chad Spooner says:
  
  August 10, 2020 at 6:56 am
  
  Thanks for the comment, Ada, and for checking out the CSP Blog.
  
  Here is what I say near (10):
  
  The parameter $m$ here is an integer-valued index that specifies a frequency. Since we’re doing digital signal processing here, the input signal is a sampled function of time, and the FFT and SCF are discrete-frequency functions. So $m$ just tells us which frequencies we are getting out of the FSM algorithm.
  
  The width of the rectangle or, more generally, the width of the smoothing function $g(f)$ is usually referred to as $\Delta f$ in the CSP Blog. In the examples in the FSM post, the width of $g(f)$ is $164$ frequency bins, or $164$ consecutive values of $m$ .
  
  Does that help?
  
  Reply
Ada says:

August 11, 2020 at 7:58 am

Hello, thank you for your fast response. So the smoothing function g(f) will comprise 164 frequency values of n for averaging. The convolution will use m as an index for the shifting. This means that the output of the FSM will be about 200 averaged points (32768/164) when no overlapping of the smoothing function is used. I do not understand what you mean by the width of g(f) is 164 consecutive values of m ? Thanks.

Reply
1. Chad Spooner says:
  
  August 12, 2020 at 7:42 am
  
  I see you posted a comment after this one indicating you understand after all. For others, I’ll just say here that width of the smoothing function is $164$ points, so for each output spectral correlation estimate (one value of spectral frequency $f$ ), $164$ values of the cyclic periodogram are averaged together, not $32768/164$ .
  
  Reply
Ada says:

August 12, 2020 at 1:45 am

Hello, I understood it. Thank you !

Reply
Clint says:

April 27, 2021 at 2:48 pm

Hi Chad,

You’ve mentioned zero-padding the signal in at least a couple places, such as above:
“When the two frequencies do not correspond to multiples of 1/N, then zero-padding the data prior to FSM estimation is advisable; this will allow a closer match to discrete frequencies.”

Is there a reason to use zero-padding rather than just to process a longer signal in the first place? In your example above, why don’t you just generate a signal that is twice as long (or whatever length you want to achieve with zero-padding)? Or, if working with a real signal, just process/record a longer chunk? Unless you’re using a custom FFT implementation, it’ll still require the same amount of computation.

I could see it being beneficial in hardware or real-time applications, where you could shortcut part of the calculations if you know large chunks are zeros. But, as you say, many such applications will do better with the TSM anyway.

Thanks!
Clint

Reply
1. Chad Spooner says:
  
  April 29, 2021 at 2:36 pm
  
  Thought-provoking question Clint! Thanks. Let’s see if my reply is convincing.
  
  Is there a reason to use zero-padding rather than just to process a longer signal in the first place?
  
  If we want to do the best CSP we can with the data block we have, with length $T$ , then we cannot increase the data length. We always want to do the best we can with what we have, for sure, but we are also sometimes constrained by short data records, and so do not have the choice you suggest.
  
  The problem, as we agree I think, is that the number of FFT bins represented by the frequency $\alpha/2$ is not an integer. So we have to pick two frequency bins that best represent, in some sense, the frequency span $\alpha/2$ . Suppose that $\alpha/2$ lies exactly between two FFT bins. That is $(\alpha/2)T = k + 1/2$ , where $k$ is some integer between $1$ and $T$ . Then, yes, if we were to process with a new block length of $2T$ , the frequency distance of $\alpha/2$ would then be an integer. But what about when the number $\alpha/2$ is not exactly halfway (or quarter-way, eighth-way, etc.) between FFT bins? Such as a number like $\alpha = 0.05$ . This number cannot be represented as $k/2^M$ for any $k$ and $M$ . In fact, this is why I’ve chosen the bit rate of the rectangular-pulse BPSK signal used throughout the CSP Blog to be $0.1$ –it is not friendly to DFTs of sequences with dyadic lengths $2^M$ . (Don’t want to sneak in some numbers with special privileges or properties…) When $(\alpha/2)T$ is near the midpoint between FFT bins, we can increase $T$ and it will be closer to an integer number of bins, as you suggest. But further increases may find it farther away again. If we look at $0.05T$ for a variety of $T$ we get:
  
  1024 51.2
  2048 102.4
  4096 204.8
  … 409.6
  … 819.2
  … 1638.4
  … 3276.8
  … 6553.6
  … 13107.2
  … 26214.4
  2^20 52428.8
  
  So whether the frequency $\alpha/2$ is close to an integer number of bins depends on the number of bins, and it doesn’t converge.
  
  What is important is whether the accuracy of the representation of $\alpha/2$ in terms of some integer number of FFT bins is small or larger relative to the inherent cycle-frequency resolution of the measurement, which is about $1/T$ . The resolution of the measurement doesn’t change when we add zeros–we are just interpolating by zero padding. But by doing that interpolation, we allow the approximation of $\alpha/2$ by an integer number of FFT bins to be more accurate, yet we do not have the problem of finer resolution that we would have if we increased the data length. So we get more accurate results with the same data.
  
  Moreover, increasing the data-block length to better represent one cycle frequency can result in a worse representation for another cycle frequency exhibited by the data. Zero-padding helps them all, or at least does no harm (if, say the cycle frequencies really are of the form $\alpha = k/2^M$ ).
  
  Here is my offered evidence. (This was worth doing because I think we all struggle with this problem in the CSP community.) Sticking with our old friend the rectangular-pulse BPSK signal with bit rate $1/10$ , carrier offset of $0.05$ , unit power, and noise with $N0 = -10$ dB, we can plot the peak spectral correlation and spectral coherence magnitudes for a particular implementation of the FSM and a variety of block lengths $T$ . First the spectral correlation:
  
  Next the coherence:
  
  Notice that the cycle frequency of $0.5 = 5/10$ is unaffected by either zero padding. That is because that cycle frequency is always perfectly represented by an integer number of FFT bins. The others are affected though.
  
  What do you say?
  
  Reply
  1. Clint says:
    
    May 4, 2021 at 3:14 pm
    
    Thanks for the detailed response, Chad! I think you have some entire posts that are shorter…
    
    Question on the coherence calculation – are you using the same FSM with the same zero-padding factor to compute the PSD and then the coherence for each case? It seems that the coherence values have higher variance than the SCF values themselves, though that might be deceptive plot scaling.
    
    Your response spurred me to go back and improve my understanding of spectral and cycle frequency resolution (and temporal resolution, though my intuition is still terrible on what it means).
    
    For my immediate purposes, I can select my data record lengths fairly arbitrarily, though I now (somewhat) better understand your comment about longer data records giving rise to higher cycle-frequency resolution (which has its own challenges).
    
    I’m still wrapping my head around the fact that the FSM and the TSM both have approximately the same cycle frequency resolution. In your posts on resolution, you tended to focus on the FSM, which I think I (now) have a good handle on. I’m not clear on why the TSM has the cycle frequency resolution that it does. It “feels” like it should have a coarser cycle frequency resolution.
    I’m also a little confused about the TSM relative to our zero-padding discussion, since the TSM (by definition) has far fewer FFT bins than the FSM, and thus ideal alpha/2 shifts will generally be further away from exact integers. However, zero-padding the entire signal wouldn’t help anything with the TSM, though I guess you could zero-pad each block. Not sure that that even makes sense though, as it wouldn’t? change the cycle frequency resolution relative to just increasing the block length.
    
    Trying to explore the implications of and need for zero-padding – I tried to relate zero-padding to your post on the resolution product (https://cyclostationary.blog/2016/01/08/scf-estimate-quality-the-resolution-product/).
    
    Let’s say we have a signal with N samples (in normalized units, I believe its temporal resolution is thus T=N). The spectral resolution, for the FSM, is then the length of the smoothing window, g, divided by N. And the resolution product then equals len(g)/N * N = len(g).
    If we zero-pad the signal, I’m guessing that since zero-padding in time is like interpolating in frequency, then the spectral resolution is still just equal to len(g)/N. Or should it be len(g) / (N + num_zeros)? If the latter case, then is it appropriate to increase the length of g to maintain the same ratio as len(g)/N? Otherwise, the resolution product would shrink with the additional zero-padding, increasing the variance of the SCF estimate.
    
    Lastly, why not just stick with shorter FFTs and interpolate the result in the frequency domain onto the finer grid? Seems like that could save a good bit of computation. The FFT is O( N log(N) ), while interpolation is just O(N). Perhaps better yet, why not just interpolate to the frequency points needed for the SCF calculation rather than do integer shifts of the data?
    
    Reply
    1. Chad Spooner says:
      
      May 5, 2021 at 8:39 am
      
      are you using the same FSM with the same zero-padding factor to compute the PSD and then the coherence for each case?
      
      Yes.
      
      I’m still wrapping my head around the fact that the FSM and the TSM both have approximately the same cycle frequency resolution
      
      Yes, the FFTs are typically quite short in the TSM, say, $64$ points or $128$ points. So the FFT bin size is wide, and many different cycle frequencies $\alpha$ will map to the same set of FFT-shift-up and FFT-shift-down indices for each cyclic periodogram. So from that point of view, the resolution in $\alpha$ must be coarse. But remember there is another operation in the TSM that involves the cycle frequency $\alpha$ , and that is the cyclic-periodogram averaging operation, which involves the phase factor $e^{i 2 \pi \alpha k N_{TSM}}$ , where $N_{TSM}$ is that short TSM FFT length. That operation narrows the effective cycle-frequency resolution so that in the end, if you process $N$ samples, you’ll get a cycle-frequency resolution of $1/N$ , just like in the FSM and SSCA. This is fairly easy to show mathematically, but I’ve not posted that to the CSP Blog yet.
      
      However, zero-padding the entire signal wouldn’t help anything with the TSM, though I guess you could zero-pad each block.
      
      Yes, zero-padding in the TSM means zero-padding each short block prior to Fourier transformation, and you get about the same kind of improvement that you get with zero-padding in the FSM.
      
      Let’s say we have a signal with N samples (in normalized units, I believe its temporal resolution is thus T=N). The spectral resolution, for the FSM, is then the length of the smoothing window, g, divided by N. And the resolution product then equals len(g)/N * N = len(g).
      If we zero-pad the signal, I’m guessing that since zero-padding in time is like interpolating in frequency, then the spectral resolution is still just equal to len(g)/N.
      
      Well, after zero-padding, the frequency increments in the FFT are twice as narrow, so to keep the smoothing window width consistent in terms of Hz, you will have a window that is twice as many FFT bins in length as before:
      
      No zero-padding: len(g) = G, total data samples = N, resolution product = (N) (G/N) = G
      
      Zero-padding factor of two: len(g) = 2G, total data samples = N, resolution product = (N) (2G/2N) = G
      
      Lastly, why not just stick with shorter FFTs and interpolate the result in the frequency domain onto the finer grid? Seems like that could save a good bit of computation. The FFT is O( N log(N) ), while interpolation is just O(N). Perhaps better yet, why not just interpolate to the frequency points needed for the SCF calculation rather than do integer shifts of the data?
      
      I’m not sure I’m quite following this, but suppose you are doing the FSM for some cycle frequency $\alpha$ , and you are trying to estimate $S_x^\alpha(f)$ , and suppose further that the total number of processed samples is $N$ and $\alpha \ll 1$ . Say, $\alpha = 0.1$ or $\alpha = 0.01$ (this is a typical range). Then the extent of the function $S_x^\alpha(f)$ in spectral frequency $f$ is large compared to the size of $\alpha$ . That is, you have to estimate the spectral correlation function $S_x^\alpha(f)$ for many $f$ . So you need to “by hand” interpolate to find $X_T(f\pm\alpha/2)$ for a lot of $f\pm\alpha/2$ . Doing the zero-padding gets all of those at once. If you had one $f$ and one $\alpha$ , then you are doing a point estimate of the spectral correlation function, and I think the FSM and TSM are overkill–they are meant to efficiently estimate the function-of-frequency $S_x^\alpha(f)$ . Am I following you?
      
      Reply
      1. Clint says:
        
        May 5, 2021 at 1:17 pm
        
        That clears up my thinking on most of the points; thanks!
        
        Regarding the interpolation, let me try to explain my thinking a bit more.
        As you suggest, let’s consider a signal x(n) with N samples.
        
        We can get X(f) by taking the FFT of x, which requires O( N log(N) ) operations.
        
        Let’s also zero-pad x(n) with P zeros to give x_pad(n) with M samples (M = N + P). Taking the FFT gives X_pad(f), which requires O( M log(M) ) operations.
        
        To estimate S_x^alpha(f) with the FSM, we perform two shift operations on X_pad(f) to approximate X(f +/- alpha/2), and then multiply and smooth. This takes two sets of operations that are each O(M) (assuming the shifts are achieved through artistic indexing and thus essentially zero operations).
        
        So, with zero-padding, it requires something like O( M (2 + log(M)) ) operations, and none of the resolutions are changed, though the accuracy of the cycle frequency is improved.
        
        Alternatively, we could interpolate X(f) onto two frequency grids given by f +/- alpha/2 (e.g. using ‘interp1()’ in Matlab/Octave or NumPy’s interp() function) and multiply the result and smooth. This corresponds to four sets of operations that are each O(N).
        
        With frequency-domain interpolation instead of zero-padding in time, we need O( N (4 + log(N)) ). And I believe all the resolutions would be the same, while achieving near-perfect accuracy of the cycle frequency. (And having fewer data points.)
        
        I’m playing fast and loose with big-O notation here, but if P = N, then zero-padding takes O( 2N (2 + log(2N)) ) = O( 2N (3 + log(N)) ), while interpolation takes O( N (4 + log(N)) ).
        
        In other words, for zero-padding by a factor of two, interpolation as I describe it _might_ save you a factor of two in number of operations. (Big-O notation was never meant to be used as precisely as I used it above, and it’s really going to come down to which FFT and interpolation implementations you use and exactly how many operations they require.)
        
        So – interpolation may be more competitive for zero-padding by larger factors, and you could probably save a bit of computation for larger alpha values that result in substantial shrinking of the frequency domain. But it’s certainly not a big enough difference to matter until someone starts processing real data on a substantial scale.
        
        Does that make sense? Do you agree that interpolation in the frequency domain would be at least as accurate as zero-padding in the time domain? Or am I missing something?
      2. Chad Spooner says:
        
        May 7, 2021 at 2:44 pm
        
        Does that make sense? Do you agree that interpolation in the frequency domain would be at least as accurate as zero-padding in the time domain? Or am I missing something?
        
        I think it can be, but there is no guarantee because not all interpolation methods are equivalent in terms of computational cost and outcome.
        
        Here is an example illustrating that linear interpolation (using your suggestion: MATLAB’s interp1.m) does not improve the situation whereas zero-padding does:
        
        The signal is a unit-amplitude complex-valued sine wave with a frequency that lies between two FFT bin centers: $f_0 = 0.25 + 1.5/8192$ , where the FFT size is $8192$ . Zero padding brings out the full strength of the tone in one bin, as expected because the frequency is exactly equal to an integer number of bins of width $1/(2*8192)$ . But interpolating the complex-valued FFT using interp1.m results in a zero at that bin. Interpolating the magnitude of the FFT just replicates the value on either side of the bin for $f_0$ .
        
        Do you buy this?
Clint says:

May 14, 2021 at 7:37 pm

That does make sense. Can’t argue with the example you presented! 😉
I’d like to play around with it some, taking the process all the way through to estimating the SCF and tracking any effects to completely convince myself.

Fortunately or unfortunately, I’ve been caught up in working to better understand resolution and details like what you were explaining earlier in our conversation, applying various methods to the signals I currently care about.

For anyone following our discussion here, I’d also recommend looking through some related comments under the FFT Accumulation Method post. Several points there helped me to clarify my understanding of resolution and related matters.

Reply
Pingback: 1,000 Page Views!!! - Wave Walker DSP
Willis says:

May 25, 2022 at 10:22 am

Hi Dr. Spooner,

I’m sorry if this is a duplicate post. My first comment produced an error when I hit “Post Comment”. I’ll try again.

First, thank you *very* much for your blog. It is absolutely fantastic!

I feel like I’m understanding this particular post but I think there might be a small typo. When choosing two different shifts (i.e. $a_1/N$ and $a_2/N$ ) other than $+\alpha/2$ and $-\alpha/2$ , what we really care about is the distance between $a_1/N$ and $a_2/N$ since we want that to be (exactly or very close to) $\alpha$ . If I’m understanding that accurately then I believe the statement “The idea is to choose the shifts $a_1$ and $a_2$ such that $\left| \alpha - |a_1/N| - |a_2/N| \right|$ is minimized” should actually use the formula $\left| \alpha - |a_1/N - a_2/N| \right|$ .

For a dumb example, if $\alpha = 2/N$ and $a_1 = a_2 = 1$ then $\left| \alpha - |a_1/N| - |a_2/N| \right| = \left| 2/N - 1/N - 1/N \right| = 0$ so the formula as given is minimized but the distance between $a_1$ and $a_2$ is 0 rather than $\alpha$ .

For another example, if $\alpha = 2/N$ and $a_1 = 12$ and $a_2 = 10$ then $\left| \alpha - |a_1/N| - |a_2/N| \right| = 20/N$ but $\left| \alpha - |a_1/N - a_2/N| \right| = 0$ showing that the distance between $a_1/N$ and $a_2/N$ is equal to $\alpha$ , producing the desired shift.

If I’ve missed something fundamental here, please let me know. Thanks again for this blog!

Reply
1. Chad Spooner says:
  
  May 28, 2022 at 10:11 am
  
  Thanks for checking out the CSP Blog, Willis, and for your generous compliments and great first comment. Very much appreciated!
  
  Also, great job on using latex to format your symbols. I did edit your comment because enclosing the latex fragments between two dollar signs is not enough in WordPress–you must also include the keyword ‘latex’ just after the first dollar sign. If we do $\alpha$ we just see the latex code but if you put the word latex just after the left dollar sign, add a space, then the \alpha, you get $\alpha$ .
  
  Thought-provoking comment! I was reading it late last evening and was thinking, ‘Uh-oh, messed up again.’ But looking at it this morning, I’m not so sure.
  
  When choosing two different shifts (i.e. $a_1/N$ and $a_2/N$ ) other than $+\alpha/2$ and $-\alpha/2$ , what we really care about is the distance between $a_1/N$ and $a_2/N$ since we want that to be (exactly or very close to) $\alpha$ .
  
  I think what we are focusing on is the distance between $\alpha/2$ and $-\alpha/2$ , because we have to take into account the negative sign in the second factor of the cyclic periodogram. The two shifts move the Fourier transform to the left by $\alpha/2$ and to the right by $\alpha/2$ , for a total shift of $\alpha$ . It’s that negative sign that is causing some confusion here.
  
  If we approximate the two shifts by $a_1/N$ and $a_2/N$ , the effective shift is related to $a_1 + a_2 = a_1 - (-a_2)$ . I have to wade through a tangle of absolute value signs if I want to make sure I’m including negative cycle frequencies. I have slightly clarified the formula in the post, but I’m (so far) sticking with the basic formula you are grappling with.
  
  For a dumb example, if $\alpha = 2/N$ and $a_1 = a_2 = 1$ then $\left| \alpha - |a_1/N| - |a_2/N| \right| = \left| 2/N - 1/N - 1/N \right| = 0$ so the formula as given is minimized but the distance between $a_1$ and $a_2$ is 0 rather than $\alpha$ .
  
  Yes, since $a_1 = a_2$ , the distance (difference) between $a_1$ and $a_2$ is zero, but that is not the same as the effective shift. $a_1 = a_2 = 1$ shifts the left factor in the cyclic periodogram one way by 1 FFT bin and the other factor by 1 the other way, for a total shift of two bins, which is the desired cycle frequency $2/N$ .
  
  For another example, if $\alpha = 2/N$ and $a_1 = 12$ and $a_2 = 10$ then $\left| \alpha - |a_1/N| - |a_2/N| \right| = 20/N$ but $\left| \alpha - |a_1/N - a_2/N| \right| = 0$ showing that the distance between $a_1/N$ and $a_2/N$ is equal to $\alpha$ , producing the desired shift.
  
  But choosing $a_1 = 12$ and $a_2 = 10$ corresponds to a total shift in the cyclic periodogram of $22/N$ , which is not the desired cycle frequency. This is consistent with the fact that my formula is not minimized, as you point out, so those two $a_i$ are not good candidates for $\alpha = 2/N$ .
  
  It gets a little more complicated (and annoying) when you consider the conjugate cyclic periodogram and conjugate spectral correlation function.
  
  Buying it?
  
  Reply
Sandford says:

June 20, 2023 at 4:18 am

Hi Again Chad,

I have successfully implemented the FSM using some of the code people have posted, in particular for the frequency shift, the time domain shift is working well:

alpha=0.1
x1=y_of_t.*exp(-1i*2*pi*alpha/2.*[1:length(y_of_t)])
x2=y_of_t.*exp(1i*2*pi*alpha/2.*[1:length(y_of_t)])

X1=fftshift(fft(x1))
X2=fftshift(fft(x2))

S=X1.*conj(X2);
S=S/length(S);

s=rectwin(164)*(1/164)

f=conv(S,s)

freq=(-length(f)/2+1:length(f)/2)/(length(f));

plot(freq,abs(f))

Using BPSK 40,000 length signal. My question is, how can I perform the frequency shift in the frequency domain using equation (8) or (10)? My attempt was as follows:

Assuming a sampling rate of 1 MHz, left me with a bit rate of 100 kHz and that carrier frequency of 50 kHz, using your calculation from previous post this would get me alpha = 100 kHz for the first cyclic frequency or 0.1 normalised. I used straight fft on the 40k samples to calculate nfft=40000, fs=1 M, therefore frequency res = 25 Hz.

So to achieve alpha = 100 kHz I would need to move signal one to the right 4000 bins and signal two to the left 4000 bins. I chose to move signal 2 down 8000 bins instead and cut off the overlap at the end. But this does not produce the same result as the first code.

shift=-8000

x1=y_of_t
x2=circshift(y_of_t,shift)

X1=fftshift(fft(x1))
X2=fftshift(fft(x2))

S=X1.*conj(X2);
S=S/length(S);

s=rectwin(164)*(1/164)

f=conv(S,s)

freq=(-length(f)/2+1:length(f)/2)/(length(f));

plot(freq,abs(f))

Reply
1. Chad Spooner says:
  
  June 20, 2023 at 10:01 am
  
  Love the progress!
  
  I confirm that the first code snippet produces a good estimate of $S_x^{0.1}(f)$ for the rectangular-pulse BPSK signal.
  
  Regarding the second snippet, where you attempt to implement the shift in the frequency domain, you are actually still shifting things in the time domain:
  
  shift=-8000
  x1=y_of_t
  x2=circshift(y_of_t,shift)
  
  So the rest of that snippet then computes an estimate of the cross spectral density between y_of_t and a delayed (by 8000 samples) version of y_of_t.
  
  The non-conjugate cyclic periodogram is
  
  $\displaystyle I_x^\alpha(f) = \frac{1}{T} X_T(f+\alpha/2) X_T^*(f-\alpha/2)$
  
  so that you have to apply the shifts of $\pm \alpha/2$ after you compute the Fourier transform (using fft.m) $X_T(f)$ .
  
  Reply
  1. Sandford says:
    
    June 23, 2023 at 4:45 am
    
    Thanks Chad,
    A rookie mistake… I dunno how I missed that. It works now.
    
    Reply
    1. Chad Spooner says:
      
      June 23, 2023 at 8:29 am
      
      I make three rookie mistakes before I finish my morning coffee most days!
      
      Reply
Vince says:

May 7, 2024 at 10:00 am

Hello!

I was trying to intuitively try to figure out what this algorithm is doing with the BPSK example and how to to find the chip rates of the BPSK signal.

1) Calculate the frequency domain of the signal with the FFT.
2) shifting the resulting frequency by +/- the desired chip rate (cycle frequency)
3) Add up the cross product of the two, shifted spectrums, to see if the frequency components line up.

For example, if my BPSK signal had a chip rate of 1 MHz, i would want my cycle frequency of interest to be the 1 MHz/2 = +/- 0.5 MHz.

Does that make sense or am i totally off?

Reply
1. Chad Spooner says:
  
  May 7, 2024 at 2:37 pm
  
  Welcome to the CSP Blog Vince! I appreciate your readership and the comment.
  
  Generally speaking, the FSM isn’t an efficient way of finding symbol or chip rates. It is highly efficient, though, if you already know a cycle frequency, or a few, and want to estimate the spectral correlation function. See the computational cost post.
  
  If you do know the symbol rate, then a non-conjugate cycle frequency for BPSK is that symbol rate, $\alpha = f_{sym} = 1$ MHz. In your Step 2,
  
  2) shifting the resulting frequency by +/- the desired chip rate (cycle frequency)
  
  you’d want to shift the obtained FFT up and down by $\alpha/2 = 500$ kHz, rather than shifting by $\alpha$ itself.
  
  Then in the third step,
  
  3) Add up the cross product of the two, shifted spectrums, to see if the frequency components line up.
  
  you’d want to smooth (average through convolution) the resulting product of shifted FFTs. To really see if you’ve got a valid feature, though, you’d want to take it one step further and compute the spectral coherence function.
  
  If you want to search for valid cycle frequencies, try the strip spectral correlation algorithm or the FFT accumulation method. These search the entire principal domain for significant cycle frequencies.
  
  Reply
  1. vince says:
    
    May 8, 2024 at 10:24 am
    
    Thanks for the info. I am trying to just search for a few, select chip rates (cycle frequencies) to start out to get a feel for what I am doing before I move on to the more exhaustive search algorithms.
    
    I just want to make sure I am doing what was discussed in your response.
    
    I am just looking for cycle frequencies at 1 MHz, 2 MHz, 3 MHz, 4 MHz, and 5 MHz.
    
    here is the output plot of my code below: https://imgur.com/a/fFQUTHk
    
    I do see the correct cylce frequencies pop out at the correct carrier frequencies. It does seem like the other cycle frequencies are relatively close in power to the correct one? is there anything to do to help with that?
    
    Sorry if my questions are a little rambling!
    
    Attached is some code I wrote:
    
    % Example usage:
    % Assuming input_signal is your input signal
    fs_hz = 80e6;
    
    % Create two seperate psk signals and add them together
    x = pskmake2(fs_hz,-12e6,3e6,13e-6,1); % fs, carrier freq, chip rate, duration, quanta
    x2 = pskmake2(fs_hz,+20e6,4e6,13e-6,1); % fs, carrier freq, chip rate, duration, quanta
    noise = 1*(randn(1,length(x)) + randn(1,length(x)))/sqrt(2);
    bpsk_sig = x+x2+ noise;
    
    figure; hold on;
    
    time_vec = (0:length(bpsk_sig)-1);
    
    for aa = 1e6:1e6:5e6
    
    % Desired Chip Rate to Search for
    chip_rate = aa;
    
    % Shift Signal by + Alpha/2
    u = fft(bpsk_sig.*exp(-1i*2*pi*chip_rate/2/fs_hz*time_vec));
    
    % Shift Signal by – Alpha/2
    v = fft(bpsk_sig.*exp(+1i*2*pi*chip_rate/2/fs_hz*time_vec));
    
    output_1 = u.*conj(v);
    
    win_len = 20;
    
    smoothed_output = conv(output_1,ones(1,win_len))/win_len;
    
    freq_axis = linspace(-fs_hz/2e6,fs_hz/2e6,length(smoothed_output));
    plot(freq_axis,10*log10(abs(fftshift(smoothed_output))))
    
    end
    range = 1e6:1e6:5e6;
    legendCell = strcat(‘Alpha / 2 = ‘,’ ‘,string(num2cell(range/1e6/2)),’ MHz’);
    legend(legendCell)
    ylim([20 70])
    grid on
    
    xlabel(‘Frequency (MHz)’)
    ylabel(‘dB’)
    title(‘Cyclic Spectrum at Desired Cycle Frequency Cuts’)
    
    Reply
    1. vince says:
      
      May 9, 2024 at 6:47 am
      
      Also, Was my comment about, in a broad sense, that shifting the frequency domain by +/- alpha/2, that we are just seeing how well the frequency components from the modulation line up?
      
      Reply
      1. Chad Spooner says:
        
        May 9, 2024 at 10:55 am
        
        I’m not sure what you mean by “line up.”
        
        The physical interpretation of spectral correlation is laid out in the spectral correlation post–be sure to read all the way to the end. The idea is that two narrowband (relative to the incoming-signal bandwidth) bandpass filters are used to extract two complex-envelope data streams. The separation between the center frequencies of the two filters’ passbands is the cycle frequency $\alpha$ . These two narrowband lowpass signals are then correlated over time. If that correlation is not zero, then we say that the incoming data exhibits cyclostationarity with cycle frequency $\alpha$ . The mathematics of this physical idea leads directly to the time-smoothing method (TSM) of spectral correlation.
        
        It turns out, though, that the time-smoothed cyclic periodogram estimator is closely approximated by the frequency-smoothed cyclic periodogram estimator. You can time average a large number of short-duration cyclic periodograms (the TSM), or you can frequency average one long-duration cyclic periodogram (the FSM).
    2. Chad Spooner says:
      
      May 9, 2024 at 10:39 am
      
      I do see the correct cylce frequencies pop out at the correct carrier frequencies. It does seem like the other cycle frequencies are relatively close in power to the correct one? is there anything to do to help with that?
      
      I have a couple things for you to consider.
      
      Most importantly, I suggest you think about your philosophy of software development and validation here. You’ve created an input containing a couple signals and noise, and you are applying a spectral correlation estimator to that input. How will you know if your code is correct? Do you know the truth here? I’ve seen this many times: estimator code is written and immediately applied to data of interest. I suggest you apply the code to data for which the ideal spectral correlation function is perfectly known mathematically and for which validated estimates are already available. That is why I wrote the Second-Order Verification post. I strongly suggest you develop your code so that you can replicate all the estimates I provide in that post. Then you are in a good position to move on to data of interest.
      
      For the provided code:
      
      * I note that you are generating 80 MHz * 13 us = 1040 samples, which is a small number of samples relative to the symbol durations, which are (80e6 Samp/sec)/(3e6 sym/sec) = 26.7 samp/sym and (80e6/4e6) = 20 samp/sym. So your processed data contains only 1040/26.7 = 40 symbols for the first rate!
      
      * Moreover, I assume pskmake2.m creates a unit-power signal, and you are adding comparable noise, so you have a short data record with lots of noise. This is not ideal for debugging/developing the estimator.
      
      * output_1 = u.*conj(v) is not the cyclic periodogram (8). That is why your plots show such large values: you neglected the factor of 1/N.
      
      * You should always include zero as a cycle frequency as you develop here, so you can always visually check the PSD–is it reflective of the intended RF scene?
      
      Reply
      1. vince says:
        
        May 9, 2024 at 12:36 pm
        
        Ok – i will take your recommendations and comments and go through another iteration. Thank you so much! Slowly i will get there.
      2. Chad Spooner says:
        
        May 9, 2024 at 12:47 pm
        
        You are the reason I do the CSP Blog! Thanks for reading, and I hope to hear about your success soon.
Vince says:

May 14, 2024 at 5:50 am

So after reading, is the bandpass filters accomplished by the FFT? i.e., the bandwidth of the filters are the bin sizes of the FFT. And then the down conversion process is shifting the signal by +/- alpha to bring the desired components to baseband.

Did i comprehend that correctly?

Reply
1. Chad Spooner says:
  
  May 15, 2024 at 2:49 pm
  
  Maybe? The physical interpretation is obscured a bit in the context of the time-smoothing and frequency-smoothing methods. Remember that in those methods, we are looking at a single value of cycle frequency, but a large number of values of spectral frequency (simultaneously).
  
  Reply