Core methods#
Time series methods#
restrict#
restrict is used to get time points within an IntervalSet. This method is available for TsGroup, Tsd, TsdFrame, TsdTensor and Ts objects.
tsdframe.restrict(epochs)
Time (s) a b c
---------- ----------- ---------- ---------
10.0 -1.70779 0.426821 -1.18807
11.0 0.275069 0.121761 -0.128662
12.0 -0.814533 0.0340065 -0.714543
13.0 -2.29497 -1.08473 0.532381
14.0 -0.325769 -0.0978819 2.06828
15.0 0.00718536 -1.12918 -0.475541
16.0 0.0988638 -0.0892249 0.649435
...
74.0 -0.867435 0.52065 2.02102
75.0 1.53763 -0.033928 0.272118
76.0 2.43595 -0.989441 -0.695611
77.0 -0.178886 0.288484 -0.127145
78.0 0.791889 0.201182 -0.46272
79.0 -0.838362 -3.15624 3.29988
80.0 -0.409936 -1.16035 -0.310777
dtype: float64, shape: (32, 3)
This operation update the time support attribute accordingly.
print(epochs)
print(tsdframe.restrict(epochs).time_support)
index start end
0 10 25
1 65 80
shape: (2, 2), time unit: sec.
index start end
0 10 25
1 65 80
shape: (2, 2), time unit: sec.
in_interval#
in_interval is similar to restrict, except instead of returning the restricted time series, it returns a Tsd of booleans for each time point indicating whether or not it falls within the intervals of an IntervalSet.
tsdframe.in_interval(epochs)
Time (s)
---------- --
0.0 0
1.0 0
2.0 0
3.0 0
4.0 0
5.0 0
6.0 0
...
93.0 0
94.0 0
95.0 0
96.0 0
97.0 0
98.0 0
99.0 0
dtype: bool, shape: (100,)
count#
count returns the number of timestamps within bins or epochs of an IntervalSet object.
This method is available for TsGroup, Tsd, TsdFrame, TsdTensor and Ts objects.
With a defined bin size:
count = tsgroup.count(bin_size=1.0, time_units='s')
print(count)
Time (s) 0 1 2
---------- --- --- ---
0.5 0 1 0
1.5 0 0 0
2.5 0 0 0
3.5 0 0 1
4.5 0 0 1
5.5 0 0 0
6.5 1 0 0
...
93.5 0 0 1
94.5 0 0 0
95.5 0 0 0
96.5 0 0 0
97.5 0 0 0
98.5 0 0 1
99.5 0 0 0
dtype: int64, shape: (100, 3)
With an IntervalSet:
count_ep = tsgroup.count(ep=epochs)
print(count_ep)
Time (s) 0 1 2
---------- --- --- ---
17.5 1 1 7
72.5 2 3 8
dtype: int64, shape: (2, 3)
trial_count#
TsGroup and Ts objects each have the method trial_count, which builds a trial-based count tensor from an IntervalSet object.
Similar to count, this function requires a bin_size parameter which determines the number of time bins within each trial.
The resulting tensor has shape (number of group elements, number of trials, number of time bins) for TsGroup objects,
or (number of trials, number of time bins) for Ts objects.
ep = nap.IntervalSet([5, 17, 30, 50], metadata={'trials':[1, 2]})
tensor = tsgroup.trial_count(ep, bin_size=2)
print(tensor, "\n")
print("Tensor shape = ", tensor.shape)
[[[ 1. 1. 0. 1. 0. 0. nan nan nan nan]
[ 0. 0. 0. 1. 0. 0. 0. 0. 1. 0.]]
[[ 0. 0. 0. 0. 0. 0. nan nan nan nan]
[ 0. 0. 0. 1. 1. 2. 0. 1. 0. 1.]]
[[ 0. 0. 0. 0. 0. 4. nan nan nan nan]
[ 1. 0. 0. 1. 2. 0. 0. 0. 1. 0.]]]
Tensor shape = (3, 2, 10)
The array is padded with NaNs when the trials have uneven durations,
The padding value can be controlled using the parameter padding_value.
Additionally, the parameter align can change whether the count is aligned to the “start” or “end” of each trial.
tensor = tsgroup.trial_count(ep, bin_size=2, align="end", padding_value=-1)
print(tensor, "\n")
print("Tensor shape = ", tensor.shape)
[[[-1. -1. -1. -1. 1. 1. 0. 1. 0. 0.]
[ 0. 0. 0. 1. 0. 0. 0. 0. 1. 0.]]
[[-1. -1. -1. -1. 0. 0. 0. 0. 0. 0.]
[ 0. 0. 0. 1. 1. 2. 0. 1. 0. 1.]]
[[-1. -1. -1. -1. 0. 0. 0. 0. 0. 4.]
[ 1. 0. 0. 1. 2. 0. 0. 0. 1. 0.]]]
Tensor shape = (3, 2, 10)
bin_average#
bin_average downsamples time series by averaging data point falling within a bin. This method is available for Tsd, TsdFrame and TsdTensor. While bin_average is good for downsampling with precise control of the resulting bins, it does not apply any antialiasing filter. The function decimate is also available for down-sampling without aliasing.
tsdframe.bin_average(3.5)
Time (s) a b c
---------- ---------- ----------- ----------
1.75 0.252831 0.0975991 0.371393
5.25 0.445295 -0.736343 -0.797215
8.75 -0.488128 0.273993 -0.469864
12.25 -0.944811 -0.309654 -0.103608
15.75 -0.0604766 -0.56759 0.239264
19.25 0.844943 -0.281723 0.290582
22.75 -0.126312 -1.00583 0.296432
...
75.25 1.03538 -0.167573 0.532509
78.75 -0.158824 -0.956731 0.59981
82.25 0.0596233 -0.00404422 -0.205578
85.75 -0.30696 0.337507 0.447115
89.25 0.211241 -0.227626 0.0559064
92.75 -0.136772 0.51948 -0.168265
96.25 0.0262991 -0.698451 0.855736
dtype: float64, shape: (28, 3)
decimate#
The decimate method downsamples the time series by an integer factor after an antialiasing filter.
new_tsd = tsd.decimate(down=4)
The original time series was sampled at 1Hz. The new time series has a rate of 0.25 Hz.
print(f"Original rate : {tsd.rate}")
print(f"New rate : {new_tsd.rate}")
Original rate : 1.0
New rate : 0.25
interpolate#
The interpolate method of Tsd, TsdFrame and TsdTensor can be used to fill gaps in a time series. It is a wrapper of numpy.interp.
new_tsd = tsd.interpolate(ts)
value_from#
By default, value_from assigns to timestamps the closest value in time from another time series. Let’s define the time series we want to assign values from.
For every timestamps in tsgroup, we want to assign the closest value in time from tsd.
tsgroup_from_tsd = tsgroup.value_from(tsd)
We can display the first element of tsgroup and tsgroup_sin.
The argument mode can control if the nearest target time is taken before or
after the reference time.
In this case, the variable ts receive data from the time point before.
new_ts_before = ts.value_from(tsd, mode="before")
If there is no time point found before or after or within the interval, the function assigns Nans.
tsd = nap.Tsd(t=np.arange(1, 10, 1), d=np.arange(10, 100, 10))
ep = nap.IntervalSet(start=0, end = 10)
ts = nap.Ts(t=[0, 9])
# First ts is at 0s. First tsd is at 1s.
ts.value_from(tsd, ep=ep, mode="before")
Time (s)
---------- ---
0 nan
9 90
dtype: float64, shape: (2,)
threshold#
The method threshold of Tsd returns a new Tsd with all the data above or below a certain threshold. Default is above. The time support of the new Tsd object get updated accordingly.
tsd_above = tsd.threshold(0.5, method='above')
This method can be used to isolate epochs for which a signal is above/below a certain threshold.
epoch_above = tsd_above.time_support
derivative#
The derivative method of Tsd, TsdFrame and TsdTensor can be used to calculate the derivative of a time series with respect to time. It is a wrapper of numpy.gradient.
derivative = tsd.derivative(ep=ep)
to_trial_tensor#
Tsd, TsdFrame, and TsdTensor all have the method to_trial_tensor, which creates a numpy array from an IntervalSet by slicing the time series. The resulting tensor has shape (shape of time series, number of trials, number of time points), where the first dimension(s) is dependent on the object.
tsd = nap.Tsd(t=np.arange(0, 100, 1), d=np.sin(np.arange(0, 10, 0.1)))
ep = nap.IntervalSet([0, 10, 30, 50, 70, 75], metadata={'trials':[1, 2, 3]})
print(ep)
index start end trials
0 0 10 1
1 30 50 2
2 70 75 3
shape: (3, 2), time unit: sec.
The following example returns a tensor with shape (3, 21), for 3 trials and 21 time points, where the first dimension is dropped due to this being a Tsd object.
tensor = tsd.to_trial_tensor(ep)
print(tensor, "\n")
print("Tensor shape = ", tensor.shape)
[[ 0. 0.09983342 0.19866933 0.29552021 0.38941834 0.47942554
0.56464247 0.64421769 0.71735609 0.78332691 0.84147098 nan
nan nan nan nan nan nan
nan nan nan]
[ 0.14112001 0.04158066 -0.05837414 -0.15774569 -0.2555411 -0.35078323
-0.44252044 -0.52983614 -0.61185789 -0.68776616 -0.7568025 -0.81827711
-0.87157577 -0.91616594 -0.95160207 -0.97753012 -0.993691 -0.99992326
-0.99616461 -0.98245261 -0.95892427]
[ 0.6569866 0.72896904 0.79366786 0.85043662 0.8987081 0.93799998
nan nan nan nan nan nan
nan nan nan nan nan nan
nan nan nan]]
Tensor shape = (3, 21)
Since trial 2 is twice as long as trial 1, the array is padded with NaNs. The padding value can be changed by setting the parameter padding_value.
tensor = tsd.to_trial_tensor(ep, padding_value=-1)
print(tensor, "\n")
print("Tensor shape = ", tensor.shape)
[[ 0. 0.09983342 0.19866933 0.29552021 0.38941834 0.47942554
0.56464247 0.64421769 0.71735609 0.78332691 0.84147098 -1.
-1. -1. -1. -1. -1. -1.
-1. -1. -1. ]
[ 0.14112001 0.04158066 -0.05837414 -0.15774569 -0.2555411 -0.35078323
-0.44252044 -0.52983614 -0.61185789 -0.68776616 -0.7568025 -0.81827711
-0.87157577 -0.91616594 -0.95160207 -0.97753012 -0.993691 -0.99992326
-0.99616461 -0.98245261 -0.95892427]
[ 0.6569866 0.72896904 0.79366786 0.85043662 0.8987081 0.93799998
-1. -1. -1. -1. -1. -1.
-1. -1. -1. -1. -1. -1.
-1. -1. -1. ]]
Tensor shape = (3, 21)
By default, time series are aligned to the start of each trial. To align the time series to the end of each trial, the optional parameter align can be set to “end”.
tensor = tsd.to_trial_tensor(ep, align="end")
print(tensor, "\n")
print("Tensor shape = ", tensor.shape)
[[ nan nan nan nan nan nan
nan nan nan nan 0. 0.09983342
0.19866933 0.29552021 0.38941834 0.47942554 0.56464247 0.64421769
0.71735609 0.78332691 0.84147098]
[ 0.14112001 0.04158066 -0.05837414 -0.15774569 -0.2555411 -0.35078323
-0.44252044 -0.52983614 -0.61185789 -0.68776616 -0.7568025 -0.81827711
-0.87157577 -0.91616594 -0.95160207 -0.97753012 -0.993691 -0.99992326
-0.99616461 -0.98245261 -0.95892427]
[ nan nan nan nan nan nan
nan nan nan nan nan nan
nan nan nan 0.6569866 0.72896904 0.79366786
0.85043662 0.8987081 0.93799998]]
Tensor shape = (3, 21)
time_diff#
Ts, Tsd, TsdFrame, TsdTensor, and TsGroup all have the method time_diff, which computes the time differences between subsequent timepoints.
For example, if a Ts object contained a set of spike times, time_diff would compute the inter-spike interval (ISI).
This method returns a new Tsd object, with values being each time difference, and time indices being their reference time point.
Passing epochs restricts the computation to the given epochs.
The reference time point can be adjusted by the optional align parameter, which can be set to "start", "center", or "end" (the default being "center").
time_diffs = ts.time_diff(align="center")
print(time_diffs)
Time (s)
---------- --
3 4
5.5 1
9 6
14 4
17 2
18.5 1
dtype: float64, shape: (6,)
Setting align="center" sets the reference time point to the midpoint between the timestamps used to calculate the time difference.
Setting align="start" or align="end" sets the reference time point to the earlier or later timestamp, respectively.
Mapping between TsGroup and Tsd#
It’s is possible to transform a TsGroup to Tsd with the method
to_tsd and a Tsd to TsGroup with the method to_tsgroup.
This is useful to flatten the activity of a population in a single array.
tsd = tsgroup.to_tsd()
print(tsd)
Time (s)
------------ --
0.900704546 1
3.599669614 2
4.443214901 2
6.466523673 0
8.319356024 0
12.293351008 0
15.017354866 2
...
85.926443439 0
86.288441949 2
89.129032463 1
91.144232462 1
92.364020335 2
93.258934928 2
98.197731592 2
dtype: float64, shape: (60,)
The object tsd contains all the timestamps of the tsgroup with
the associated value being the index of the unit in the TsGroup.
The method to_tsgroup converts the Tsd object back to the original TsGroup.
back_to_tsgroup = tsd.to_tsgroup()
print(back_to_tsgroup)
Index rate
------- ------
0 0.1
1 0.2
2 0.3
Parameterizing a raster#
The method to_tsd makes it easier to display a raster plot.
TsGroup object can be plotted with plt.plot(tsgroup.to_tsd(), 'o').
Timestamps can be mapped to any values passed directly to the method
or by giving the name of a specific metadata name of the TsGroup.
tsgroup['label'] = np.arange(3)*np.pi
print(tsgroup)
Index rate label
------- ------ -------
0 0.1 0
1 0.2 3.14
2 0.3 6.28
Special slicing: TsdFrame#
For users that are familiar with pandas, TsdFrame is the closest object to a DataFrame, but there are distinctive behavior when slicing the object. TsdFrame behaves primarily like a numpy array. This section lists all the possible ways of slicing TsdFrame.
1. If not column labels are passed#
tsdframe = nap.TsdFrame(t=np.arange(4), d=np.random.randn(4,3))
print(tsdframe)
Time (s) 0 1 2
---------- --------- --------- ---------
0 -0.768861 1.02546 -0.310163
1 -1.31702 0.647238 0.362684
2 -1.17852 1.00554 0.529681
3 0.590634 -2.29274 -1.63448
dtype: float64, shape: (4, 3)
Slicing should be done like numpy array:
tsdframe[0]
array([-0.76886142, 1.0254627 , -0.31016317])
tsdframe[:, 1]
Time (s)
---------- ---------
0 1.02546
1 0.647238
2 1.00554
3 -2.29274
dtype: float64, shape: (4,)
tsdframe[:, [0, 2]]
Time (s) 0 2
---------- --------- ---------
0 -0.768861 -0.310163
1 -1.31702 0.362684
2 -1.17852 0.529681
3 0.590634 -1.63448
dtype: float64, shape: (4, 2)
2. If column labels are passed as integers#
The typical case is channel mapping. The order of the columns on disk are different from the order of the columns on the recording device it corresponds to.
tsdframe = nap.TsdFrame(t=np.arange(4), d=np.random.randn(4,4), columns = [3, 2, 0, 1])
print(tsdframe)
Time (s) 3 2 0 1
---------- --------- ----------- --------- ---------
0 -0.14382 -0.325038 -0.944572 0.566623
1 -1.12361 -1.74113 1.32688 -0.465396
2 0.785126 -0.00346659 -0.719795 0.142208
3 0.844678 -0.807675 0.116848 0.38175
dtype: float64, shape: (4, 4)
In this case, indexing like numpy still has priority which can led to confusing behavior:
tsdframe[:, [0, 2]]
Time (s) 3 0
---------- --------- ---------
0 -0.14382 -0.944572
1 -1.12361 1.32688
2 0.785126 -0.719795
3 0.844678 0.116848
dtype: float64, shape: (4, 2)
Note how this corresponds to column labels 3 and 0.
To slice using column labels only, the TsdFrame object has the loc method similar to Pandas:
tsdframe.loc[[0, 2]]
Time (s) 0 2
---------- --------- -----------
0 -0.944572 -0.325038
1 1.32688 -1.74113
2 -0.719795 -0.00346659
3 0.116848 -0.807675
dtype: float64, shape: (4, 2)
In this case, this corresponds to columns labelled 0 and 2.
3. If column labels are passed as strings#
Similar to Pandas, it is possible to label columns using strings.
tsdframe = nap.TsdFrame(t=np.arange(4), d=np.random.randn(4,3), columns = ["kiwi", "banana", "tomato"])
print(tsdframe)
Time (s) kiwi banana tomato
---------- --------- --------- ---------
0 -0.966766 1.01905 0.616591
1 0.330481 0.556173 -1.29246
2 0.175159 -1.13488 0.824735
3 -0.705439 -1.97152 -1.06509
dtype: float64, shape: (4, 3)
When the column labels are all strings, it is possible to use either direct bracket indexing or using the loc method:
print(tsdframe['kiwi'])
print(tsdframe.loc['kiwi'])
Time (s)
---------- ---------
0 -0.966766
1 0.330481
2 0.175159
3 -0.705439
dtype: float64, shape: (4,)
Time (s)
---------- ---------
0 -0.966766
1 0.330481
2 0.175159
3 -0.705439
dtype: float64, shape: (4,)
4. If column labels are mixed type#
It is possible to mix types in column names.
tsdframe = nap.TsdFrame(t=np.arange(4), d=np.random.randn(4,3), columns = ["kiwi", 0, np.pi])
print(tsdframe)
Time (s) kiwi 0 3.141592653589793
---------- ---------- --------- -------------------
0 0.0245761 1.18928 0.584095
1 -0.455289 0.569711 -0.897161
2 1.74499 -1.71332 0.924258
3 -1.1611 0.134263 -1.13997
dtype: float64, shape: (4, 3)
Direct bracket indexing only works if the column label is a string.
print(tsdframe['kiwi'])
Time (s)
---------- ----------
0 0.0245761
1 -0.455289
2 1.74499
3 -1.1611
dtype: float64, shape: (4,)
To slice with mixed types, it is best to use the loc method:
print(tsdframe.loc[['kiwi', np.pi]])
Time (s) kiwi 3.141592653589793
---------- ---------- -------------------
0 0.0245761 0.584095
1 -0.455289 -0.897161
2 1.74499 0.924258
3 -1.1611 -1.13997
dtype: float64, shape: (4, 2)
In general, it is probably a bad idea to mix types when labelling columns.
Interval sets methods#
Interaction between epochs#
Intervals can be combined in different ways.
epoch1 = nap.IntervalSet(start=[0, 40], end=[10, 50]) # no time units passed. Default is us.
epoch2 = nap.IntervalSet(start=[5, 30], end=[20, 45])
print(epoch1, "\n")
print(epoch2, "\n")
index start end
0 0 10
1 40 50
shape: (2, 2), time unit: sec.
index start end
0 5 20
1 30 45
shape: (2, 2), time unit: sec.
union#
epoch = epoch1.union(epoch2)
print(epoch)
index start end
0 0 20
1 30 50
shape: (2, 2), time unit: sec.
intersect#
epoch = epoch1.intersect(epoch2)
print(epoch)
index start end
0 5 10
1 40 45
shape: (2, 2), time unit: sec.
set_diff#
epoch = epoch1.set_diff(epoch2)
print(epoch)
index start end
0 0 5
1 45 50
shape: (2, 2), time unit: sec.
split#
Useful for chunking time series, the split method splits an IntervalSet in a new IntervalSet based on the interval_size argument.
epoch = nap.IntervalSet(start=0, end=100)
print(epoch.split(10, time_units="s"))
index start end
0 0 10
1 10 20
2 20 30
3 30 40
4 40 50
5 50 60
6 60 70
7 70 80
8 80 90
9 90 100
shape: (10, 2), time unit: sec.
Drop intervals#
epoch = nap.IntervalSet(start=[5, 30], end=[6, 45])
print(epoch)
index start end
0 5 6
1 30 45
shape: (2, 2), time unit: sec.
drop_short_intervals#
print(
epoch.drop_short_intervals(threshold=5)
)
index start end
0 30 45
shape: (1, 2), time unit: sec.
drop_long_intervals#
print(
epoch.drop_long_intervals(threshold=5)
)
index start end
0 5 6
shape: (1, 2), time unit: sec.
merge_close_intervals#
index start end
0 1 6
1 7 45
shape: (2, 2), time unit: sec.
If two intervals are closer than the threshold argument, they are merged.
print(
epoch.merge_close_intervals(threshold=2.0)
)
index start end
0 1 45
shape: (1, 2), time unit: sec.