A 0.0023 mm²/ch. Delta-Encoded, Time-Division Multiplexed Mixed-Signal ECoG Recording Architecture With Stimulus Artifact Suppression

John P. Uehlin, Student Member, IEEE, William Anthony Smith, Member, IEEE, V. Rajesh Pamula, Member, IEEE, Steve I. Perlmutter, Jacques C. Rudell, Senior Member, IEEE, and Visvesh S. Sathe, Member, IEEE

Abstract—This article demonstrates a scalable, time-division multiplexed biopotential recording front-end capable of real-time differential- and common-mode artifact suppression. A delta-encoded recording architecture exploits the power spectral density (PSD) characteristics of Electrocorticography (ECoG) recordings, combining an 8-bit ADC, and an 8-bit DAC to achieve 14 bits of dynamic range. The flexibility of the digital feedback architecture is leveraged to time-division multiplex 64 differential input channels onto a shared mixed-signal front-end, reducing channel area by 2x compared to the state-of-the-art. The feedback DAC used for delta-encoding also serves to cancel differential artifacts with an off-chip adaptive loop. Analysis of this architecture and measured silicon performance of a 65 nm CMOS test-chip implementation, both on the bench and in-vivo, are included with this paper.

Index Terms—BCI, biopotential, delta-encoding, electrocorticography (ECoG), electrocorticography, low-power, neural amplifier, time-division multiplexing.

I. INTRODUCTION

BI DIRECTIONAL Brain-Computer Interfaces (BBCIs) are electronic systems that allow computers to simultaneously record, process, and stimulate neural activity. These systems have become critical technologies for enabling scientific inquiry into human brain function [1] and are ushering in a new era of neuroprosthetics [2], [3]. These advances have the potential to enable new treatments for a number of neurological disorders including Parkinson’s Disease [4], epilepsy [5], and depression [6]. Simultaneously, BBCI research continues to drive progress in the field of human-computer interaction.

Fig. 1 illustrates an example BBCI implementation. Bidi-rectional communication with the brain is made possible by recording the signals emitted by the brain and stimulating the tissue either in the cortex or in the peripheral nervous system. In-line computation elements allow for closed-loop bidirectional communication with the central nervous system.

Chronic (long-term) BBCI applications face several key engineering challenges. The desire for high electrode density and large spatial coverage motivates neural interfaces with thousands of electrodes [2]. However, the silicon area required to scale existing solutions is prohibitive. Concurrent stimulation and recording activity resulting from closed-loop neuromodulation generates large stimulation artifacts that obfuscate important signals during and shortly after stimulation. Power density requirements due to tissue heating remain restrictive, particularly in monolithic solutions [7].

This paper presents a channel, process, and frequency scalable recording system in standard 65 nm CMOS which makes several contributions to the state-of-the-art: 1) Time-domain multiplexed recording channels to enable a demonstrated two-fold reduction in silicon area for ECoG recording. 2) A delta-encoded feedback loop exploits neural signal statistics to provide large dynamic range by combining low-precision data converters to achieve high-precision recording. 3) Real-time, event-driven...
Fig. 2. Functional block diagram of the highly-multiplexed, digitally-delta-encoded ECoG signal chain with common- and differential-mode artifact suppression that is selectively enabled when cued by the stimulator.

This paper is organized as follows: In Section II we discuss the system architecture and analysis that led to the implemented prototype. Section III describes the details of the fabricated test chip. Bench and in-vivo measurements are presented in Section V. Finally, we discuss conclusions and potential next steps of this work in Section VI.

II. ARTIFACT-SUPPRESSING MULTIPLEXED DIGITAL-FEEDBACK ARCHITECTURE

Many biopotential signals, including ECoG, are highly colored, exhibiting an approximately $1/f^2$ relationship to the Power Spectral Density (PSD) [8]. Therefore, high-frequency low-amplitude portions of the signal spectrum set the sampling rate requirements of the system while large-amplitude, low-frequency spectral content requires a large front-end dynamic range. These PSD characteristics strongly motivate delta-encoded signal acquisition, where recording the differences between samples allows dynamic range requirements to be significantly reduced. Fig. 2 shows a block diagram of the system architecture, optimized for multiplexing and frequency-shaped dynamic range. While stimulation artifacts do not exhibit the same helpful PSD structure, their occurrence can be accurately anticipated with a cue communicated by the stimulator to the recording system. Knowledge of the arrival of an artifact is exploited to enable (learned) template-based artifact cancellation of differential-mode (DM) artifacts, and double-sampled switched-capacitor based common-mode (CM) artifact cancellation. Because these artifacts are suppressed effectively at the amplifier front-end, an impact on amplifier dynamic range is avoided. Both these techniques are applied on-cue from the stimulator and suppress artifacts at the amplifier input to avoid adversely impacting amplifier dynamic range. The proposed architecture also provides several other significant enhancements including multiplexed amplifier operation, scalable sampling frequency while using simpler, more robust circuit blocks to achieve similar system specifications. As we will show, these architectural enhancements demonstrably do not affect the first-order noise performance of the system.

A. Delta Encoding

We have previously shown in [9], [10] that the PSD characteristics of biopotential signals can be exploited to reduce amplifier and ADC dynamic range requirements. In this work, we implement delta-encoding using mixed-signal feedback to similarly shape the signal spectrum (Fig. 3). In contrast with existing mixed-signal feedback schemes that implement DC servo-loops [11]–[15], delta-encoding [16], [17], or auto-ranging circuits [18] that require high-precision oversampling converters, the proposed architecture uses delta-encoding to track differences between successive samples with Nyquist rate conversion. This approach significantly reduces the ADC dynamic range requirement for signals containing low-amplitude, high-frequency and high-amplitude, low-frequency content. Delta encoding also
compensates for DC offset in DC-coupled systems, such as this one.

The input signal is differenced with the previous sample, and this low-dynamic-range residue is passed through a gain stage $H(s)$ and then digitized at moderate resolution (8 bits). The ADC provides a code to a digital accumulator that tracks the large-scale variation of the signal based on the incremental differences between values. This accumulation is fed back to the input to complete the delta-encoding operation [19], [20].

Implementation with a full-scale feedback DAC would require oversampling. Instead, a truncated portion (MSB-intact) of the ADC is accumulated and stored as coarse representation of prior channel voltage, allowing for a 8-bit Nyquist-rate feedback capacitive-DAC. The signal can still be fully reconstructed by adding the accumulator output weighted by the gain factor $M$ to the ADC residue output. Thus, the ADC acts as an interpolator for the delta-encoder, allowing high-resolution data acquisition using moderate-resolution converters. $M$ is calibrated to compensate for DAC transfer function nonidealities which are amplified at the ADC input.

B. Multiplexing for Channel Area Reduction

The proposed digital feedback architecture makes it possible to multiplex the feedback networks for many channels using compact memory storage. Multiplexing the entire signal chain allows this implementation to achieve recording channel density approximately two-times the state-of-the-art, without dependance on specialized low-offset electrodes or trading off power for area reduction [21]. The accumulated residue in Fig. 3 is stored independently for each channel and then recalled periodically, providing programmable support for up to 64 channels with a common front-end.

To achieve a large number of multiplexed channels, the amplifier must have a wide bandwidth so that settling is possible between adjacent independent channels. However, this requirement directly contradicts the need for a low-pass response to eliminate noise aliasing from higher frequencies at the amplifier inputs. Filtering is achieved by the rectangular integration time window, which creates a $sinc$ envelope filter on the input signal and input noise, while leaving the amplifier noise of a single stage amplifier in terms of $kT/C$. The bandwidth is extended with minimized power draw by operating partially in the charge sampling region. The amplifier output impedance can be higher, as $\tau$ is larger than the sampling window, reducing the necessary current.

As described in [22], the total noise power due to noise sources before the charge integration stage is proportional to the sampling interval, which is nominally reduced by a factor of $N$, the number of independent channels. This increased power corresponds to an increase in the noise voltage by a factor of $\sqrt{N}$. Because of this penalty, additional gain stages before the integrating amplifier should be avoided, so that only biological and electrode noise is amplified by this factor. This penalty is reasonable for many electrode configurations where electrode noise does not limit the signal acquisition [23], [24]. Switching the CDAC does introduce $kT/C$ noise before the gain stage.

However, because this noise is shunted toward the much larger electrode-tissue capacitance, upon sampling, the contribution of this noise is negligible.

Crosstalk between adjacent samples should also be considered for multiplexed channels. Residual voltage from the previous channel can considerably influence the trajectory and sampled value of the current channel, especially when charge sampling, because the signal is not allowed to settle. To avoid this problem, each node in the circuit is auto-zeroed with high-precision settling between samples.

III. Detailed System Implementation

Details of the overall system are shown in Fig. 4. Multiplexer switches connect 64 differential electrode pairs (represented as an RC model) to a shared switched-capacitor front end. The input DC-blocking capacitors $C_{IN}$ can be reconfigured as sampling capacitors for common-mode artifact suppression (CMS). The complimentary 8-bit CDAC moves charge at the amplifier inputs to track similarities between subsequent samples, using a thresholding operation and register file to independently integrate the ADC outputs for 64 channels. The residual difference is integrated onto $C_L$ for the sampling period and quantized by an 8-bit SAR ADC.

Charge sampling in the amplifier eliminates the added power requirements of $7\tau$ settling (assuming 8-bit resolution) typically required to sample the input signal after transitioning to a new channel [25]. Excessive clock jitter can significantly degrade precision in a charge sampling implementation, but the jitter resulting from a ring oscillator powered by a regulated supply was found to be low enough for this application. A return-to-zero technique implemented at the amplifier inputs eliminates crosstalk between channels (measured at $-92$ dB) and offers a time-invariant switched-capacitor DC input resistance to each channel. Use of a single amplifier for recording an array of channels naturally offers correlated double-sampling using a dummy channel to reduce flicker noise if required by the application.

Mixed-signal delta-encoding is enabled by combining digital storage for each channel with an 8-bit feedback DAC to perform the differencing operation. Delta-encoding allows an 8-bit Nyquist-rate ADC to provide the dynamic range required for ECoG signal acquisition. The signal can be reconstructed by adding the stored signal state to the ADC output (the digitized delta-encoder residue). The accumulated value, to be provided to the DAC in the next cycle, must be updated to allow the system to track the input signal. The update quantity is determined based on comparing the ADC code to user-programmable thresholds (Fig. 2, blue and green bands). These comparisons motivate an update of $+1$, $-1$, and $0$ to maintain tracking.

The $G_{IN}$ stage gain is highly configurable for flexible use and is calibrated first by minimizing the size of the load capacitance for a given gain setting, and then by fine tuning with the precise integration time. This provides the optimal noise performance for a given configuration, as shown by $\tau$ in Equation (3). Configuring the gain with the integration time alone, while effective, is suboptimal for noise performance and should be avoided. In general, the value of capacitance for a given sampling rate and
number of channels is fixed, and fine tuning the integration time is unnecessary due to the DAC calibration procedure.

This one-time calibration of the DAC weights is performed by fitting them to a bathtub curve during post-processing and is a sufficient approach for handling systematic DAC nonlinearity [26]. Charging the 5 pF equivalent input capacitance from the electrodes creates a sampling-frequency-dependent input resistance, as shown conceptually by Fig. 5. This input resistance forms a high-pass corner with the series electrode capacitance ($C_E$) that is on the order of 1 Hz for most electrodes [27].

A multiplexed front end results in a system that is inherently DC coupled. Switching between channels using auto-zeroed inputs requires the channel to charge the amplifier input, leading to the presence of a switched capacitor resistor at the amplifier input. Importantly, this switched capacitor consists of the capacitance of the DAC and the $g_mC$ amplifier (amounting to approximately 5 pF and not $C_{in}$. Although the potential across $C_{in}$ changes in this multiplexed front-end, the system is designed to ensure that the charge delivered to $C_{in}$ is provided by the supply voltage, not the input signal.

![Fig. 4](image4.png)

**Fig. 4.** Detailed schematic of the analog front end, including delta-encoded feedback loop with 8-bit DAC and 8-bit ADC, CMS system, autozero technique, and interpolation.

![Fig. 5](image5.png)

**Fig. 5.** Conceptual diagram of input network charging and resulting equivalent input resistance, as seen from the perspective of each recording electrode.

![Fig. 6](image6.png)

**Fig. 6.** Autozero and sampling operation timing diagram for normal recording mode. Note: Switch AZ2 remains open during normal operation.

### A. Timing

Figure 6 shows the timing of the system in normal operating mode. When the system is ready to transition to the next channel the input MUX is disabled and the DAC is returned to zero-state, ensuring that sampling, amplification and quantization for each channel are performed with identical impedances independent of the state of the previous channel. Next, the auto-zero switches are activated, bringing both the input and output to a known voltage. During nominal operation the transconductor input is not auto-zeroed, but instead uses a large pseudo-resistor to set the bias. Using a pseudo-resistor avoids the injection of $kT/C$ noise at the amplifier inputs that would occur when the switch samples the reference voltage. This differs from the switching noise contribution at the MUX output node, because the series capacitor creates a high-impedance between the switch and ground. In contrast, the MUX output node has a low-impedance path to ground through the MUX that almost entirely dissipates the switching noise before amplification begins.

After the auto-zero is completed, the sampling switch at the input to the ADC is closed. The system is still not sampling, because the output auto-zero remains active. Next, the DAC is enabled just before $\phi_{AZ1}$ is opened, ensuring that the DAC sees a uniform impedance. The MUX is then enabled. Finally, the auto-zero switch at the output is opened, which allows the OTA to begin accumulating charge at the output. When the sampling window is completed, the sampling switch opens and the whole process begins again on the next channel.
Front-end operation under CMS conditions, as described in Section IV-B is shown in Fig. 7. This mode is enabled for a brief duration during and after stimulation events, and is similar to the nominal operating mode, with the key difference that the common mode voltage is first sampled across the input capacitors through a simple switching scheme. As previously mentioned, CMS introduces $kT/C$ noise at the input node. Given the typically low duty-cycle of stimulation, this does not have a significant impact on overall system noise performance.

B. OTA Stage

A schematic for the operational transconductance amplifier (OTA) is shown in Fig. 8. The cascoded current-reuse amplifier topology, similar to [28], leverages the presence of a 2.5 V supply needed by a stimulator on a monolithically integrated chip, allowing a large amplifier device stack using thick-oxide devices, which improves noise efficiency and CMRR without gate-leakage concerns.

The two sets of input pairs are sized to operate deep in weak inversion, with very large device areas to mitigate flicker noise. Cascode devices are sized in moderate inversion to maximize gain while minimizing the effects of their capacitance. The input CM voltage to the PMOS and NMOS devices is designed at $V_{DD}/2$, so the tail devices are cascoded with the available headroom to improve CMRR and PSRR.

C. ADC Implementation

The ADC at the output of the amplifier serves as the sampling switch and a portion of the load capacitance for the charge-sampling amplifier. It is implemented as a top-plate sampling, monotonic-switching asynchronous SAR ADC [29], [30]. In this fully differential implementation, a 7-bit DAC combined with an initial sign measurement creates an 8-bit output. The unit capacitor is implemented using a Metal-Oxide-Metal (MOM) capacitor structure with a unit size of 7.7fF. The large unit capacitor provides robust operation, and the resulting marginal increase in power dissipation is insignificant given that the OTA dominates overall power. The entire ADC is powered with a 0.5 V supply and reference voltage. This small reference voltage reduces the dynamic range requirements of the OTA and the LSB size of the converter and also reduces ADC power consumption.

D. DAC Implementation

The DAC was implemented using a monotonic switching structure similar to the ADC. 10fF unit capacitors are binary weighted over 7 bits with an additional sign bit from differential operation. The aggregate capacitance of the DAC forms a voltage divider with the 50 pF input capacitors and parasitic capacitance from the amplifier inputs. The capacitors are initially sized based the flicker noise requirements of the transconductor devices, and the input capacitors are designed at 10-times the total parasitic capacitance to prevent capacitive division of the input signal. Driven at 2.5 V, this voltage divider creates a nominal DAC LSB of 500 μV. This LSB size covers approximately half of the input-referred differential range of the forward signal path. Variations in the common-mode voltage due to monotonic switching of DAC elements are partially mitigated by inverting the switching direction of the MSB [31]. The residual variation in the common-mode range does not affect the linearity of the system output with a THD specification of 0.1%.

IV. Analysis

A. Noise Analysis

Two general paradigms exist for sampling of signals, as illustrated in Fig. 9. Conventional sampling requires several time constants, $\tau$, to achieve a desired settling accuracy in the sampling interval. Charge sampling, as discussed above, relies on a precise sampling period so does not require settling for multiple time constants. Therefore, charge sampling reduces the required amplifier bandwidth substantially, decreasing power consumption and amplifier noise when compared with traditional sampling.

Noise analysis of the system must be performed to identify the optimal time-constant for this first-order system. For simplicity, this analysis is limited to thermal circuit noise. Other noise sources will scale in the same fashion. The noise generated by a
transconductance amplifier is given by $\tau_{T} = 4kT\alpha G_m$, where $G_m$ is the stage transconductance, $\alpha$ captures topological, sizing, and process constraints on transconductor noise performance, $k$ is Boltzmann’s constant and $T$ is temperature. From this expression, the output noise voltage from one sampling operation can be shown to be:

$$v_{N-out}^2 = \frac{\alpha k T R_o}{C_L G_m R_o} \left[ 1 - e^{-\frac{\tau}{T}} \right]^2$$

(1)

where $R_o$ is the amplifier output resistance and $C_L$ is the sampling capacitor. The gain of the circuit, including transient effects, is given by $A = G_m R_o [1 - e^{-\frac{\tau}{T}}]$, where $T_s$ is the sampling interval bounded by the sampling rate. Therefore, the input referred noise can be written as:

$$v_{N-in}^2 = \frac{\alpha k T}{C_L G_m R_o} \left[ 1 - e^{-\frac{\tau}{T}} \right]^2$$

(2)

After grouping terms, the input-referred noise voltage is given by Equation (3), where $\tau = C_L R_o$. An efficiency factor, $\Gamma(\tau/T_s)$, captures the effect of finite per-channel charge-sampling time. For the chosen operating point, where $\tau \approx 0.8T_S$, $\Gamma(\tau/T_s)$ becomes $\Gamma_{min}$.

$$v_{N-in} = \sqrt{\frac{\alpha k T}{G_m T_s}} \sqrt{\frac{T}{1 - e^{-\frac{T}{T_S}}}} = \sqrt{\frac{\alpha k T}{G_m T_s}} \Gamma_{min}$$

(3)

This expression demonstrates the expected inverse proportionality between noise voltage and $\sqrt{Gm}$. It also shows that the sampling interval, $T_s$, should be set to the maximum value allowable by the sampling frequency. For a given $T_s$, noise efficiency is achieved by minimizing $\Gamma$ across possible values of $\tau/T_s$ (Fig. 10). From the figure, the optimal choice corresponds to $\tau \approx 0.8T_S$.

Since the analysis assumes a single-pole transconductor, the amplifier has optimal noise performance with a sampling interval just above the 3 dB bandwidth. However, this bandwidth is highly dependent on the process-sensitive output resistance of the amplifier. Given this additional consideration, we instead designed the amplifier 3 dB bandwidth to be slightly below the corresponding sampling angular frequency, in the integrating region of the curve where sensitivity to the output resistance is sufficiently mitigated (Fig. 10).

In this implementation, digital calibration is necessary to compensate for open-loop transconductance and load capacitance variability. First-order calibration is implemented through a judicious combination of digitally-tunable load capacitance and integration time.

Assuming weak-inversion operation, $G_m = 25I_{D,N}$. We denote the amplifier current of this N-channel system as $I_{D,N}$. Under the optimality criterion prescribed by Fig. 10, Eq. (3) can be re-written to analyze the system in the context of supporting N channels as follows:

$$v_{N-in} = \sqrt{\frac{\alpha k T}{G_m T_s}} \Gamma_{min} = \sqrt{\frac{\alpha k T}{25I_{D,N} T_s}} \Gamma_{min}$$

(4)

Proportionally scaling the current with channel count yields $I_{D,1} = I_{D,N}/N$, where $I_{D,1}$ is the equivalent amplifier current for a single channel. $T_s$ must scale by $N$ to support concurrent recording at a target sampling rate per channel, where $T_{s,1} = T_s/N$. Writing $T_s$ in terms of $T_{s,1}$ and substituting yields:

$$v_{N-in} = \sqrt{\frac{\alpha k T}{25I_{D,1} T_{s,1}}} \Gamma_{min}$$

(5)

The sampling period, proportional to $N$, cancels the accompanying scaling of the amplifier current with channel count in Eq. (5). Multiplexing is therefore, to the first order, neutral in circuit power for a given input-referred noise target when compared with non-multiplexed, single-amplifier-per-channel approaches.

### B. Artifact Suppression

Common- and differential-mode artifacts (Fig. 11) severely degrade recording performance in BBCIs because the required stimulation voltages are much larger than the recorded signals.
Neuroscientists are interested in neural behavior that occurs shortly after stimulation [1]. As a result, techniques that suppress common- and differential-mode artifacts are critical enabling technologies for BBCI applications. This system includes differential- and common-mode artifact suppression that can be selectively enabled during artifact events when provided a cue from the stimulator.

In this mixed-signal feedback implementation, the feedback DAC can be leveraged to enable differential stimulation artifact suppression at the amplifier inputs, as shown in Fig. 12. Off-chip artifact template training uses a least mean-squared (LMS) algorithm [32] to iteratively learn the optimum template using the ADC output on the FPGA controller. The FPGA also controls the stimulator and provides the correct template for each stimulation event. A multiplexed recording architecture requires template interleaving because each channel sees a different voltage artifact. The interleaved templates are synchronously serialized with the on-chip recording clock domain, transmitted, and de-serialized on chip. These transmitted artifact codes are summed sample-by-sample with an adder into the delta-encoding loop feeding the input DAC. Implementing the LMS algorithm and artifact template storage on chip would require 256 bits of SRAM per artifact channel, multiplexers, adders, and a bitwise shift. With low-voltage logic, a sign-error LMS algorithm can be performed with 100nW and less than 0.02 mm$^2$ per channel [33].

The proposed architecture also supports more sophisticated adaptive equalization methods implemented on the FPGA. The efficacy of differential suppression techniques is limited by the DAC dynamic range: very large artifacts would require a large DAC range. Additionally, DAC quantization limits the precision of artifact suppression. However, the objective of DM cancellation is to suppress artifact to within the linear range of the amplifier, enabling downstream signal processing for more aggressive removal of these artifacts [34], [35].

Large common-mode (CM) artifacts on the order of 100 s of mV are often overlooked but can also degrade performance, particularly with the noise-efficient open-loop amplifiers employed in modern neural recording systems. Recently published worked demonstrates suppression of CM artifacts using an active CMFB stage [36]. This technique works well for conventional non-multiplexed approaches, but an analog CM loop would require prohibitively high amplifier bandwidth for multiplexed architectures. We implemented a passive, switched-capacitor technique for CM artifact suppression, elaborated upon in [37]. The CMS is selectively enabled on-cue by the stimulator to stabilize the amplifier operating point for the duration of the artifact. CMS maintains gain across the input signal range and suppresses distortion by using a correlated double-sampling technique. Use of the CMS mode introduces $kT/C$ noise when releasing the auto-zero at the transconductor input, resulting in a 12 dB SNR reduction while CMS is enabled during stimulation. This scheme is included in the multiplexed amplifier discussed herein, but can also be utilized for non-multiplexed topologies.

C. Frequency Scalability

An additional benefit of charge sampling is that the system scales efficiently with frequency by tuning of the sampling capacitor $C_L$. The gain of the amplifier in the charge-sampling regime is given by $A_v = G_{m}T_s/C_L$. As $T_s$ changes due to higher or lower sampling frequencies, the gain can be held constant without modifying the $G_{m}$. This characteristic allows for scaling of the system without modifying the transistor-level design of the amplifier. Coupled with the ability to select the number of channels, the system can be scaled in frequency with only a square-root impact on noise efficiency as $T_s$ decreases, as can be seen from Equation (4).

Scaling sampling frequency also affects the system input impedance and bandwidth. The series capacitive component of electrode-tissue impedance forms a high-pass corner with the input impedance. Sampling on the front-end input capacitance creates an equivalent input resistance proportional to sampling
frequency by $1/f_s C_{in}$. During normal operation, $C_{in}$ is the parasitic capacitance of the CDAC and OTA, approximately 5 pF. During CMS, signal is sampled onto the 50 pF series input cap. Electrode capacitance and input resistance combine into a high-pass corner, $f_{HP} = 1/(2\pi R_{in} C_{el}) = f_s C_{in}/2\pi C_{el}$. Secondly, pre-charging $C_{in}$ with the CDAC reduces $f_{HP}$ by partially using charge from the supply rather than the electrode for sampling low frequencies. For an 800 pF microwire electrode, the theoretical high-pass corner without pre-charging is $f_{HP} = 2$ kHz for 5 pF. A 700 mHz corner was measured in this scenario due to the pre-charging effect. Enabling CMS temporarily increases the input capacitance to 50 pF, only during stimulation events. A corresponding 10 Hz corner was measured sampling off of an 800 pF electrode capacitance at 2kS/s.

V. MEASUREMENT RESULTS

This system prototype was fabricated in a 1P9M 65 nm CMOS process. The die photo is shown in Fig. 13. The chip has been tested both on the bench for performance measurements and in-vivo to verify functionality in living tissue.

A. Bench Measurements

Bench measurements used an emulation of a Platinum-Iridium microwire with $R_{ES} = 7.5 \; k\Omega$, $C_E = 820$ pF, and $R_{EP} = 5 \; G\Omega$ [23]. A Stanford Research Systems DS360 signal generator was used for high-resolution sinusoidal signal generation at low frequencies, and a Terasic DE1-SOC FPGA board was used for digital acquisition and analysis.

Figure 14 demonstrates a measured signal recording of a full-scale 35mVrms sinusoid applied at the inputs. This figure displays the interpolation of the accumulated output voltage that drives the DAC with the ADC output residue to form a complete high-resolution waveform.

The noise performance of the system sampling at 2kS/s is given in Fig. 15. The use of correlated double sampling (CDS) is also demonstrated for ECoG recording to demonstrate that flicker noise can be mitigated—if necessary—at the expense of an increase in the thermal noise floor [38]. This increase is due to uncorrelated thermal noise from the signal and reference electrodes.

Fig. 16 shows the INL and DNL of the system before and after calibration. Minimizing INL is a key objective to maximize system performance. Before calibration, the INL of the system varies considerably because the 8-bit DAC linearity is insufficient after amplification when compared to the ADC output linearity. Post-processing of the DAC code weights to minimize INL addresses these linearity constraints. The gain variation due to amplifier CM-DM conversion also contributes to INL by varying the slope of the interpolating sections. This variation increases the INL to $< 3$ LSB.

Linearity is also often evaluated using total harmonic distortion (THD). The mixed-mode feedback of this architecture allows for large signal swing at low frequencies, shaping the linear dynamic range as seen in Fig. 17. The delta-encoded feedback creates a dynamic range profile that matches the characteristic $1/f^2$ ECoG signal profile.

The measured signal transfer function (STF) of the system for 2kS/s ECoG recording is shown in Fig. 18. The results show a 3 dB bandwidth below 1 Hz. The STF is also shown for the reduced-impedance case where CMS is enabled. These modes are enabled intermittently during stimulation events, and
Fig. 16. INL and DNL measurements of the prototype both before and after calibration obtained using the histogram method over 16777216 samples. The DNL is dominated by the noise floor of the amplifier, while the INL shows 2.90 LSB linearity after calibration, demonstrating very strong linear performance across the entire input range.

Fig. 17. RMS input amplitudes causing 1% Total Harmonic Distortion, measured with single-tone sinusoids across frequency for ECoG recording. Overlaid with a typical ECoG PSD (2kS/s) adapted from [27].

Fig. 18. Measured full signal chain transfer functions for ECoG (2kS/s).

the input impedance decreases from 92 MΩ to 9.2 MΩ. This decreased input impedance primarily impacts the low-frequency voltage transfer, which as shown in the figure primarily impacts the Delta-band frequencies below 4 Hz. The remaining motivation for a high input impedance relates to common-mode to differential-mode conversion [39], which this CMS scheme already directly addresses.

Bench measurements of the CMS system are shown in Fig. 19(a). Without the CMS system, even small common-mode artifacts can substantially modulate the gain of the amplifier, which creates a nonlinear intermodulation product with the common-mode artifact and potentially sets the operating point of the amplifier in a highly nonlinear region. With CMS enabled, the gain remains very consistent until the complementary transmission-gate based switches of the multiplexer begin to enter the accumulation region. A 6% degradation of the gain near mid-range is observed due to the complementary switches employed at the MUX input. The linearity of these switches is sufficient for recording purposes, but is degraded in the presence of large voltage swings. This reduction in linearity could easily be corrected by bootstrapping the MUX switches, which could also be leveraged for increased voltage range [40], [41].

Although the proposed CMS architecture achieves a cancellation range of 2.5 V (Fig. 19), significantly higher than prior works [36], capacitive sampling adds noise at the input. Reducing this kT/C noise by increasing $C_{IN}$ would further reduce input impedance and increase area. To measure the input-referred noise impact of CMS, we continuously enabled the feature and measured the noise performance in this worst-case condition. The input-referred noise was measured to be 14.4 $\mu$V, consistent with kT/C noise from a $C_{IN}$ of 50 pF at each input.

In real-world applications, CMS is only enabled during stimulation pulses. For duty cycle, $D$, the resulting impact on input referred noise can be approximated as: $v_{in, eff} = 1.66 \mu V + D \times 12.7 \mu V$. A typical $D$ value of 4% [42] yields input referred noise of 2.17 $\mu V$, resulting in a NEF of 2.89. This expression is applicable for applications that rely on frequency-domain analysis. If low-noise measurement of time domain data during stimulation events is required, another method must be used.

Differential-mode suppression is demonstrated in Fig. 19(b). An H-bridge stimulator [43] in combination with a signal generator to create a test signal with a realistic artifact. A 5 Hz, 10 mV sinusoid with a $\pm$54 mV biphasic differential stimulation artifact is input to the system and suppressed at the amplifier inputs, demonstrating 46 dB of artifact suppression before the amplifier inputs. This number is limited by the step size of the input DAC, but further post-processing suppression is possible because 46 dB of suppression brings the artifact into the linear range of the signal chain.

B. In-Vivo Measurements

In-vivo measurements were taken from two hemispheres of a chronically implanted µECoG array in a sedated macaque. Fig. 20 demonstrates operation on 16 channels simultaneously using the time-division multiplexing implemented on chip. Both
Fig. 19. Bench measurements of the CMS (a) and DMS (b) systems. (a) shows the CMS system swept over CM artifact voltage levels both with and without CMS enabled. The CMS largely mitigates gain variation compared to the nominal case. (b) shows a 10 mV sinusoidal 5 Hz signal with a ±54 mV biphasic stimulation artifact (differential) before and after front-end suppression.

Fig. 20. Simultaneous 16-channel in-vivo measurements from an implanted μECoG array in a sedated non-human primate, shown in the (a) time domain and (b) frequency domain.

the time- and frequency-domains are shown for clarity. Signal amplitude varies significantly between hemispheres, as apparent in both plots. High-quality recordings with a 1 kHz bandwidth were obtained, showing low-frequency behavior consistent with suppressed brain activity under sedation.

In Fig. 21, the common- and differential-mode suppression techniques are demonstrated in-vivo. Fig. 21(a) shows the common-mode suppression technique across 3/64 channels. The buffered common-mode voltage at the amplifier inputs is measured using an oscilloscope with and without the suppression enabled. When suppression is disabled, large common-mode variation is seen due to sampling of stimulus artifacts on the 3/64 channels that are connected. When enabled, the common-mode voltage is stabilized and is always within 10 mV of the nominal common-mode input voltage. Fig. 21(b) shows the behavior of the differential-mode suppression system. While the suppression capability is limited by the coarseness of the input DAC, as discussed in Section III above, we demonstrate that the amplifier does not saturate when suppression is enabled. This allows for linear post-processing of the signal after digitization.

C. Comparison to State-of-the-Art

Table I compares the results from this chip to other recent works representing the state-of-the-art in cortical biopotential acquisition. This work performs comparably to the literature along all metrics and scales well with frequency without a major performance impact. Our primary contributions include the CMS technique for common-mode artifacts and full signal-chain multiplexing which improves the channel density by a factor of >2x. We also demonstrate excellent crosstalk performance because of the auto-zero technique and the lack of parasitic paths between channels past the multiplexer inputs, which are next to the individual pads.
Fig. 21. In-vivo recordings in the presence of artifacts for common- (a) and differential-mode (b). (a) shows the voltage at the input to the amplifier with and without common-mode suppression. (b) shows the reconstructed output waveform with and without front-end suppression. Measurements were performed with 300 μA biphasic stimulation in PtIr microwire electrodes adjacent to recording electrodes, creating 100 mV Pk-Pk stimulus artifact at the recording inputs.

V. PERFORMANCE COMPARISON OF PROTOTYPE WITH OTHER BIOPOTENTIAL AMPLIFIERS

<table>
<thead>
<tr>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>Technology</td>
<td>65 nm</td>
<td>0.18 μm</td>
<td>0.18 μm</td>
<td>0.13 μm</td>
<td>65 nm</td>
</tr>
<tr>
<td>Supply Voltage (V)</td>
<td>0.5</td>
<td>0.5, 1</td>
<td>1.8</td>
<td>1.2, 1.8</td>
<td>0.5, 2.5</td>
</tr>
<tr>
<td>DM Artifact Suppression</td>
<td>—</td>
<td>Yes</td>
<td>—</td>
<td>—</td>
<td>Yes3</td>
</tr>
<tr>
<td>CM Artifact Suppression</td>
<td>—</td>
<td>—</td>
<td>—</td>
<td>—</td>
<td>Yes3</td>
</tr>
<tr>
<td>Multiplexing</td>
<td>—</td>
<td>ADC</td>
<td>Shank</td>
<td>Complete</td>
<td></td>
</tr>
<tr>
<td>Number of Channels</td>
<td>64</td>
<td>8</td>
<td>8</td>
<td>768</td>
<td>64</td>
</tr>
<tr>
<td>Channel Area (mm²/ch)</td>
<td>0.025</td>
<td>0.17</td>
<td>N/A</td>
<td>0.12</td>
<td>0.0023</td>
</tr>
<tr>
<td>Power per channel (μW)</td>
<td>2.3</td>
<td>0.33</td>
<td>56.7</td>
<td>3</td>
<td>2.98</td>
</tr>
<tr>
<td>Input Referred Noise (μVRMS)</td>
<td>1.2</td>
<td>3.05</td>
<td>5.23</td>
<td>12.43</td>
<td>1.665</td>
</tr>
<tr>
<td>Noise Efficiency Factor</td>
<td>4.76</td>
<td>1.63</td>
<td>1.773</td>
<td>8.53</td>
<td>2.215</td>
</tr>
<tr>
<td>AFE Bandwidth (Hz)</td>
<td>1-500</td>
<td>1-2k</td>
<td>0.1-700</td>
<td>0.5-8k</td>
<td>1-1kHz</td>
</tr>
<tr>
<td>Input Impedance</td>
<td>28MΩ @ 8kHz</td>
<td>—</td>
<td>—</td>
<td>13.27MΩ @ 20kHz</td>
<td>92MΩ @ 2kHz</td>
</tr>
<tr>
<td>ADC Resolution / ENOB</td>
<td>15b / —</td>
<td>10b / —</td>
<td>10b / 9.57b</td>
<td>10b / —</td>
<td>8b + 8b / 15.7b</td>
</tr>
<tr>
<td>CMRR (dB)</td>
<td>88</td>
<td>—</td>
<td>—</td>
<td>—</td>
<td>76</td>
</tr>
<tr>
<td>PSRR (dB)</td>
<td>67</td>
<td>—</td>
<td>—</td>
<td>—</td>
<td>82</td>
</tr>
<tr>
<td>Peak-to-Peak Range (mV)</td>
<td>100</td>
<td>—</td>
<td>—</td>
<td>—</td>
<td>110</td>
</tr>
<tr>
<td>Crosstalk (dB)</td>
<td>-85</td>
<td>—</td>
<td>—</td>
<td>-63</td>
<td>-92</td>
</tr>
</tbody>
</table>

1 Least-mean squares algorithm operating on a DAC at the amplifier inputs.
2 Serial chip input to DAC at amplifier inputs.
3 Switched-capacitor offset correction scheme.
4 Full Probe.
5 Amplifier Only.
6 For nominal recording operation, not CMS.
7 DC coupled to electrode.
8 High-pass corner is 10 Hz during CMS.
9 At low-frequencies (frequency dependent).

VI. CONCLUSION

We have demonstrated a delta-encoded digital feedback biopotential amplifier with both CM and DM artifact suppression capabilities. The delta-encoding architecture enables a large dynamic range using low-resolution data converters and enables high-density multiplexing to reduce area. The resulting prototype performance is comparable to the state-of-the-art with up to a 2x reduction in area per channel in the 64 channel configuration. Future work includes integrating this recording architecture with high-voltage compliance stimulation and a digital back-end capable of adaptive artifact cancellation computations. The architecture itself could be improved with increased CDAC resolution (for more differential artifact suppression) and introduction of a servo loop into the common-mode cancellation scheme to maintain a high input impedance.
ACKNOWLEDGMENT

The authors would like to the Washington National Primate Research Center and Brian Mogen for assisting with animal measurements. We would also like to thank Jason Silver for valuable discussions and Patrick Howe for help developing our benchtop setup. Special thanks go to Mentor Graphics for allowing us the use of Analog Fast Spice.

REFERENCES


John P. Uehlin (Student Member, IEEE) received the B.S. and M.S. degrees in electrical engineering in 2013 and 2017, respectively, from the University of Washington, Seattle, WA, USA, where he is currently working toward the Ph.D. degree, researching biomedical circuit design in the Future Analog Systems and Technologies (FAST) Lab. He has previously designed RF/mixed-signal circuits at Noise Figure Research. He is a member of the Center for Neurotechnology.

William Anthony Smith (Member, IEEE) received the B.S. degree from the University of Texas at Austin, Austin, TX, USA, and the M.S. degree from the University of Utah, Salt Lake City, UT, USA, in 2007 and 2011, respectively, both in electrical engineering, and the Ph.D. degree in electrical engineering from the University of Washington, Seattle, WA, USA, in 2017. He has previously held industry positions at AMD, Spansion, and Fathom Computing. He is currently employed by SpaceX in Redmond, WA, USA, as an RFIC Design Engineer, where he designs millimeter-wave silicon for the Starlink Broadband Satellite Internet Project.

V. Rajesh Pamula (Member, IEEE) received the B.Tech. degree in electrical engineering from the Indian Institute of Technology (IIT) (BHU) Varanasi, Varanasi, India, in 2007 and the M.Sc. degree in electrical and electronics engineering from Imperial College London, London, U.K., in 2010. From 2013 to 2017, he was a Research Assistant with MICS/ESAT, KU Leuven, Leuven, Belgium, in collaboration with imec, Leuven. He is currently a Visiting Research Scientist with the Processing Systems Laboratory (PSyLab), University of Washington, Seattle, WA, USA. His research interests include biomedical circuits, low-power sensor circuit design, and hardware security circuits. He was the recipient of four Gold Medals by IIT (BHU) Varanasi in 2007 and the Analog Devices Outstanding Student Designer Award in 2016.

Steve I. Perlmutter received the Sc.B. degree in biomedical engineering from Brown University, Providence, RI, USA, in 1979, the M.S. degree in biomedical engineering from University of California, Los Angeles, Los Angeles, CA, USA, in 1982, and the Ph.D. degree in physiology and neuroscience from Northwestern University, Evanston, IL, USA, in 1991. He is currently a Research Professor with the Department of Physiology and Biophysics, University of Washington, Seattle, WA, USA, a Research Affiliate with the Washington National Primate Research Center, Seattle, WA, USA, and a member of the Center for Neurotechnology and the University of Washington Institute for Neuroengineering. His research interests include spinal control of voluntary movements, neural plasticity, and neuroprosthetics. His lab is developing therapies for spinal cord injury and stroke that use activity-dependent, targeted, electrical, and optical stimulation of the nervous systems.

Jacques C. Rudell (Senior Member, IEEE) received the B.S. degree in electrical engineering from the University of Michigan, Ann Arbor, MI, USA, and the M.S. and Ph.D. degrees in electrical engineering from the University of California at Berkeley, Berkeley, CA, USA. He was an RFIC Designer with Qualcomm, Inc., San Diego, CA, USA, and Intel Corporation, Hillsboro, OR, USA. In 2009, he joined the University of Washington, Seattle, WA, USA, where he is currently an Associate Professor of electrical engineering. His current research interests include RF and millimeter-wave integrated circuit designs for communication systems, in addition to biomedical electronics for imaging and neural interface applications. He is an Active Member of the Center for Sensorimotor Neural Engineering, and an NSF Engineering Research Center based at the University of Washington. He is the Co-Director of the Center for Design of Analog-Digital Integrated Circuits. He was a recipient of the Demetri Angelakos Memorial Achievement Award, a citation given to one student per year by the EECS Department, the 2008 ISSCC Best Evening Session Award, and the 2015 NSF Career Award. He was a co-recipient of the best paper awards at the International Solid-State Circuits Conference (twice), namely, the 1998 Jack Kilby Award and the 2001 Lewis Winner Award, and the 2011 and 2014 RFIC Symposium Best Student Paper Award. He was on the ISSCC Technical Program Committee (2003–2010), and the MTT-IMS Radio Frequency Integrated Circuits (RFIC) Symposium Steering Committee (2002–2013), where he was the 2013 General Chair. He is currently on the Technical Program Committee of ESSCIRC. He was an Associate Editor of the Journal of Solid-State Circuits (2009–2015).

Vivek S. Sathe (Member, IEEE) received the B.Tech. degree in electrical engineering from the Indian Institute of Technology, Bombay, Mumbai, India, in 2001, and the M.S. and Ph.D. degrees in electrical engineering and computer science from the University of Michigan, Ann Arbor, MI, USA, in 2004 and 2007, respectively. From 2007 to 2013, he was a Member of Technical Staff with the Low-Power Advanced Development Group, AMD, where his research focused on inventing, developing, and producing new technologies into next-generation microprocessors, including low-power circuit design, high-speed circuits, adaptive clocking for supply noise mitigation, and resonant clocking. In 2010, he led the research and development effort that resulted in the first resonant-clocked commercial microprocessor. In 2013, he joined the University of Washington, Seattle, WA, USA, where he is currently an Assistant Professor. His current research interests include the areas of digital, mixed-signal, and power-management circuits and architectures. He was a Guest Editor for the Journal of Solid-State Circuits, and a member of the Technical Program Committee for the Custom Integrated Circuits Conference. He was a recipient of the NSF CAREER award.