F6 Wirelineforum 2024
F6 Wirelineforum 2024
Forum 6
Toward Next Generation of Highly Integrated
Electrical and Optical Transceivers
Forum 6
Francis Lin
MediaTek Inc.
Thermal
Serdes Requirement
[1][2]
Thermal
Serdes Requirement
[4] [5]
1.0E-07
224G C2C
BER
1.0E-08
224G CR 1mDAC
1.0E-09 224G C2M config1
System A224
224G C2M config2 Same dimension
1.0E-10 System A112 state of the art material.
1m AWG28 112G CR/KR IL(53G) -> 55.68dB
IL(26G 37dB)
1.0E-11
30 35 40 45 50 55 60
IL(dB)
Thermal
Serdes Requirement
Thermal preheat
Thermal
Serdes Requirement
BW preservation
Skew tunning
Crosstalk mitigation
Reflection mitigation
D0TX3(L3) D0TX4(L3)
Skew involves
◼ PN length difference K1=S31-S42
SDD21
SDC21
T
160
140
S
120
100
80
60 80 100 120 140
Path1
Path2
Path3
5.5
4.5
COM(dB)
3.5
2.5
2
10 15 20 25 30 35 40
bump to tump IL(dB)
Thermal
Serdes Requirement
Thermal
Serdes Requirement
1.0E-06
C2C
BER
CR 1m bump to bump
1.0E-07 C2M config1
C2M config2
1.0E-08
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 1 of 86
© 2024 IEEE International Solid-State Circuits Conference
Outline
Overview of Connectivity Trends
◼ Megatrends
◼ Wireline Transceiver & Application Trends
New Technologies and Considerations for 200G Links
◼ Within Transceivers
◼ 200G Optics
◼ Optical/Electrical Co-Design
Beyond 200Gb/s
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 2 of 86
© 2024 IEEE International Solid-State Circuits Conference
Relentless Growth in Data Consumption
AI INFLECTION POINT EXPONENTIAL RISE IN DATA
2010-2025
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 3 of 86
© 2024 IEEE International Solid-State Circuits Conference
AI Connectivity
DGX H100 256 GPU SuperPOD
Networking
Revenue Share %*
Server Market
General
Purpose
xPU
By 2027, approximately:
• 50% of market revenue will be AI-accelerated servers
• 20% of Ethernet Data Center Switch Ports connected to AI xPU’s consuming more bandwidth
servers
• 50% of data center switch ports driven by 400Gbps or higher 10+Tb/s: Bandwidth for xPU to xPU
30% of time in AI/ML spent in networking (Meta at OCP 2022) 100’s – 10,000’s xPUs per cluster
• Moving data between GPUs * Souce: Dell’Oro Group Data Center IT Capex Forecast, Jan 2023
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 4 of 86
© 2024 IEEE International Solid-State Circuits Conference
AI Driving Connectivity
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 5 of 86
© 2024 IEEE International Solid-State Circuits Conference
Connectivity Trends in Modern Compute Infrastructure
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 6 of 86
© 2024 IEEE International Solid-State Circuits Conference
Connectivity Demands for AI
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 7 of 86
© 2024 IEEE International Solid-State Circuits Conference
Example of xPU Cluster Network Topology
Front-End Datacenter Network [Norm Jouppi, … David A Patterson,
“TPU v4: An Optically Reconfigurable
• Connects traditional compute servers Supercomputer for Machine Learning
• Carries relatively short-lived streams of data with Hardware Support for
➢ Flexible architecture preferred Embedding,” ISCA, June 21, 2023]
Back-End ML Network
• xPU AI accelerator
interconnections
• Large training workloads
create sustained high-
bandwidth traffic with
regular patterns
• Low latency
➢ Flat hierarchy
➢ Non-blocking
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 8 of 86
© 2024 IEEE International Solid-State Circuits Conference
Google TPU v4: 4096-Chip AI Supercomputer
Optical Connectivity
TPUs1
Electrical interconnect
Norm Jouppi, … David A Patterson, “TPU v4: An Optically
within a rack Reconfigurable Supercomputer for Machine Learning with
Hardware Support for Embedding,” ISCA, June 21, 2023
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 9 of 86
© 2024 IEEE International Solid-State Circuits Conference
Disaggregated Storage
• Concentration of Optical Interconnect
storage in shared pools
improves efficiency CPU CPU
GPU GPU
efficiency Storage
Memory
Memor
• Relies on low-latency y
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 10 of 86
© 2024 IEEE International Solid-State Circuits Conference
WIRELINE TRENDS
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 12 of 86
© 2024 IEEE International Solid-State Circuits Conference
Published Wireline Transceivers 2010-2023
224
112
Energy/Bit [pJ/bit]
20.0
10.0
5.0
3.2
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 14 of 86
© 2024 IEEE International Solid-State Circuits Conference
Anatomy of the DSP Transceiver
• Combination of ISI and 4-PAM
necessitates extensive DSP for
Ref.
– encoding/decoding – equalization Clk ∫ 𝐾𝐼
– modulation/demodulation – timing recovery
∫ 𝐾𝑃
• As data rates increase:
– Equalization and FEC complexity increases
𝜙 FFE-t MM-PD
➢Challenging to keep power and latency low
FEC FEC
FIR AFE FFE
Enc. Dec.
DFE
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 15 of 86
© 2024 IEEE International Solid-State Circuits Conference
Evolution of Datacentre Rates
x2 x2 x2 x2
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 16 of 86
© 2024 IEEE International Solid-State Circuits Conference
Benefits of 200G Links
32 Modules • A 51.2T 1RU switch requires 32 x 1.6T
modules
• 16x100G optical links generally
requires twice the number of lasers as
1 RU Switch Faceplate
8x200G links
➢200G links halve the laser power and
Optical Transceiver cost
➢Realization of these benefits requires
robust 200G SerDes within available
power and cost envelopes
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 17 of 86
© 2024 IEEE International Solid-State Circuits Conference
Benefits of 200G Links
32 Modules
1 RU Switch Faceplate
Signal integrity is
challenging
High-power transceivers
in the ASIC
☺ Low cost
☺ Low power
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 20 of 86
© 2024 IEEE International Solid-State Circuits Conference
200G Passive Cable Channel Responses
• Blue: 13” cable has higher loss – 37dB
• Red: 8” cable has worse distant post-cursor reflections
Reflections
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 21 of 86
© 2024 IEEE International Solid-State Circuits Conference
Chip-to-Module Links
• Difficult routing for some
channels Switch
• Significant additional loss PCB
introduced by large
packages
➢ Total chip-to-module
channel loss may exceed
ASIC 9”
35dB at 200G
• Potential measures: 3”
▪ Extensive equalization
▪ Repeaters
Modules
▪ Flyover cables
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 22 of 86
© 2024 IEEE International Solid-State Circuits Conference
Repeaters or Active Cables
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 23 of 86
© 2024 IEEE International Solid-State Circuits Conference
Flyover Cables
Signal integrity is relaxed
Higher system cost and
power
☺ Simpler (lower-power)
transceivers in the ASIC
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 24 of 86
© 2024 IEEE International Solid-State Circuits Conference
200G Flyover Cable vs. PCB Responses
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 25 of 86
© 2024 IEEE International Solid-State Circuits Conference
Massive Scale-Out
Signal integrity is
challenging
Power consumption is high
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 26 of 86
© 2024 IEEE International Solid-State Circuits Conference
Optical Interconnect
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 27 of 86
© 2024 IEEE International Solid-State Circuits Conference
OPTICAL MODULES
Spine Switch
Leaf Switch
TOR Switch
Pluggable
Modules or Switch System (box) in all levels Switch
(Core, spine, leaf, TOR) ASIC
Transceivers
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 29 of 86
© 2024 IEEE International Solid-State Circuits Conference
Optical Module Anatomy
Host Interface
Line Interface
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 31 of 86
© 2024 IEEE International Solid-State Circuits Conference
Optical Module Anatomy
Host Interface • Silicon photonics allows for integration of
several optoelectronic components into a
single silicon die
• Laser count may be reduced
Line Interface
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 32 of 86
© 2024 IEEE International Solid-State Circuits Conference
Linear Pluggable Optical (LPO) Module Anatomy
Host Interface • Alternative architecture to eliminate the DSP
• No receive and retransmit function between
the host and line interfaces to reset noise and
timing jitter
⇒ “Linear Pluggable Optics”
Example:
Switch System
Traditional
DSP
Linear
Traditional Linear Traditional
DSP DSP
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 35 of 86
© 2024 IEEE International Solid-State Circuits Conference
Parallel optical fiber
Miniature
optical engines
CO-PACKAGED OPTICS
Chip-to-Chip Links
Gen II – OBO/NPO
OR
Laser Laser
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 37 of 86
© 2024 IEEE International Solid-State Circuits Conference
Electronic / Photonic Integration
ASIC Many permutations possible:
Laser a) RF circuits + DSP integrated onto ASIC
• Heavy burden on the nano-CMOS ASIC
ASIC d) RF, DSP and optical control spread across chiplet + PIC
Laser • Requires silicon photonics technology with CMOS
integration
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 38 of 86
© 2024 IEEE International Solid-State Circuits Conference
Opportunities and Challenges for CPO
Pros Cons
• May eliminate retimers • Concentrates power/heat
➢ Lower system power within the ASIC package
➢ Low latency
➢ Potential to lower cost
• Switch assembly more
complex
• Potential to improve
• Field service more difficult
aggregate I/O bandwidth
• Alleviates faceplate density issues • Reduced ecosystem for
innovation
vs.
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 39 of 86
© 2024 IEEE International Solid-State Circuits Conference
NEW TECHNOLOGIES FOR 200G
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 41 of 86
© 2024 IEEE International Solid-State Circuits Conference
System Impairment Sensitivity
➢ At 200G, links can be extremely sensitive to termination and package
discontinuities unless highly capable equalizers are provided
Post
Cursor CTLE TX TX Die Term PKG
TX TX Dual Bump PKG
Taps BW RLM Cap Inductor Ball Imped.
Swing SNR RJ Dirac Cap Trace
Cap
Jitter
Noise
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 42 of 86
© 2024 IEEE International Solid-State Circuits Conference
Channel Loss @
Number 53.125GHz
Channel: 1 2 3 4 5 6 Channel: 1 2 3 4 5 6
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 43 of 86
© 2024 IEEE International Solid-State Circuits Conference
• Short channels present
Roving-Tap FIR Equalizer additional challenge due to
reflections
• Roving equalization taps can
help address this concern
Long channel
(vertically normalized)
0 20 40 60 80 100
UI
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 44 of 86
© 2024 IEEE International Solid-State Circuits Conference
Segmented FEC Encoder
C1
Decoder
C1
Encoder
C2
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 45 of 86
© 2024 IEEE International Solid-State Circuits Conference
200G Concatenated FEC Encoder Decoder Encoder
C1 C1 C2
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 46 of 86
© 2024 IEEE International Solid-State Circuits Conference
Interleaving Concatenated Codes
• Inner code will
fail with Encoder Encoder
significant Interleaver
C1 C2
probability
➢ Burst-error
correction “Outer” code “Inner” code
performance
e.g. RS(544,514) e.g. Hamming
is a key
(128,120)
consideration
⇒ aided by an
interleaver
• Further source (Soft)
Decoder
of latency Deinterleaver Decoder
C1
C2
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 47 of 86
© 2024 IEEE International Solid-State Circuits Conference
Ensuring Satisfactory Post-FEC BER
• Large DFE tap weights
can introduce error
propagation that is
more likely to cause
post-FEC errors
• Difficult to model since
it is dependent on
interleaving details
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 48 of 86
© 2024 IEEE International Solid-State Circuits Conference
Differing Optima for Pre-FEC and Post-FEC BER
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 52 of 86
© 2024 IEEE International Solid-State Circuits Conference
200G EMLs
• CW DFB laser monolithically integrated with
Interconnect Model
an electro-absorption modulator (EAM)
• P-contact for DFB and EAM are separated by EAM-
~10s kW
• N-contact is usually shared between EAM and Termination
DFB Resistance
• DFB is forward biased in gain region and EAM 𝑅𝑡
is reverse biased in absorption CMN
• Challenges
• Simultaneous optimization of ER and Chirp especially EAM Diode Model
at longer reaches
DFB+ EAM-
• Crosstalk between EMLs in an array
• Modest swing requirement (challenging for CMOS) p-type InP
• Differential variants under research and 𝑅𝑡
development Light
• e.g. [K. Adachi et al., "53-Gbaud PAM4 Differential Drive of a
Conventional EA/DFB Toward Driver-Amplifier-Less Optical n-type InP
Transceivers," OFC’19]
CMN
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 53 of 86
© 2024 IEEE International Solid-State Circuits Conference
200G EMLs
• CW DFB laser monolithically integrated with
an electro-absorption modulator (EAM) [K. Nishimura et al., "225-Gb/s PAM4
Operation Using Lumped-Electrode-Type
• P-contact for DFB and EAM are separated by EA-DFB Laser for 5- and 10-km
~10s kW Transmission with Low TDECQ," OFC 2023.]
• N-contact is usually shared between EAM and
DFB
• DFB is forward biased in gain region and EAM
is reverse biased in absorption
• Challenges
• Simultaneous optimization of ER and Chirp especially
at longer reaches
• Crosstalk between EMLs in an array
• Modest swing requirement (challenging for CMOS)
• Differential variants under research and
development
• e.g. [K. Adachi et al., "53-Gbaud PAM4 Differential Drive of a
Conventional EA/DFB Toward Driver-Amplifier-Less Optical
Transceivers," OFC’19]
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 54 of 86
© 2024 IEEE International Solid-State Circuits Conference
Chirp at 200G vs 100G
• EML chirp is the variation in its
output wavelength with output
level
➢A significant potential challenge
at 200G when combined with
chromatic dispersion
• Results on the right assume
bandwidth is scaled to maintain
similar TDECQ (~2dB) and ER
(~5dB) at zero dispersion for
both 100G and 200G links
• c.f Worst-case dispersion in
CWDM applications is
approximately -6/+3.5 ps/nm⋅km
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 55 of 86
© 2024 IEEE International Solid-State Circuits Conference
200G Optics: Silicon Photonic MZMs
• SiP is attractive due to its low cost, 𝑳
integration, reliability, potential for differential 𝒁𝟎
drive
G S G S G
• SiP MZMs demonstrated for 100G 4-PAM Oxide
• Typical 1.5 – 2 V⋅cm phase efficiency
• 𝑉𝜋 = 6-7V and BW=30-35GHz ⇒ ≈2Vppd swing Doping Si Substrate
• 1 laser shared for multiple lanes
• Challenges of achieving 200G/lambda: Design Tx Swing BW Area Optical
• Required 𝑉𝜋 , bandwidth, and loss are difficult Parameter Req’t Loss
to achieve simultaneously
• Marginal improvements are expected by
further optimization of doping, length, or
𝐿 ↑
☺
bias voltage
• Advanced packaging solutions:
Junction
doping ↑ ☺ –
• Remove termination with tight integration
of the driver
Characteristic
Impedance,
☺ – –
• Segmented MZM with multiple NRZ
𝑍0 ↓
drivers ⇒ requires more Tx drivers
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 56 of 86
© 2024 IEEE International Solid-State Circuits Conference
200G Optics: EML vs. Silicon Photonics MZM
• Symbol rate = 106.25GB/s SiP MZM EML
After Tuning
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 59 of 86
© 2024 IEEE International Solid-State Circuits Conference
200G Optics: TFLN
• Low loss and high bandwidth TFLN
achievable • High-speed modulators
• Longer modulators can achieve
low 𝑉𝜋
• Two approaches
1.LNOI offering higher performance
2.Hybrid LN over SiN waveguides to leverage
from existing SiP platforms Silicon Photonic Platform
• Challenges • Photodetection
• Technology maturity • Splitters
• Generally large sized & high cost • Couplers
• Differential drive being researched • etc.
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 60 of 86
© 2024 IEEE International Solid-State Circuits Conference
OPTICAL/ELECTRICAL CO-DESIGN
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 62 of 86
© 2024 IEEE International Solid-State Circuits Conference
Receiver Interconnect Optimization Passive Modelling
10 fF Cbump+Cvia
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 63 of 86
© 2024 IEEE International Solid-State Circuits Conference
On-Die T-Coil Incorporated into Co-Optimization
2x BW
w/ T-coil
Bump
model 20 pH Lbump+Lvia
10 fF Cbump+Cvia
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 64 of 86
© 2024 IEEE International Solid-State Circuits Conference
Optical Receiver Front-End + DSP Co-Design
[Radi et al, “Optimal Optical Receivers in
Nanoscale CMOS: A Tutorial,” TCAS-II, 2022] Rx bump
model TIA
Pkg.
Trace
Tcoil
Photodiode model Tx bump
FFE
DFE
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 65 of 86
© 2024 IEEE International Solid-State Circuits Conference
PD to RX Interconnect Optimization Simulations
60 GHz 60 GHz
• Click to edit Master text styles
• Second level
• Third level
• Fourth level
• Fifth level
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 66 of 86
© 2024 IEEE International Solid-State Circuits Conference
Complete Optical Receiver Co-Design
• Dozens of design
parameters
• Co-optimization
can minimize
output noise
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 67 of 86
© 2024 IEEE International Solid-State Circuits Conference
T-Coil Optimization Flow Chart
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 69 of 86
© 2024 IEEE International Solid-State Circuits Conference
Optimization Flow
[B. Radi, Z. Li, D. Patel and A. Chan Carusone, "Optimizing the
Photodetector/Analog Front-End Interface in Optical Communication
Receivers," in IEEE Transactions on Signal and Power Integrity, Aug. 2023]
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 70 of 86
© 2024 IEEE International Solid-State Circuits Conference
Proposed TIA overview
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 71 of 86
© 2024 IEEE International Solid-State Circuits Conference
16nm FinFET Prototype TIAs
RX1
200 µm
RX4 RX3
Digital Control
3
# Block
L W iPD
(µm) (µm) 2
1 3-Satge TIA 150 70
600 µm
2 DCOC Loop 80 75
CML Output
3 80 110
Buffers
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 72 of 86
© 2024 IEEE International Solid-State Circuits Conference
Co-Packaged Photodiodes + TIA Prototypes: Diagram
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 73 of 86
© 2024 IEEE International Solid-State Circuits Conference
Experimental results
• Performance
improvement
validated
using higher
Z0 over
shorter
package
traces
[Radi, Li, Patel, Chan
Carusone, "Optimizing the
Photodetector/Analog Front-
End Interface in Optical
Communication Receivers," in
IEEE Trans. Signal and Power
Integrity, 2023.]
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 74 of 86
© 2024 IEEE International Solid-State Circuits Conference
BEYOND 200GB/S
Wavelength
Wavelength
E→O, O→E, 2
Demux
2
Tx Rx
sufficient
Mux
100G 100G
accuracy/ Tx
E→O, 3 O→E, 3
Rx
tunability 100G 100G
E→O, 4 O→E, 4
Tx Rx
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 76 of 86
© 2024 IEEE International Solid-State Circuits Conference
Signaling Beyond 200Gb/s
• 4-PAM with increased baud rate
➢Combination of increased analog and
optical bandwidth, enhanced DSP and
coding
• Early technology demonstrations for
6-PAM and 8-PAM optics
• Right: [K. Naoe, "Ultrahigh Speed EA-DFB Lasers beyond 200 Gbps
per Lane," 2023 OFC 2023.]
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 77 of 86
© 2024 IEEE International Solid-State Circuits Conference
Evolution of Coherent Transmission
Coherent optical communication is a proven technology that allows for 4x data rate at the
same baud rate compared with IMDD
Trend towards
the use of
3200G coherent
transmission
MODULE SPEED
over shorter
1600G distances
Power and cost
100+ GBaud is reduced at
800G each step
50 GBaud
➢ Coherent
transmission
400G pushing
towards use
2-5m 0.1km 0.5km 2km 10km 40km 120km inside the
REACH datacenter
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 78 of 86
© 2024 IEEE International Solid-State Circuits Conference
Coherent-Lite Solutions for less than 10km Reach
Transmitter:
• Avenues for research: Receiver:
• Reductions in laser cost
• Reductions in DSP power consumption
• Recent adoption of O-Band for
100+Gbaud (800Gbps DP-16QAM)
coherent links below 10km in length
• O-Band allows for lower power DSP
implementation
• Higher fiber loss in X Tx, X Rx Y Tx, X Rx X Tx, Y Rx Y Tx, Y Rx
O-Band (compared
with traditional
C-Band) is acceptable
for short distances
(< 10km)
Example at 56GBaud [Maharry et al, UCSB, OFC 2023]
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 79 of 86
© 2024 IEEE International Solid-State Circuits Conference
Coherent-Lite Solutions for less than 10km Reach
• 64 Gbaud
DP-16QAM carries
400+Gb/s over a QPSK @ 64Gbd
EVM < 10%
single wavelength
➢Potential path
towards
3.2Gbps/fiber and beyond
using multiple wavelengths
QAM-16 @ 64Gbd
EVM < 9%
Right: [Intel, OFC’23]
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 80 of 86
© 2024 IEEE International Solid-State Circuits Conference
Short-Reach Coherent DSP Alternatives
Analog / Oversampled Baud-Rate
Mixed-Signal Digital Digital
1. C-Band Coherent DSP
• Modest oversampling, M/N
• FFT based CD filter
• Rate conversion after the CD
filter
Assumptions:
MMSE
• No noise FFE
• Ideal samplers (no quantization) Synchronous
Baud-Rate Sampling
• Unlimited tap count
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 82 of 86
© 2024 IEEE International Solid-State Circuits Conference
Aliasing Penalty of Low Over-Sampling Rate DSP
➢ Non-negligible penalty due
to aliasing of the signal for
oversampling ratios up to
M/N = 1.2
➢ No penalty for
synchronous baud-rate
sampling
➢ Performance improvement
noticeable over short
Penalty due to reaches
re-sampler ➢ Performance improvement
aliasing is increased with higher Tx
BW
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 83 of 86
© 2024 IEEE International Solid-State Circuits Conference
Experimental Demonstration of Low-BER Floor
• Low-BER floor
demonstration using an
FPGA baud rate
architecture
➢ Potential for lower
power and lower latency
coherent optical
communication over
short-reach applications
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 84 of 86
© 2024 IEEE International Solid-State Circuits Conference
Summary
DSP
• Megatrends driving connectivity:
• Massive scale-out of compute for AI
Analog
• AI workloads demanding low latency, Devices
Design
low power, high throughput
• Disaggregated storage is another
driver R&D
• Proliferation of new connectivity Collab.
technologies
• Electrical links with 40+dB loss
• Optical connectivity over shorter FEC
Optics
reaches Coding
• Linear optics, co-packaged optics, new
Compute
optical modulation technologies, …
• Coherent optical links over shorter
and
reaches Network
Arch.
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 85 of 86
© 2024 IEEE International Solid-State Circuits Conference
Acknowledgements
• Behzad Dehlaghi, Shayan Shahramian, Ming Yang,
Or Vidal, Alik Gorshtein, Nir Sheffi
• Dhruv Patel, Chris Li, Bahaa Radi
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 86 of 86
© 2024 IEEE International Solid-State Circuits Conference
Please Scan to Rate
This Paper
Tony Chan Carusone ISSCC 2024 - Forum 6.2: The Impact of Industry Trends on 200+Gbps Wireline R&D 87 of 86
© 2024 IEEE International Solid-State Circuits Conference
Beyond 200Gbps Electrical
transceivers – Circuit Architecture,
Design Implementation and
Silicon Results
Ariel Cohen
Intel Corporation
Package design
Summary
Ariel Cohen Beyond 200Gbps Electrical transceivers – Circuit
© 2024 IEEE International Solid-State Circuits Conference Architecture, Design Implementation and Silicon Results 2
Ethernet Port Speed Evolution
15 years ago
Now/Future(?)
Now/Future(?)
Now/Future(?)
Now/Future(?)
Trending 1.6Tb
in 2025!!
• Ethernet single lane speed is increased at 20x/15yr
Ariel Cohen Beyond 200Gbps Electrical transceivers – Circuit
© 2024 IEEE International Solid-State Circuits Conference Architecture, Design Implementation and Silicon Results 6
CMOS Electrical Links in the Past 21 Years
Both speed and efficiency have scaled by >21x in the past 21 years!
▪ AlaskaTM ▪ EPHYTM
▪ Dual 400GbE w/ ▪ 112G LR SerDes in
▪ TomahawkTM 5 100G PAM4 SerDes 7nm
▪ Agilex-ITM FPGA ▪ 51.2 Tb/s in 5nm
▪ 116Gb/s PAM-4 LR ▪ 64x800GbE
▪ 4.2 Tb/s Max BW ▪ 8x106Gb/s PAM4 LR
112G-ELR PAM4
▪ AlphaCORETM SerDes PHY in 5nm
▪ 112G LR SerDes IP
Source: “PAMn vs Channel and FEC investigation for 224 Gb/s”, IEEE P802.3df Ethernet Task Force
8.9ps
√ Transistor’s But …
Bandwidth X Not friendly for
√ Low Power analog design
√ Dimensions X Heat aware design Radosavljević et. el, IEDM,2023 (Intel)
√ Low patristics X Reliability
Ariel Cohen Beyond 200Gbps Electrical transceivers – Circuit
© 2024 IEEE International Solid-State Circuits Conference Architecture, Design Implementation and Silicon Results 14
Outline
Background and key challenges
Package design
Summary
Ariel Cohen Beyond 200Gbps Electrical transceivers – Circuit
© 2024 IEEE International Solid-State Circuits Conference Architecture, Design Implementation and Silicon Results 15
Why DAC-ADC Transceiver for 224Gb/s
At 224Gb/s data-rate, we need to deal with
Tough channels with higher 56GHz Nyquist
◼ Higher insertion loss (package, board, connector, etc.)
◼ Critical reflections at many UIs later than the cursor (large package and short UI)
◼ Sharp roll-offs or some notches
◼ Non-ideal smoothness (ripples in insertion loss)
Implementation
◼ Analog DFE loop timing in RX can not be done with a UI=8.9ps
Need more powerful and flexible channel equalization for good BER
→ Need many-tap TX/RX FFE, continuous time EQ, DFE, sometimes advanced
equalization (sliding-block DFE, MLSD, etc)
TX
TX
uP + DSP
Controls Coefficients
CDR
Package design
Summary
Ariel Cohen Beyond 200Gbps Electrical transceivers – Circuit
© 2024 IEEE International Solid-State Circuits Conference Architecture, Design Implementation and Silicon Results 19
RX Architecture Philosophy
DSP
In
Analog Front End 6-bit 16X4 Digital out
with powerful interleaved Equalizer
Equalizer ADC
Low noise
clock
clock & data
phases recovery
Low noise
clock
clock & data
phases recovery
16x6b
+-
i clgc
x4
x4
Φ1,1 Φ2,x
16x6b
DGlobal Align
+-
clgc
In Matching Digital out
64x6b
Network in x4
+ESD Φ1,2 Φ2,x
x4 Equalizer
16x6b
+
-
EQ CTLE
+2 STG clgc
VGA x4
x4
Φ1,3 Φ2,x
16x6b
+
-
clgc
16x6b
+-
i clgc
x4
x4
Φ1,1 Φ2,x
16x6b
DGlobal Align
+-
clgc
In Matching Digital out
64x6b
Network in x4
+ESD Φ1,2 Φ2,x
x4 Equalizer
16x6b
+
-
EQ CTLE
+2 STG clgc
VGA x4
x4
Φ1,3 Φ2,x
16x6b
+
-
clgc
16x6b
+-
i clgc
x4
x4
Φ Φ2,x
16x6b
DGlobal Align
1,1
+-
clgc
In Matching Digital out
64x6b
Network in x4
+ESD Φ1,2 Φ2,x
x4 Equalizer
16x6b
+
-
EQ CTLE
+2 STG clgc
VGA x4
x4
Φ1,3 Φ2,x
16x6b
+
-
clgc
16x6b
+-
i clgc
x4
x4
Φ1,1 Φ2,x
16x6b
DGlobal Align
+-
clgc
In Matching Digital out
64x6b
Network in x4
+ESD Φ1,2 Φ2,x
x4 Equalizer
16x6b
+
-
EQ CTLE
+2 STG clgc
VGA x4
x4
Φ1,3 Φ2,x
16x6b
+
-
clgc
16x6b
+-
i clgc
x4
x4
Φ1,1 Φ2,x
16x6b
DGlobal Align
+-
clgc
In Matching Digital out
64x6b
Network in x4
+ESD Φ1,2 Φ2,x
x4 Equalizer
16x6b
+
-
EQ CTLE
+2 STG clgc
VGA x4
x4
Φ1,3 Φ2,x
16x6b
+
-
clgc
16x6b
+-
i clgc
x4
x4
Φ1,1 Φ2,x
16x6b
DGlobal Align
+-
clgc
In Matching Digital out
64x6b
Network in x4
+ESD Φ1,2 Φ2,x
x4 Equalizer
16x6b
+
-
EQ CTLE
+2 STG clgc
VGA x4
x4
Φ1,3 Φ2,x
16x6b
+
-
clgc
16x6b
+-
i clgc
x4
x4
Φ1,1 Φ2,x
16x6b
DGlobal Align
+-
clgc
In Matching Digital out
64x6b
Network in x4
+ESD Φ1,2 Φ2,x
x4 Equalizer
16x6b
+
-
EQ CTLE
+2 STG clgc
VGA x4
x4
Φ Φ2,x
16x6b
1,3
+
-
clgc
16x6b
+-
i clgc
x4
x4
Φ Φ2,x
16x6b
DGlobal Align
1,1
+-
clgc
In Matching Digital out
64x6b
Network in x4
+ESD Φ1,2 Φ2,x
x4 Equalizer
16x6b
+
-
EQ CTLE
+2 STG clgc
VGA x4
x4
Φ1,3 Φ2,x
16x6b
+
-
clgc
16x6b
+-
i clgc
x4
x4
Φ Φ2,x
16x6b
DGlobal Align
1,1
+-
clgc
In Matching Digital out
64x6b
Network in x4
+ESD Φ1,2 Φ2,x
x4 Equalizer
16x6b
+
-
EQ CTLE
+2 STG clgc
VGA x4
x4
Φ1,3 Φ2,x
16x6b
+
-
clgc
16x6b
+-
i clgc
x4
x4
Φ1,1 Φ2,x
16x6b
DGlobal Align
+-
clgc
In Matching Digital out
64x6b
Network in x4
+ESD Φ1,2 Φ2,x
x4 Equalizer
16x6b
+
-
EQ CTLE
+2 STG clgc
VGA x4
x4
Φ1,3 Φ2,x
16x6b
+
-
clgc
Hybrid CTLE
◼ Shunt peaking (Q-shaping) & RC source
degeneration → Better channel matching
CMOS gm
◼ 2x gm at the same bias current
20dB boost & 7dB peak gain @ 53GHz
Noise optimized sizing
1.2-1.5V supply to ensure linearity
Low impedance Vcm for low CM gain
Ariel Cohen Beyond 200Gbps Electrical transceivers – Circuit
© 2024 IEEE International Solid-State Circuits Conference Architecture, Design Implementation and Silicon Results 33
Q-Shaping (LC-tuned Amplifier)
L 𝜔𝐿 2
CL 𝑄𝑖𝑛𝑑 = 𝑅𝑝 = 𝑄𝑖𝑛𝑑 𝑅𝐿 = 𝑄𝑖𝑛𝑑 𝜔𝐿
gm 𝑅𝐿
RL
RD CD 𝑔𝑚 1
𝐴𝑣 = 𝑔
𝑚 𝐷 𝑅 𝑄𝑖𝑛𝑑 𝜔𝐿 , 𝜔 =
1+ 1+𝑗𝜔𝑅 2𝜋 𝐿𝐶𝐿
𝐷 𝐶𝐷
70GHz 70GHz
LaCroix, ISSCC 2021 [42] Xu, ISSCC 2021 [43] Guo, ISSCC 2022 [36]
116Gb/s 112Gb/s 112.5Gb/s
Ariel Cohen Beyond 200Gbps Electrical transceivers – Circuit
© 2024 IEEE International Solid-State Circuits Conference Architecture, Design Implementation and Silicon Results 36
Interleaved ADC Front-End
16x6b
+-
i clgc
x4
x4
Φ1,1 Φ2,x
16x6b
DGlobal Align
+-
clgc
In Matching Digital out
64x6b
Network in x4
+ESD Φ1,2 Φ2,x
x4 Equalizer
16x6b
+
-
EQ CTLE
+2 STG clgc
VGA x4
x4
Φ Φ2,x
16x6b
1,3
+
-
clgc
VBP VBP
clk clk
VCMFB
VBP VBP
clk clk
VCMFB
VBP VBP
clk clk
VCMFB
VBP VBP
clk clk
VCMFB
16x6b
+-
i clgc
x4
x4
Φ1,1 Φ2,x
16x6b
DGlobal Align
+-
clgc
In Matching Digital out
64x6b
Network in x4
+ESD Φ1,2 Φ2,x
x4 Equalizer
16x6b
+
-
EQ CTLE
+2 STG clgc
VGA x4
x4
Φ1,3 Φ2,x
16x6b
+
-
clgc
Vs Vs Vs
|Vo/Vs|
|Vo/Vs|
|Vo/Vs|
filtering
Low-dropout regulators
(LDO) regulate the supply
noise and and ensure low
deterministic jitter (Dj)
16x6b
+-
i clgc
x4
x4
Φ Φ2,x
16x6b
DGlobal Align
1,1
+-
clgc
In Matching Digital out
64x6b
Network in x4
+ESD Φ1,2 Φ2,x
x4 Equalizer
16x6b
+
-
EQ CTLE
+2 STG clgc
VGA x4
x4
Φ1,3 Φ2,x
16x6b
+
-
clgc
Package design
Summary
Ariel Cohen Beyond 200Gbps Electrical transceivers – Circuit
© 2024 IEEE International Solid-State Circuits Conference Architecture, Design Implementation and Silicon Results 51
224Gb/s TX – Clocking & Serialization
Final data serialization comes with clocking architecture (# of phases)
Should be planned together and co-optimized given technology’s capability
Deven D0 D0
D1
Dodd 2:1 D2 4:1 8:1
D3
D7
2 4 8
Timing
Jitter BW
Ariel Cohen Beyond 200Gbps Electrical transceivers – Circuit
© 2024 IEEE International Solid-State Circuits Conference Architecture, Design Implementation and Silicon Results 53
N3 224Gb/s TX Design [45] - Architecture
7b-
9-Tap segmented
FFE DAC
NRZ
PAM-4
PAM-6
4:1 serializer
directly at the
output
>36dB SNDR
>0.97 RLM
>1Vpp swing
Timing
Jitter BW
Ariel Cohen Beyond 200Gbps Electrical transceivers – Circuit
© 2024 IEEE International Solid-State Circuits Conference Architecture, Design Implementation and Silicon Results 54
N3 224Gb/s TX Design [45] - Architecture
LC D-PLL
<-99dBc/Hz
@1MHz
Timing
CMOS
w/without
inductors
<RJ 62fs
<DJ 110fs Jitter BW
Ariel Cohen Beyond 200Gbps Electrical transceivers – Circuit
© 2024 IEEE International Solid-State Circuits Conference Architecture, Design Implementation and Silicon Results 55
N3 224Gb/s TX Design [45] - Architecture
Timing
Replica based
Jitter BW
DRV with phase
detector for
Ariel Cohen phase calibration
Beyond 200Gbps Electrical transceivers – Circuit
© 2024 IEEE International Solid-State Circuits Conference Architecture, Design Implementation and Silicon Results 56
N3 224Gb/s TX Design [45] - Architecture
7b-
9-Tap segmented
FFE DAC
NRZ
PAM-4
PAM-6
4:1 serializer
directly at the
output
LC D-PLL
>36dB SNDR
<-99dBc/Hz
>0.97 RLM
@1MHz
>1Vpp swing
Timing
CMOS
w/without
inductors Replica based DRV
<RJ 62fs with phase
Jitter detector for phase
<DJ 110fs BW calibration
Ariel Cohen Beyond 200Gbps Electrical transceivers – Circuit
© 2024 IEEE International Solid-State Circuits Conference Architecture, Design Implementation and Silicon Results 57
Low-power 224Gb/s TX [45] - Driver
4:1 Serializer and output driver 4UI clock dist. and output network floorplan
Voltage (V)
Q4 0.25V 0.8
Q3
0.6
Q2
M3 1 1 0 1 0 0 1 0 1 1
1V M4
Q1 0.4
8:4 MUX Retimer Pulse Gen
0.2
VCC
M5 0
Y 1.05E-09 1.07E-09 1.09E-09 1.11E-09 1.13E-09 1.15E-09
Time (s)
8:4 D M6
L X M2 Node Y
1.2
M7
1
M1
Voltage (V)
0.8
0.6 0 0 1 0 1 1 0 1 0 0
4:1 MUX Driver
VSSHI 0.4
0.2
28nm CMOS
Package design
Summary
Ariel Cohen Beyond 200Gbps Electrical transceivers – Circuit
© 2024 IEEE International Solid-State Circuits Conference Architecture, Design Implementation and Silicon Results 63
Package Design – On Package Connector
Top-side
Die
Key aspects:
Connector 1
1
C4 – BGA connectivity
C4 –on package connector
Board
2
Custom pitch
Custom ball size
Optimizing high speed
Bump
Package design
Summary
Ariel Cohen Beyond 200Gbps Electrical transceivers – Circuit
© 2024 IEEE International Solid-State Circuits Conference Architecture, Design Implementation and Silicon Results 69
Die images
Keysight
1m cable and ISI coupon M8199A AWG
for channel loss of
31.6dB (including PCB
connectors and package)
Die
Package High-density
Connector
Board
High-density Packaged &
connector on PCB Socketed Die
Increasing Cd
QPRBS31
QPRBS31 TX ISI Channel RX Checker
PreFEC BER vs IL
PAM-4 Histogram
X 1e6
Package design
Summary
Ariel Cohen Beyond 200Gbps Electrical transceivers – Circuit
© 2024 IEEE International Solid-State Circuits Conference Architecture, Design Implementation and Silicon Results 84
Passive Cable: 1-meter with 2 OSFP connectors
Collaborative demo with
Amphenol @ ECOC23
X 1e6
Laser
RIN noise Fiber
Modulator
TIA RX
TX Drv TP2 TP3
Photo diode
SOC EIC(Electrical IC) PIC(Photonic IC) PIC(Photonic IC) EIC(Electrical IC) SOC
r
Modulato
Drv TIA RX
TX TP2 TP3
Photo diode
SOC EIC(Electrical IC) PIC(Photonic IC) PIC(Photonic IC) EIC(Electrical IC) SOC
End-to-end data transmission in 224Gb/s through >10 km fiber optic cable! With 1e-4 BER
First seen in the industry
Collaborative demo
with NewPhotonics
LTD @ ECOC2023
Package design
Summary
Ariel Cohen Beyond 200Gbps Electrical transceivers – Circuit
© 2024 IEEE International Solid-State Circuits Conference Architecture, Design Implementation and Silicon Results 91
Summary
We presented the details of state-of-the-art 224Gb/s TX and RX focusing on
design techniques for core circuit blocks
>30% power reduction versus 100G transceivers with a comparable die size:
◼ RX - 1.41pJ/b achieved by the combination of hybrid CTLE, inductive peaking clocks
and 6-bit memory-less ADC
◼ TX - 0.92pJ/b achieved by the combination 1UI pulse generation drives 1:4 mux
driver, inductive peaking clocks and circuit adaptation based on replica
Breakthrough in Signal Integrity and co-design with Serdes is essential for the
deployment of 200G technology
Naim Ben-Hamida
Senior Director
Ciena Corporation
J. Wei et al, “Experimental demonstration of advanced modulation formats for data center networks on 200 Gb/s lane rate IMDD links”,Vol. 28 No. 23 / 9
November 2020 / Optics Express
Cost of Optics
•IMDD module: 0.5$/Gbps (400$ for 800G)
•Copper cables: 0.05$/Gbps (80$ per cable)
Power of Optics
•IMDD: 16pJ/bit
•LPO: 5-8pJ/bit
•Half retime: 8-12pJ/bit
Chip walls
•Power wall per chip at 1kW
•Cost wall
•Reticle size (26 mm by 33 mm or 858 mm²)
•Time to market and accelerated demand
•Yield issues
Wavelogic 6e
3nm CMOS
>200 Gs/s DAC/ADC
100.00 Wavelogic 5e
QREW 90nm CMOS 7nm CMOS
4x22Gs/s 6 bit ADC 120 Gs/s DAC/ADC
1.00
© 2024 IEEE 1990 1995 2000 2005 2010 2015 2020 2025
International Solid-State Circuits Conference Year
Common IP for Flexible Reach
Long reach
Medium reach
Short reach
© 2024 IEEE
International Solid-State Circuits Conference
Digital Fractional PLLs
Bang-Bang PLLs with Quantization Noise Cancellation
76.7fs-Integrated-Jitter and -71.9dBc In-Band Fractional- A 66fsrms Jitter 12.8-to-15.2GHz Fractional-N Bang-Bang
Spur Bang-Bang Digital PLL Based on an Inverse- PLL with Digital Frequency-Error Recovery for Fast
Constant-Slope DTC and FCW Subtractive Dithering Locking (Santiccioli, ISSCC 2020)
S. M. Dartizio et al., "4.3 A 76.7fs-lntegrated-Jitter and −71.9dBc In-Band A. Santiccioli et al., "17.2 A 66fsrmsJitter 12.8-to-15.2GHz Fractional-
Fractional-Spur Bang-Bang Digital PLL Based on an Inverse-Constant-Slope N Bang-Bang PLL with Digital Frequency-Error Recovery for Fast
DTC and FCW Subtractive Dithering," 2023 IEEE International Solid-State Locking," 2020 IEEE International Solid-State Circuits Conference -
Circuits Conference (ISSCC), San Francisco, CA, USA, 2023, pp. 3-5 (ISSCC), San Francisco, CA, USA, 2020, pp. 268-270
D. Kar, S. Mohapatra, M. A. Hoque and D. Heo, "A 14 GHz Integer-N X. Geng, Y. Tian, Y. Xiao, Z. Ye, Q. Xie and Z. Wang, "A 25.8GHz Integer-N PLL With
Sub-Sampling PLL With RMS-Jitter of 85.4 fs Occupying an Ultra Low Time-Amplifying Phase-Frequency Detector Achieving 60fsrms Jitter, -252.8dB FoMJ,
Area of 0.0918 mm 2," in IEEE Transactions on Circuits and Systems I and Robust Lock Acquisition Performance," 2022 IEEE International Solid-State
Circuits Conference (ISSCC), San Francisco, CA, USA, 2022, pp. 388-390
B. -T. Moon, S. -G. Lee and J. Choi, "24.2 A 264-to-287GHz, −2.5dBm Output J. Bang, J. Kim, S. Jung, S. Park and J. Choi, "4.6 A 47fsrms-Jitter and 26.6mW
Power, and −92dBc/Hz 1MHz-Phase-Noise CMOS Signal Source Adopting a 75fsrms 103.5GHz PLL with Power-Gating Injection-Locked Frequency-Multiplier-Based
Jitter D-Band Cascaded Sub-Sampling PLL," 2023 IEEE International Solid-State Phase Detector and Extended Loop Bandwidth," 2023 IEEE International Solid-
Circuits Conference (ISSCC), San Francisco, CA, USA, 2023, pp. 364-36 State Circuits Conference (ISSCC), San Francisco, CA, USA, 2023, pp. 84-86
© 2024 IEEE
International Solid-State Circuits Conference
High Speed ADC
Performance curve showing both SNDR and ENOB as a function of the output frequency of an 8-bit
time-interleaved ADC operating at a sampling rate of 68GS/s and fabricated in 28nm CMOS SOI.
Y. Greshishchev et al “A 60 GS/s 8-b DAC with > 29.5dB SINAD up to Nyquist frequency in 7nm FinFET CMOS” 2019 IEEE BiCMOS and Compound Semiconductor
Integrated Circuits and Technology Conference
Y. Segal et al., “A 1.41pJ/b 224Gb/s PAM-4 SerDes Receiver with 31dB Loss Compensation,” in Proc. IEEE Int. Solid-State Circuits Conf. (ISSCC),
2022, pp. 114-115.
Silicon performance:
✓ The AFE and ADC supports
up to 25-dB boost and 18-dB
gain at a peak frequency of 53
GHz.
✓ Power consumption is 49mW.
✓ CMOS5nm
© 2024 IEEE
International Solid-State Circuits Conference
High Speed DAC
Y. Greshishchev et al., "A 60 GS/s 8-b DAC with > 29.5dB SINAD up to Nyquist frequency
in 7nm FinFET CMOS," 2019 IEEE BiCMOS and Compound semiconductor Integrated
Circuits and Technology Symposium (BCICTS), Nashville, TN, USA, 2019, pp. 1-4
J. Im et al., "A 112-Gb/s PAM-4 Long-Reach Wireline Transceiver Using a 36-Way M. -A. LaCroix et al., "8.4 A 116Gb/s DSP-Based Wireline Transceiver in 7nm CMOS
Time-Interleaved SAR ADC and Inverter-Based RX Analog Front-End in 7-nm Achieving 6pJ/b at 45dB Loss in PAM-4/Duo-PAM-4 and 52dB in PAM-2," 2021 IEEE
FinFET," in IEEE Journal of Solid-State Circuits, vol. 56, no. 1, pp. 7-18, Jan. 2021 International Solid-State Circuits Conference (ISSCC), San Francisco, CA, USA,
2021, pp. 132-134
J. Kim et al., "A 224-Gb/s DAC-Based PAM-4 Quarter-Rate Transmitter With 8-Tap FFE in 10-nm FinFET," in IEEE Journal of Solid-State Circuits, vol. 57, no. 1, pp. 6-20, Jan. 2022
D. Widmann et al, “A Time-Interleaved Digital-to-Analog Converter up to 118 GS/s With Integrated Analog Multiplexer in 28-nm FD-SOI CMOS Technology”
IEEE Journal of Solid-State Circuits, pp. 1--15, 2023
C. Schmidt et al., "Data Converter Interleaving: Current Trends and Future Perspectives," in IEEE Communications Magazine, vol. 58, no. 5, pp. 19-25, May 2020
Electrical interleaving will be pursued in the next couple of years enabling broad-bandwidth
single-wavelength transceivers. Later, optical interleaving will complement the electrical approach
allowing waveforms spanning multiple Terahertz of bandwidth, which would enable truly flexible
optical networks.
C. Schmidt et al., "Data Converter Interleaving: Current Trends and Future Perspectives," in IEEE Communications Magazine, vol. 58, no. 5, pp. 19-25, May 2020
© 2024 IEEE
International Solid-State Circuits Conference
Baud rates for different line rates and modulation formats assuming 28% FEC overhead and Dual Polarization
𝐶 = 𝐵𝑊 ∗ 𝑙𝑜𝑔2 (𝑠𝑛𝑟 + 1)
(a) Without constellation shaping and (b)
with shaping [Roberts, 2016]
ISSCC 2024 - Forum 6.4: Modulation schemes 29
© 2024 IEEE for ultra-high-speed transceivers
International Solid-State Circuits Conference
High Speed Optical Transceiver Requirements
Ciena’s WL6 @1600Gb/s
800
400
200
100
10 40
Equivalent capacity per technology generation
2005 2008 2009 2012 2017 2020 2024
WL WL2 WL2 WL3 WLAI WL5E WL6
A/D FIR FI
xpol A/D R xpol (equalized)
FI
R exp(-
FI j*φ(t))
R
FI
ypol A/D
A/D FIR ypol (equalized)
R
R y = R3 + j R4
Rx Laser A
‘LO’
Rx hxx hyx E x
Channel R = h , E x & E y are transmitted
Model: y xy hyy E y
Eˆ x c xx c yx Rx
Function of ˆ = R
coherent Rx: y xy
E c c yy y
1 3
0 0
0
p
s
Effects of Polarization Mode Dispersion
0
p
s
DSP tracking
Amplitude
Polarization
noise
Strong FEC
Standard
a = 1.0
Adaptive
[h] Algorithm
Out
Adaptive
Algorithm
Frequency-domain HN-1
tN-1 t’N-1
HN-2 Given a filter length of N taps operating on N samples
tN-2 t’N-2 ◼ Time domain complexity O(N2)
HN/2
◼ Frequency domain complexity O(N x log2(N)) ;
tN/2 t’N/2 N~radix2
HN/2-1
CFFT ICFFT
◼ Power dissipation at a given process node ~
tN/2-1 t’N/2-1
complexity
H1
For 115 Gb/s DP-QPSK and 1500 km of G.652 fiber,
t1
H0
t’1
dispersion spreads across ~160 T/2 samples
See, for example, R. Kudo et al., JLT, v.27 No.16 pp. 3721-3728 (2009).
t0 t’0
SerDes
Encryption
ADC Client Interfaces & IP
Rx Filter
Client n x 100GE over CAUI
1 x 400GE over CDAUI
n x OTU-4 over OTL4.4
1 x OTUC4 over 4 x OTL4.4
Encryption 256-AES per ODUFlex, ODU4
25G Flex Rate Granularity
BDA FEC
100GHz 100GHz
ICR CDM
200GBaud, 1.6Tb/s
© 2024 IEEE
International Solid-State Circuits Conference
400ZR Coherent Pluggable
© 2024 IEEE
International Solid-State Circuits Conference
BER vs SNDR for PAM2-4-6-8-12-16
1e-2
1e-4
6dB 1e-6
7nm measurement results: SNR and BER results for different residual channel loss
H. Li et al., "A 3-D-Integrated Silicon Photonic Microring-Based 112-Gb/s PAM-4 J. Rafique, T. Nguyen and S. P. Voinigescu, "A 4.6V, 6-bit, 64GS/s Transmitter in
Transmitter With Nonlinear Equalization and Thermal Control," in IEEE Journal of 22nm FDSOI CMOS," 2019 IEEE BiCMOS and Compound semiconductor Integrated
Solid-State Circuits, vol. 56, no. 1, pp. 19-29, Jan. 2021 Circuits and Technology Symposium (BCICTS), Nashville, TN, USA, 2019, pp. 1-4
NRZ 112Gb/s
PAM-4 112Gb/s
ISSCC 2024 - Forum 6.4: Modulation schemes 48
© 2024 IEEE for ultra-high-speed transceivers
International Solid-State Circuits Conference
224GS/s System for 400Gb/s
Unterminated 1 2 3
SNR 1e-2
1e-4
Low SNR High SNR
1e-6
Z. Jiang, H. Beshara, J. Lam, N. Ben-Hamida and C. Plett, "High Speed DMT for 224 Gb/s and Faster Wireline Transmission," in IEEE Transactions on Circuits and
Systems I: Regular Papers, vol. 70, no. 4, pp. 1758-1771, April 2023,
BER for the DMT system at different data rates with both transmit pre-compensation and
receive-side post compensation applied and with only receive-side post-compensation
ISSCC 2024 - Forum 6.4: Modulation schemes 57
© 2024 IEEE for ultra-high-speed transceivers
International Solid-State Circuits Conference
Test bed setup
Plot of the frequency plan for the 252 to 400 Plot of the measured BER vs sub-channel
Gb/s measurement run showing constellation frequency for the 252 to 400 Gb/s DMT runs
size vs sub-channel frequency
Power: For long reach Serdes requiring a larger number of taps, frequency
domain correction is advantageous over time domain.
Latency: Both advanced FEC and DMT add to the latency, but it can be
minimized.
Data Rec
Source
IFFT DAC ADC FFT
Data
Sample&Hold
Channel Model
Matlab
Channel Channel
Compensation Compensation
• Base DMT system- relies on multiple independent channels
• Less susceptible to ISI, varying frequency response
ISSCC 2024 - Forum 6.4: Modulation schemes 65
© 2024 IEEE for ultra-high-speed transceivers
International Solid-State Circuits Conference
Crest factor vs SNR
BER and data rate with varied CP length in a 280Gb/s DMT simulation
Comparison of simulated BER vs. data rate of Comparison of simulated BER vs. data rate of
conventional SERDES systems (varied from NRZ to conventional SERDES systems and DMT with and
PAM-16 with no DFE and a 50-tap ideal DFE) without the described pre-correction algorithm
Mayank Raj
1e14
Model Size: ~2x/4months
Model Parameters
1e13
GPT4
1e12
Megtron-Turing NLG PaLM
Gopher PaLM2
GPT3
1e11 LLaMA2
Turing-NLG
1e10 T5
Megtron-LM
1e9 GPT2
BERT Large
ELMo
1e8
2017 2018 2019 2020 2021 2022 2023 2024 2025 2026
Year
Source: Meta, Optica Photonic-Enabled Cloud Computing, 2023
5” PCB reach
(3 dB/inch @56 GHz)
112Gbps 224Gbps
ASIC
Substrate
ASIC
System Board
Metric Spec
Aggregate Bandwidth +10 Tbps
Energy per bit 2.5 pJ/bit
Edge Bandwidth 1 Tbps/mm
Link latency 200 ns + TOF
Link reach 100m+
BER 1e-12
BGA
Electrical IC (EIC) RX input directly on top of the Photonic IC (PIC) using Cu-
pillar bumps
15-fiber array with a 250 μm pitch via V-grooves
ISSCC 2024 - Forum 6.5: Silicon photonics AMD generated image for illustrative 7 of 50
© 2024 IEEE based high throughput optical transceivers purposes only
International Solid-State Circuits Conference
PIC
8 MRMs
16x WDM 1.5 nm channel spacing, large enough to avoid inter-modulation
Cartoon shows one bank. There are six banks per PIC
ISSCC 2024 - Forum 6.5: Silicon photonics 9 of 50
© 2024 IEEE based high throughput optical transceivers
International Solid-State Circuits Conference
What is a Micro Ring Modulator (MRM)?
Input κt Output
Resonant-based modulator
Transmission
Vnp =0V
0.6
Vnp =1.8V
0.4
Kd (drop coupling coefficient) + insertion 0
0
1308 1309 1310 1311 1312
Wavelength(nm)
*Electro Absorption Modulator **Micro Ring Modulator *** Mach Zehnder Modulator
Z (µm)
n++ p++
0
◼ Less efficient overlap with mode
◼ “Wasted” junction capacitance (Cj) -0.5
-1 0 1
X (µm)
Z (µm)
◼ More sensitive to doping condition 0
◼ Better junction overlap with the light
◼ More sensitive to doping condition -0.5
-1 0 1
◼ ~1.5x better modulation efficiency (ME) X (µm)
Transmission
Slope (/nm)
1
1
𝑓max _𝑠𝑙𝑜𝑝𝑒 = 𝑓𝑜 1 − 0.5 0
2 3𝑄
-1
-2
Choose laser λ for max 0.25
-3
optical modulation -4
amplitude(OMA) 0 -5
1309 1309.5 1310 1310.5 1311
Wavelength (nm)
Transmission
DSP 0.6
PIC
Low-speed
1. DC loop sets the IDAC code 0.4
PD 0.25
2. Sweep the thermal DAC code 0.2
to find the maximum IDAC value
Tx RM TIA Slicer
CapFF 0
3. Initialize the thermal DAC to 1308 1309 1310 1311 1312
Vcm_set value that gives max OMA based Wavelength(nm)
Heater
on the known shape of the ring 100
Uncorrelated
UnCorrr. modulator drop port output. Input
Clk.
Clk 80 75%
4. Freeze the DC loop. Monitor
IDAC
Drop (%)
the avg. power by looking no of 60
1s and 0s.
40 Drop
5. Keep the avg power fixed by
changing DAC code to maintain 20
50%1s and 50% 0s.
DAC 0
1308 1309 1310 1311 1312
Thermal Wavelength(nm)
0.8 -10
Transmission
Crosstalk (dB)
0.6 -15
0.4 -20
0.2 -25
Left Right
Adjacent Channel Adjacent
0 -30
-2 -1 0 1 2 0.6 0.8 1 1.2 1.4 1.6 1.8 2
Wavelength – Resonant Wavelength (nm) Channel Spacing (nm)
> 20 dB isolation for channel spacing > 1.2 nm at 50 Gb/s per channel
PD
Channel Spacing =
0.75nm
1 2 15 16
PD
PD
RX1 RX15
PD
2 16 2 16
PD
PD
RX2 RX16
IL 0dB
Through
Drop (dB)
-15dB
Drop
RX PD
λ
1.5nm
<1.5 dB insertion loss (IL)
Crosstalk of <15 dB
1.5 nm channel separation M. Raj et al., ISSCC 2023
ISSCC 2024 - Forum 6.5: Silicon photonics 17 of 50
© 2024 IEEE based high throughput optical transceivers
International Solid-State Circuits Conference
WDM Filter – Advantages of CRR Over RR
Drop (dB)
Drop (dB)
-15dB -15dB Drop
Drop
PD
RX
PD
RX
1.5nm λ 1.5nm λ
CRR RR
40 dB/dec roll off → better channel isolation
Flat top → better bandwidth
Flat top → kinder to channel misalignment
Thermal C
Stabilization across process variation & temperature change
ISSCC 2024 - Forum 6.5: Silicon photonics 19 of 50
© 2024 IEEE based high throughput optical transceivers
International Solid-State Circuits Conference
CRR Design: Improving Process Variation
Racetrack design → longer coupling section
coupling Through
gaps
PD
RX
Channel spacing implemented by vertical
spacers
PD
PD
RX7
ISSCC 2024 - Forum 6.5: Silicon photonics 20 of 50
© 2024 IEEE based high throughput optical transceivers
International Solid-State Circuits Conference
EIC
TX/RX/drop/heater bump
pitched matched to PIC (55 µm)
proxy IC
and supply bumps for EIC only
uses 36 µm pitch micro-bumps
for InFO
V (V)
500
20 µm
450
400
0 10 20 30 40 50 60
Frequency (GHz)
T-coil
achieve this. AC cap 200fF. 0.9v Tx
0-0.9v 0.9v
Bump
Unlike passive AC coupling, 0.9v
the latch-based solution is
insensitive to CIDs as it has
EAM
memory
‘0’ eq.
ctrl
Programmable Sub-UI Laser
Pre Equalization 0v
ISSCC 2024 - Forum 6.5: Silicon photonics
based high throughput optical transceivers 26 of 50
© 2024 IEEE
International Solid-State Circuits Conference
TX Architecture
1.8v 1.8v
‘1’ eq.
ctrl
On-chip sub-UI
20µmx30µm
eq. allows for PRBS Generators
128:1 Serializer
T-coil
larger fanout of
0.9v
the predriver 0.9v
Tx
Bump
stages without 0.9v
significantly
increasing ISI
EAM
induced jitter
‘0’ eq.
ctrl
Programmable Sub-UI Laser
Pre Equalization 0v
ISSCC 2024 - Forum 6.5: Silicon photonics 27 of 50
© 2024 IEEE based high throughput optical transceivers
International Solid-State Circuits Conference
EIC RX
8X <4> <4> 4X
4X <3> <3> 2X 80
70
2X <2> <2> 1X 60
1X <1> <1> 1X 50
5b
Zt (dB
5b
<0> <0> 1X 40 Zt 4k-6K
30
R3 R1 Photo 0.9A/W
20
BW > 35 GHz
0.12-1k 0.2-1.5k
Diode 10 Integ. Noise <3 µA
Voutb Rx µ
PD
Bump
InFo 0 105 106 107 108 109 1010 1011
Parasitic
gm3 gm2 gm1 E
S 25fF Parasitic 10fF Frequency (Hz)
D 25fF
Vout
Rx
Bump
Cin
First invertor is the main noise source due to peaking from Cin
Reduce Cin as much as possible—waveguide-based PD, no Power Grid (PG)
under bump
Increase gm until it does not increase Cin
Reduce parasitic and gate resistance in the Ipd path – use low fin mos
ISSCC 2024 - Forum 6.5: Silicon photonics 30 of 50
© 2024 IEEE based high throughput optical transceivers
International Solid-State Circuits Conference
TIA DC Loop and PSRR
5b 5b
R3 R1 Photo 0.9A/W
0.12-1k 0.2-1.5k
Diode
To Slicers
Voutb Rx µ
PD
Bump
InFo Parasitic
Vcm2 gm3 gm2 gm1 E
S 25fF Parasitic 10fF
2b res 21k Cancel
D 25fF
w/o RC filter on
R3 R1 Photo 0.9A/W
0.12-1k 0.2-1.5k
Deser. 4 +1 Diode
Voutb Rx µ
PD
Bump
InFo Parasitic
gm3 gm2 gm1 E
S 25fF Parasitic 10fF
PRBS Checker
21k 0-1.3mA
+
4b p Reg
OTA PD Current range
0.88V
Vcm Improves
To TIA
differential
CDR 4b n
PSRR
Common LC PLL
Regulator
RX µ- Error Time
bump 16x5 64x16x5 64x5
Single ended 8:64 16:1 MUX Interleaved
4 CDR
TIA Data
Data
Data
Data
Common
Mode Gen.
4 7b
16x7 PI Codes for 16 Channels 12.5GHz
ILO1 PI ILO2
Common 8 Common
PI ILO
12.5GHz ppm tracked
Common LC
ILO2 ILO1
Baud-rate CDR
12.5GHz 4 2 8 2 12.5GHz
PLL
S/D PI S/D
4 QED VtoI
Quadrature
Error Detector
Voltage to C1
Current
Proposed QLL
Proposed Quadrature Locked Loop (QLL)
Baud-rate CDR with ILO-PI-ILO architecture
QLL corrects I/Q mismatches and generates supply voltage
Reference 12.5 GHz is generated by a common LC PLL
ISSCC 2024 - Forum 6.5: Silicon photonics 35 of 50
© 2024 IEEE based high throughput optical transceivers
International Solid-State Circuits Conference
MEASUREMENTS
PIC
PRBS RX fiber
Blocked out Checker input
RX
ISSCC 2024 - Forum 6.5: Silicon photonics AMD generated image for illustrative 37 of 50
© 2024 IEEE based high throughput optical transceivers purposes only
International Solid-State Circuits Conference
Measurement Setup
Tunable laser source for 3.7dB ER 0dBm avg. power
wavelength sweep
Reference modulator to
Tunable Laser Reference Variable Optical
generate optical data input Source Modulator Attenuator
PIC (DUT)
ISSCC 2024 - Forum 6.5: Silicon photonics AMD generated image for illustrative 38 of 50
© 2024 IEEE based high throughput optical transceivers purposes only
International Solid-State Circuits Conference
RX OMA Sensitivity (BER 1e-12)
OMA Sensitivity for RX1 - RX7
OMA Sensitivity for RX1 - RX7
1E-02
1E-02
1E-03
1E-03
1E-04
1E-04 0.61
1E-05 0.61
1E-05 0.59
1E-06 0.59
1E-08 0.550.53
1E-08
1E-09 0.530.51
1E-09 0.51
1E-10 0.49
0.49 -0.75
1E-10
1E-11 -1
-0.75
-0.5-0.25 00.250.250.50.750.751
-0.25
Delta
-1 Delta wavelength
0 from
-0.5wavelength 0.5 grid (nm)
1
1E-11
1E-12 Delta wavelength from
from grid (nm)grid
1E-12 Deltarx1+1.5dBm
wavelength from rx7 grid rx1
1E-13 rx1+1.5dBm rx7 rx1
1E-13 -17 -16 -15 -14 -13 -12 -11 -10
-17 -16 -15 -14 OMA (dBm)
-13 -12 -11 -10
OMA (dBm)
RX1 RX2 RX3 RX4 RX5 RX6 RX7
RX1 RX2 RX3 RX4 RX5 RX6 RX7
1e-3
1E-03
1e-4
1E-04
1e-5
1E-05
RX1
RX1
1e-6
1E-06
RX2
RX2
BER
BER
1e-7
1E-07 RX3
RX3
1e-8
1E-08 RX4
RX4
1e-9
1E-09
RX5
RX5
RX6
RX6
1e-10
1E-10 RX7
RX7
1e-11
1E-11 14% over all RXs
1e-12
1E-12
20
20 30
30 40
40 50
50 60
60 70
70 80
80
%UI
%UI
211mV
1311.5nm RX2 1305.5nm RX6
1311.5nm 1305.5nm
211mV
1310nm
1310nm RX3 1304nm
1304nm
RX7
211mV
1308.5nm RX4
1308.5nm 1UI
211mV
1e-4
1E-04 RX2 with
RX2with crosstalk
crostalk
fromRX3
from RX3
1e-6
1E-06 RX2nono
RX2 crosstalk w. crosstalk
crosstalk
BER
BER
211mV
1e-8
1E-08
1e-10
1E-10
no crosstalk
1e-12
1E-12 1UI
20
20 40
40 60
60 80
80
%UI
% UI
0.5 1e-5 <1e-10
RX2-RX3 enabled at the same time using a 2-λ laser source
Minimal crosstalk impact on vertical and horizontal eye margins
λ=1308.04 nm ER: 3 dB
ISSCC 2024 - Forum 6.5: Silicon photonics AMD generated image for illustrative 43 of 50
© 2024 IEEE based high throughput optical transceivers purposes only
International Solid-State Circuits Conference
424Gbps in a single fiber (8x53Gbps)
ISSCC 2024 - Forum 6.5: Silicon photonics AMD generated image for illustrative 45 of 50
© 2024 IEEE based high throughput optical transceivers purposes only
International Solid-State Circuits Conference
Conclusions
45 Gb/s/λ WDM Link
Hybrid integration with EIC in 7 nm and PIC in 45 nm SOI process to get best
of both worlds
EIC consumes < 1.6 pJ/bit. Each RX and TX channel only occupies 0.031mm2
The authors thank the AMD design, layout, and verification teams.
The information presented in this document is for informational purposes only and may contain technical inaccuracies,
omissions, and typographical errors. The information contained herein is subject to change and may be rendered
inaccurate for many reasons, including but not limited to product and roadmap changes, component and motherboard
version changes, new model and/or product releases, product differences between differing manufacturers, software
changes, BIOS flashes, firmware upgrades, or the like. Any computer system has risks of security vulnerabilities that
cannot be completely prevented or mitigated. AMD assumes no obligation to update or otherwise correct or revise this
information. However, AMD reserves the right to revise this information and to make changes from time to time to the
content hereof without obligation of AMD to notify any person of such revisions or changes.
THIS INFORMATION IS PROVIDED ‘AS IS.” AMD MAKES NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE
CONTENTS HEREOF AND ASSUMES NO RESPONSIBILITY FOR ANY INACCURACIES, ERRORS, OR OMISSIONS THAT MAY
APPEAR IN THIS INFORMATION. AMD SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF NON-INFRINGEMENT,
MERCHANTABILITY, OR FITNESS FOR ANY PARTICULAR PURPOSE. IN NO EVENT WILL AMD BE LIABLE TO ANY PERSON
FOR ANY RELIANCE, DIRECT, INDIRECT, SPECIAL, OR OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF
ANY INFORMATION CONTAINED HEREIN, EVEN IF AMD IS EXPRESSLY ADVISED OF THE POSSIBILITY OF SUCH DAMAGES.
Gunther Roelkens
Micro-transfer printing
◼ Technology deep-dive
◼ Demonstrations
◼ Outlook
Conclusions
ISSCC 2024 - Forum 6.6: Micro-transfer printing for 2 of 39
© 2024 IEEE heterogeneous electronic-photonic integrated circuits
International Solid-State Circuits Conference
SiPho integration platforms
Several foundries have a silicon photonics offering
Most focus on a pure silicon photonics offering, not including monolithically
integrated electronics / lasers
Example:
Micro-transfer printing
◼ Technology deep-dive
◼ Demonstrations
◼ Outlook
Conclusions
ISSCC 2024 - Forum 6.6: Micro-transfer printing for 6 of 39
© 2024 IEEE heterogeneous electronic-photonic integrated circuits
International Solid-State Circuits Conference
Customer’s pain
Missing building blocks for fully integrated photonic systems-on-chip
Function Material systems Application domains
Gain,lasers (VIS-SWIR) GaN, GaAs, InP, GaSb All
Electro-optic modulators LiNbO3, BaTiO3, … Transceivers, Optical I/O, Quantum
1.3um electro-absorption mods InP, 2D materials Transceivers, Optical I/O
Optical isolators and circulators Ce:YIG, BIG All
Single photon sources/detectors InAs/GaAs QDs, NbTiN Quantum, Sensing
Electronics Si, SOI, III-V All
Detection beyon 1600nm InP, GaSb Sensing, Health
Inherits advantages of flip-chip integration (known good die, back-end integration) and
wafer bonding (high-throughput integration, efficient coupling)
While being less mature, micro-transfer printing has the potential to become
the dominant heterogeneous integration technology for silicon photonics
Micro-transfer printing
◼ Technology deep-dive
◼ Demonstrations
◼ Outlook
Conclusions
ISSCC 2024 - Forum 6.6: Micro-transfer printing for 15 of 39
© 2024 IEEE heterogeneous electronic-photonic integrated circuits
International Solid-State Circuits Conference
Micro-transfer printing basics
Transfer of released, micron-scale semiconductor chiplets to a silicon target
wafer
Based on the visco-elastic properties of PDMS stamps
Optics ( x, y, z )
Stamp ( z, Ɵ, Tx, Ty )
Transparent Soft
Translation stages ( x, y )
Au
Cu
III-V
Au
Fiber Edge
SiN Coupler
Poly-Si
SiO2 Si
air gap Fiber
(undercut) Grating
Si Substrate Coupler
e.g. LN for
phase
e.g. Au
modulators
Ce:YIG for Cu
optical III-V
isolators A W heater
u Si Fiber Edge
SiO2 SiN Coupler
Poly-Si
Fiber Grating
Coupler
air gap
Si Substrate (undercut)
31
10 µm
100 µm
Any material or device that can be released from its substrate can be
transfer-printed with high alignment accuracy and massively parallel to a
silicon photonics target wafer
The technology can spill over in many other domains: electronics, MEMS,
smart sensors,…
Micro-transfer printing
◼ Technology deep-dive
◼ Demonstrations
◼ Outlook
Conclusions
ISSCC 2024 - Forum 6.6: Micro-transfer printing for 37 of 39
© 2024 IEEE heterogeneous electronic-photonic integrated circuits
International Solid-State Circuits Conference
Conclusions
Silicon Photonics is an enabling technology for a wide range of applications
Tom Gray
Sr. Director of Circuit Research
NVIDIA
Compute cluster
GPU
NVSwitch GPU
NVLINK
NVSwitch
Compute network
GPU
◼ NVLINK/NVSwitch
GPU
Datacenter network
Compute cluster
Compute cluster GPU
◼ InfiniBand
NVSwitch GPU
NVSwitch
GPU
GPU ◼ Ethernet
NVLINK
NVSwitch
NVLINK GPU
NVSwitch
GPU GPU
GPU
InfiniBand/Ethernet
NVLINK
NVSwitch
GPU
GPU
Compute cluster
Compute cluster GPU
NVSwitch GPU
GPU
NVSwitch GPU
NVLINK
NVSwitch
NVLINK GPU
NVSwitch
GPU GPU
GPU
InfiniBand/Ethernet
NVLINK
NVSwitch
GPU
GPU
Compute cluster
Compute cluster GPU
NVSwitch GPU
GPU
NVSwitch GPU
NVLINK
NVSwitch
NVLINK GPU
NVSwitch
GPU GPU
GPU
InfiniBand/Ethernet
p p p
n n ... n
Laser
(comb)
... l
PD PD PD
TIA TIA TIA
Nedovic, “Methodology for Device, Circuit and Architecture Level Co-design and
Optimization of Dense Wavelength Division Multiplexing Links,” CLEO 2023
DES
Free spectral range (FSR) limits useful optical spectrum due to aliasing
FSR maximized by small radius. Radius limited by excessive curvature loss
Pack many channels into FSR to maximize throughput
Increase throughput/improve energy with slower/densely packed channels
ISSCC 2024 - Forum 6.7: Silicon Photonics and 14 of 28
© 2024 IEEE Foundry Requirements for AI Datacenters
International Solid-State Circuits Conference
Scaling
Challenges
◼ Loss: 1-2dB typical per coupler
◼ Density: 127-250mm typical spacing
◼ Testability and packaging
T. Barwicz et al.,
[A. Mekis et al., JSTQE2011] JSTQE2016
LCA Device
Device
Optical Netlist
model
model
device
Device model
simulations Multi-wavelength
circuit simulation
(Spectre/HSPICE)
Electrical
device
Link
models Link model
architecture
From CMOS
From photonic foundry /PDK developer foundry
Cost of many precision active alignments. Alignment and reflection management (no Balancing cavity gain and mode spacing.
isolators).
Array yield and manufacturing controls. Substantial process development and thermal Achieving sufficient efficiency from the nonlinear
engineering. process.
B. Lee, ICSJ, Nov. 2023
ISSCC 2024 - Forum 6.7: Silicon Photonics and 23 of 28
© 2024 IEEE Foundry Requirements for AI Datacenters
International Solid-State Circuits Conference
Packaging for 2.5D Integrated Optics
(0.5 mm thick)
(1 mm thick)
(0.1 mm thick)
(0.5 mm thick)
(1 mm thick)
(0.1 mm thick)
For TX ring Q of 7000 (32 GHz) and pp resonance shift of 8 GHz (50 pm), Closed-loop response to 500-W step in ASIC power
control loop needs to maintain temp. within ± 0.4 K (± 5 GHz) for
negligible increase in TX penalty
For RX ring Q of 5000 (45 GHz), ± 0.4 K results in negligible eye closure
Simulate a conventional proportional-integral (PI) thermal controller
Impacts of ASIC & heater powers are summed to obtain ring temp.,
which shifts resonance and impacts power measured on a weakly coupled
drop-port photodiode
Measured power is compared to target, and error is fed back to controller
Sim 2
Vladimir Stojanović
Sensing
Runtime breakdown
600B weights
150B weights
37B weights
laser
8 wavelengths/port
Gen 1: Gen 2:
Actively aligned Passively aligned
vertical attach of edge attach of
fiber pigtails fiber pigtails
2.5D EMIB
Gen 3: System-In-Package
Detachable
edge attach
optical connector
TeraPHYTM chiplets
Agilex I-series Multi-Chip
Package with SuperNovaTM
Optical I/O laser modules
2023 Demos:
ERI Summit, Hot Chips,
White House Demo Day
Supercomputing
12
© 2024 IEEE
International Solid-State Circuits Conference
Future Systems-In-Package with Optical I/O
Electrical I/F Optical I/F Optical Off-package
(Advanced Package) (CW-WDM) Chiplet IO BW (4-8
Gen
BW chiplets per
I/F Modules Tx / Rx Data Rate Ports λs / Data Rate (Tx+Rx) package)
IOs [Gbps/IO] Port [Gbps/λ]
1 AIB 24 20 / 20 2 8 8 16 2 Tbps 8-16 Tbps
SuperSwitch SuperSwitch
1000 nodes
per pod
1000 pods
Two hops to anywhere in the system
© 2024 IEEE
<1us link/flow reconfiguration 16
Laser
Power PL PS
Wall-plug efficiency αT = αC3αIL
𝜂 W = 10%
Coupling loss Insertion loss Coupling loss Coupling loss Limited Rx
αC = 4dB αIL = 5dB αC = 4dB αC = 4dB sensitivity IRx,sen
80mW
Optical Power
1.3mW ℛ (=0.5A/W)
400x
502 µW
200 µW
Laser 50:50
Power PL
150 µm
Modulator
RX AFE Rx Macro
3dB Coupler Port-2 GC
Port-1 GC
Drop
Port PD PD-1
TX
30 µm
sites
RX sites
Modulator driver (16 slices)
150µm
Flip-
Chip
Bonding
CMOS
Unit Cell
4.3 mm
High
voltage
FETs
Bootstrap
Capacitors
22 mm
Low-Voltage
with <1us switching time, Digital Logic 21
22
© 2024 IEEE
International Solid-State Circuits Conference Henriksson et al ECOC 2023
Gaps to fill
System level
Control plane programming
Single/multiple concurrent jobs
Link level
6-9dB of additional insertion loss
(switch, connectors)
Use laser-forwarded coherent WDM link
architecture
sub-100ns WDM link lock
Devices & Materials
Dense WDM, fast electro-optic tuning Mehta et al, JSSC 2020
RU
1024 antennas/RU
ISSCC 2024 - Forum 6.8 24
© 2024 IEEE
International Solid-State Circuits Conference
Disaggregated RU
Radio unit Beamforming processors
RFIC
ADC/DAC
Optical I/O
chiplets
DRU
30
© 2024 IEEE
International Solid-State Circuits Conference
ISSCC 2024 - Forum 6.8
Link Architecture
LNA
(c)
Comb laser
34
© 2024 IEEE
International Solid-State Circuits Conference Wang ESSCIRC23, Singh ECOC23
Single Ring Modulator Linearity Modeling
A noise & linearity sweetspot: blue side of the resonance wavelength
Performance of an Intensity-modulated Direct-detection Link Phase and Normalized Through Port Power
Buchbinder
CLEO21
36
© 2024 IEEE
International Solid-State Circuits Conference
Another Interconnect Bottleneck:
Cryogenic Computing (Quantum and Classical)
72 Qubits
▪Subthreshold Amplifier
▪Gain: 28dB
▪BW: 854MHz
▪Power: 38uW
▪MRM
▪30GHz/V
▪E-O BW: 2.2GHz
28 TX
test sites
9mm
50um
1.8mm
Received CURL
44
© 2024 IEEE
International Solid-State Circuits Conference
What’s next: Superconducting Multi-Chip-Module
Niobium-based integrated-
circuit fabrication process
Adamopoulos
CICC’21, CLEO’21,
OJSSC’21, SSCL’21
50 of 16
© 2024 IEEE
International Solid-State Circuits Conference