From the research paper authored by Peter Brown and Robert Mercer while at the IBM Watson
Research Center in the early 90s.
http://www.naxa.com/downloads/J93-2003.pdf
It seems the initial 5 models are based on Bayes' theorem for probability analysis. They use a
series of analyses to compare one language against another (e.g., French vs. English), looking for
patterns in how the words of the two languages connect to each other. This is for translation purposes.
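Concretely, the decomposition the paper starts from is Bayes' theorem applied to translation: to render a French sentence f into English, pick the English sentence e that maximizes the product of a language-model term and a translation-model term,

\hat{e} \;=\; \operatorname*{argmax}_{e} \Pr(e \mid f) \;=\; \operatorname*{argmax}_{e} \Pr(e)\,\Pr(f \mid e),

and the five models are successively richer ways of estimating the translation term Pr(f | e).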
Pg 7: We generally follow the common convention of using uppercase letters to denote random
variables and the corresponding lowercase letters to denote specific values that the random
variables may take. We have already used l and m to represent the lengths of the strings e and f,
and so we use L and M to denote the corresponding random variables.
As stated here by James Baker, https://www.quora.com/What-are-the-investment-strategies-of-
James-Simons-Renaissance-Technologies-I-understand-he-employs-complex-mathematical-
models-along-with-statistical-analyses-to-predict-non-equilibrium-changes
We need to understand the corresponding random variables. There is mention of Lagrange
multipliers, used to maximize the likelihood subject to the constraint that the probabilities sum to one.
Auxiliary functions are used as well to derive the parameter reestimation formulae from
extrema/maxima analysis. Conditional probabilities are used throughout. On pg 276 there are
translation, distortion, and fertility probabilities.
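Those page 276 probabilities belong to Model 3, where an alignment's score factors into fertility, translation, and distortion terms. A rough sketch of that factorization (my own simplification; it omits the null-word and fertility-factorial terms of the actual model, and the parameter tables n, t, d are assumed to be plain dictionaries):

from math import prod

def model3_alignment_score(e_words, f_words, alignment, n, t, d):
    # alignment[j-1] = i means French position j links to English position i
    # (1-based, no null word in this simplified sketch).
    l, m = len(e_words), len(f_words)
    # fertility: how many French words each English word generates
    phi = [alignment.count(i) for i in range(1, l + 1)]
    fertility = prod(n.get((phi[i - 1], e_words[i - 1]), 1e-12) for i in range(1, l + 1))
    translation = prod(t.get((f_words[j - 1], e_words[alignment[j - 1] - 1]), 1e-12)
                       for j in range(1, m + 1))
    distortion = prod(d.get((j, alignment[j - 1], l, m), 1e-12) for j in range(1, m + 1))
    return fertility * translation * distortion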
Page 282: Model 5 is a powerful but unwieldy ally in the battle to align translations. It must be
led to the battlefield by its weaker but more agile brethren Models 2, 3, and 4. In fact, this is the
raison d'être of these models. To keep them aware of the lay of the land, we adjust their
parameters as we carry out iterations of the EM algorithm for Model 5. That is, we collect counts
for Models 2, 3, and 4 by summing over alignments as determined by the abbreviated S
described above, using Model 5 to compute Pr(a | e, f). Although this appears to increase the
storage necessary for maintaining counts as we proceed through the training data, the extra
burden is small because the overwhelming majority of the storage is devoted to counts for t(f | e),
and these are the same for Models 2, 3, 4, and 5.
Page 283 shows the number of translations processed, which runs into the millions, used to
determine a small subset of useful words. The EM algorithm is used with maximum likelihood.
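The EM procedure is easiest to see for Model 1. The following is my own minimal sketch of one iteration (not the authors' code): the E-step spreads each French word's unit of count over the English words in proportion to the current t(f | e), and the M-step renormalizes so the probabilities sum to one for each English word.

from collections import defaultdict

def model1_em_iteration(bitext, t):
    # bitext: list of (english_words, french_words) sentence pairs.
    # t: dict mapping (f, e) -> current t(f|e), e.g. initialized uniformly
    #    over all co-occurring pairs.
    count = defaultdict(float)   # expected count of each (f, e) link
    total = defaultdict(float)   # normalizer for each English word e
    for e_words, f_words in bitext:
        for f in f_words:
            z = sum(t[(f, e)] for e in e_words)          # E-step normalizer
            for e in e_words:
                c = t[(f, e)] / z
                count[(f, e)] += c
                total[e] += c
    # M-step: t(f|e) = count(f, e) / sum over f of count(f, e)
    return {(f, e): c / total[e] for (f, e), c in count.items()}

Repeating this pass over the bitext is the whole of Model 1 training; the later models take these estimates as their starting point.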
Pg 283: Although the entire t array has 2,437,020,096 entries, and we need to store it twice, once
as probabilities and once as counts, it is clear from the preceding remarks that we need never
deal with more than about 25 million counts or about 12 million probabilities. We store these
two arrays using standard sparse matrix techniques. We keep counts as pairs of bytes, but allow
for overflow into 4 bytes if necessary. In this way, it is possible to run the training program in
less than 100 megabytes of memory. While this number would have seemed extravag…
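The two-bytes-with-overflow trick in that passage can be illustrated as follows. This is only an illustration, not the authors' data structure: real EM counts are fractional, so they would have to be scaled to fixed-point integers for this to apply.

import array

class PackedCounts:
    # Sparse counter keeping most counts in 2 bytes, spilling into 4 bytes on overflow.
    def __init__(self):
        self.index = {}                  # (f, e) -> slot number
        self.small = array.array('H')    # unsigned 16-bit counts
        self.overflow = {}               # slot -> wider count when 16 bits is not enough

    def add(self, key, amount=1):
        slot = self.index.setdefault(key, len(self.small))
        if slot == len(self.small):
            self.small.append(0)
        if slot in self.overflow:
            self.overflow[slot] += amount
        elif self.small[slot] + amount <= 0xFFFF:
            self.small[slot] += amount
        else:                            # overflow into the wider store
            self.overflow[slot] = self.small[slot] + amount
            self.small[slot] = 0

    def get(self, key):
        slot = self.index.get(key)
        if slot is None:
            return 0
        return self.overflow.get(slot, self.small[slot])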
Page 293 speaks of Viterbi algorithm training:
We have already used this algorithm successfully as a part of a system to assign senses to
English and French words on the basis of the context in which they appear (…
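For Models 1 and 2 the Viterbi (most probable) alignment can be found exactly, because the alignment probability factors over French positions: each position independently picks its best English connection. A minimal sketch for Model 2 (dictionary layout and names are my own assumptions):

def viterbi_alignment_model2(e_words, f_words, t, a):
    # t: dict (f, e) -> t(f|e); a: dict (i, j, l, m) -> alignment probability a(i | j, l, m).
    # Position 0 is the null word.
    l, m = len(e_words), len(f_words)
    e_with_null = ["<null>"] + list(e_words)
    alignment = []
    for j, f in enumerate(f_words, start=1):
        best_i = max(range(l + 1),
                     key=lambda i: t.get((f, e_with_null[i]), 0.0) * a.get((i, j, l, m), 0.0))
        alignment.append(best_i)       # alignment[j-1] = i
    return alignment

For Models 3-5 this shortcut no longer works, which is why the paper resorts to hill climbing from an approximate alignment.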
Page 297 table of notation
Appendix B has a summary of the models. Note especially the Log-Likelihood Objective Function.
Note page 300 on iterative improvement.
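The objective being improved iteratively is the log-likelihood of the training corpus under the model; written generically (this notation is mine, not the appendix's),

\psi(\theta) \;=\; \sum_{s=1}^{S} \log \Pr_{\theta}\!\left(\mathbf{f}^{(s)} \mid \mathbf{e}^{(s)}\right),

and each EM iteration is guaranteed not to decrease it.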
Page 301: Parameter Reestimation Formulae: In order to apply these algorithms, we need to
solve the maximization problems of Steps 2 and 4. For the models that we consider, we can do
this explicitly.
Equation (73) is useful in computations since it involves only O(lm) arithmetic operations,
whereas the original sum over alignments (72) involves O(l^m) operations.
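The saving comes from rewriting the sum over all alignments as a product of per-position sums. Here is my own sketch of both forms for Model 1's likelihood (it illustrates the rearrangement rather than reproducing equations (72)-(73) verbatim):

from itertools import product
from math import prod

def pr_f_given_e_naive(e_words, f_words, t, epsilon=1.0):
    # Sum over all (l+1)^m alignments -- exponential, shown only for comparison.
    l, m = len(e_words), len(f_words)
    e_with_null = ["<null>"] + list(e_words)
    total = 0.0
    for alignment in product(range(l + 1), repeat=m):
        total += prod(t.get((f_words[j], e_with_null[alignment[j]]), 0.0) for j in range(m))
    return epsilon / (l + 1) ** m * total

def pr_f_given_e_fast(e_words, f_words, t, epsilon=1.0):
    # Same value via the product-of-sums rearrangement: O(l*m) operations.
    l, m = len(e_words), len(f_words)
    e_with_null = ["<null>"] + list(e_words)
    return epsilon / (l + 1) ** m * prod(
        sum(t.get((f, e), 0.0) for e in e_with_null) for f in f_words)

Both functions return the same number; only the second is usable at realistic sentence lengths.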
Other styles:
Jan Dil answer:
According to what I read on Bloomberg (Inside a Moneymaking Machine Like No Other ) and
the responses here, RenTech makes on average 41%/year since 1988 with a maximum drawdown
of -4.1% (Bloomberg) to 70%/year (reported here in one of the answers) on, say, $50 Billion,
using some 200,000 trades/day. This daily volume amounts to about 1.4% of Nasdaq’s daily
trading, which sounds reasonable to me. There were times that RenTech occupied 10% of
Nasdaq’s trading volume. They have a reservoir of stocks to trade from and which they know
how to rank (StatArb) in mean-reversion mode, and where they find the volatility to produce the
kind of risks and returns of 70%/year consistently….
The reservoir that will do that is a collection of some 1300 Wall Street stocks with daily-dollar
volumes in excess of $1 Million. The ranking system that is able to rank each quarter the 6 top
ranks from this reservoir of 1300 is called Ergodic ranking. You don’t need any breaking news,
TA, or FA. You just need the historical end-of-day data for this ranking system. CSI is our data provider
of choice, and we use Finance.Yahoo as a reference. The portfolio returns are considered as a
weighted sum of the individual asset returns. The weighting system is Kelly’s system where
portfolio weightings are computed each quarter so as to maximize annual returns. You do this for
10 or more years and you assume, with Jim Simons, that past performance is your best predictor
of success.
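The answer does not say how the quarterly Kelly weightings are actually computed, so the following is only a textbook sketch, not Jan Dil's or RenTech's procedure: under a Gaussian approximation of returns, the growth-optimal (Kelly) weights are the inverse covariance matrix applied to the mean excess returns.

import numpy as np

def kelly_weights(excess_returns, fraction=1.0):
    # excess_returns: array of shape (num_periods, num_assets).
    # Growth-optimal weights under a Gaussian approximation: w* = Sigma^-1 mu.
    # fraction < 1 gives the usual fractional-Kelly risk reduction.
    mu = excess_returns.mean(axis=0)              # per-asset mean excess return
    sigma = np.cov(excess_returns, rowvar=False)  # covariance matrix of returns
    return fraction * np.linalg.solve(sigma, mu)

# Re-estimated each quarter, e.g.: weights = kelly_weights(trailing_daily_returns, 0.5)
# (trailing_daily_returns is a hypothetical array holding the past quarter's daily returns).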
Other weighting systems like factoring are possible too. As in signal and RADAR processing,
you may assume a propagation model and/or a probability density function to hold. In addition to
the portfolio weightings, factoring, propagation modelling, and probability density functions just
add fitting parameters that need to be computed. Computing these parameters implies additional
assumptions and fictitious information. With an increasing number of parameters, the CPU load
usually increases quadratically, as do the chances of overfitting. Ergodic ranking reduces this to a
linear dependence, but this too implies an extra assumption that needs to be tested. The proof is
always in the pudding.
https://www.quora.com/What-are-the-investment-strategies-of-James-Simons-Renaissance-
Technologies-I-understand-he-employs-complex-mathematical-models-along-with-statistical-
analyses-to-predict-non-equilibrium-changes
Note
https://quantlabs.net/blog/2019/02/c-source-code-and-research-papers-from-renaissance-
technologies/
See the latest of this C++ metaprogramming library:
https://github.com/tjolsen/tmpl
Open-source project from RenTech in 2008:
https://github.com/silpol/mrsync
https://nypost.com/2017/06/21/regulators-probing-legendary-hedge-funds-secret-trading-code/
https://whalewisdomalpha.com/renaissance-technologies-13f-strategy/
https://news.efinancialcareers.com/ca-en/3002461/pay-renaissance-technologies
Sample holdings
https://www.sec.gov/Archives/edgar/data/1037389/000103738910000308/0001037389-10-
000308.txt
Rentech software used
https://quant.stackexchange.com/questions/30509/what-is-advent-softwares-geneva
https://www.forexfactory.com/printthread.php?t=434829&pp=40&page=41
Howard Morgan, President, Renaissance Technologies Corp.,
"The Microcomputer and Decision Support," Computerworld,
Aug. 19, 1985, pp. 39-45.
https://apps.dtic.mil/dtic/tr/fulltext/u2/a217408.pdf
https://news.ycombinator.com/item?id=16649002
In other words, they're a step above traditional "fundamental" hedge funds, but they focus on the
wrong problem (but not for lack of trying!). In contrast, the truly successful quant funds have
automated the data processing and feature extraction pipeline end to end. The data is a pure
abstraction to them. They don't bother with forming hypotheses and trying to find data to test
them, they allow their algorithms to actively discover new correlations from the ground up. So
many quantitative funds advertise how much data they work with, and how they have all these
exotic sources of data at their disposal...but the data does not matter. The models for the data do
not matter. The mathematics of efficiently processing that data are what matters….
In most cases, a trading strategy is sufficiently multidimensional that any particular set of data
can be completely public. Exclusive data is helpful, but not required. In many cases people
become too dependent on exclusive data and lose sight of the methodology.
https://news.efinancialcareers.com/uk-en/298218/renaissance-technologies-secrets-to-quant-
hedge-funds-vc-career-success
https://www.fxleaders.com/forex-signals/forex-signals-articles/algo-trading-rentec/
https://www.afr.com/technology/inside-the-medallion-fund-a-74-billion-moneymaking-machine-
like-no-other-20161122-gsuohh
Peter Brown and Robert Mercer audio speech in 2013:
https://cs.jhu.edu/~post/bitext/
Old Jim Simons interview from 2000
https://www.institutionalinvestor.com/article/b151340bp779jn/the-secret-world-of-jim-
simons#.WC87Y7IrIuU