EPO Opposition On GEVC '716
EPO Opposition On GEVC '716
Rechtsanwaltsgesellschaft mbH
Europäisches Patentamt
München
80298 München Düsseldorf
Elisenstraße 3
80335 München
T +49 89 7472660
F +49 89 776424
H www.maiwald.eu
Geschäftsführer
(Board of Directors)
SQL:SQL
Table of contents
2
A. Introduction and Formalities
1. Requests
(a) Article 100(a) EPC because the subject matter of the Patent lacks
novelty contrary to Articles 52(1), 54 EPC and/or lacks inventive
step contrary to Articles 52(1), 56 EPC; and
(b) Article 100(b) EPC because the subject matter of the Patent lacks
sufficiency of disclosure contrary to Articles 83 EPC; and
(c) Article 100(c) EPC because the subject matter of the Patent
extends beyond the content of the European Application as
originally filed contrary to Article 123(2) EPC.
3 In the event that the Opposition Division is not in a position to revoke the
Patent in its entirety (see request (1) above), Oral Proceedings pursuant
to Article 116 EPC are hereby requested.
3
Compression", PROCEEDINGS OF THE
2007 IEEE INTERNATIONAL SYMPOSIUM
ON CIRCUITS AND SYSTEMS (ICASP
2007), IEEE, Piscataway, NJ, US,
(20070501), ISBN 978-1-4244-0920-4, pages
3147 - 3150, XP031181972 (“Zhang”)
D4 US Pat. Pub. 2008/0219578 to Lee (“Lee”)
D5 US Pat. Pub. 2008/0310745 to Ye et al.(“Ye”)
D6 “The Quadtree and Related Hierarchical Data
Structures”, by Hanan Samet, ACM
Computing Surveys, June 1984
Vol. 16, No. 2, June 1984 (“Computing
Surveys”)
D7 IEEE Transactions on Circuits and Systems
For Video Technology, Context-Based
Adaptive Binary Arithmetic Coding in the
H.264/AVC Video Compression Standard, by
Marpe et al. (2003) (“IEEE CABAC”)
D8 PENNEBAKER W B ET AL, "An overview of
the basic principles of the Q-Coder adaptive
binary arithmetic coder", IBM JOURNAL OF
RESEARCH AND DEVELOPMENT, IBM
Corporation, New York, NY, US, (198811),
vol. 32, no. 6, ISSN 0018-8646, pages 717 –
726 (“Pennebaker”)
D9 Shi et al., Image and Video Compression for
Multimedia Engineering, CRC Press (2000)
(“Shi”)
D10 Very Low Complexity MPEG-2 to H.264
Transcoding Using Machine Learning, by
Fernandez et al. (2006) (“H.264 Transcoding”)
D11 BRADY N ET AL, "Context-based Arithmetic
Encoding of 2D Shape Sequences",
PROCEEDINGS OF THE IEEE
INTERNATIONAL CONFERENCE ON IMAGE
PROCESSING (ICIP 1997), IEEE, Los
Alamitos, CA, US, (19971026), vol. 1,
doi:10.1109/ICIP.1997.647376, ISBN 978-0-
8186-8183-7, pages 29 - 32, XP010254100
[A] 1-9 * abstract * * page 30, column l, line 8 -
column r, line 6 * (“Brady”)
D12 Video Coding Basics Presentation by Wang
(2003) (“Video Coding Basics”)
D13 IEEE Transactions on Circuits and Systems
For Video Technology, Overview of the
H.264/AVC Video Coding Standard, by
Wiegand et al. (2003) (“IEEE H.264”)
D14 A Context Modeling Algorithm And Its
Application In Video Compression by Mrak et
al. (2003) (“Context Modeling”)
4
D15 Adaptive Scanning for H.264/AVC Intra
Coding, by Lee et al. (2006) (“Adaptive
Scanning”)
D16 Introduction to Statistics by David M. Lane
(2007)
D17 Declaration of Dr. Andrew B. Lippman
Annex M1.
6 Each of the documents listed above was published before the earliest
priority date of the Patent, which is 13.04.2010.
8 Since the vast majority of the cited documents was available to the public
prior to the claimed priority date, the issue of entitlement to priority is not
discussed in detail herein below.
11 The Opponent herewith reserves the right to refer to other sections of the
prior art documents if necessary later in the proceedings.
5
B. European Patent EP 3 410 716 B1
1. Formalities
14 The Patent, EP 3 410 716 B1, derives from European Patent Application
Number 18185240.1 (hereinafter “the European Application”) which has
a filing date of 11 April 2011.
16 The Patent claims the priority date of 13.04.2010, which are the filing
dates of Patent Applications PCT/EP2010/054822 and EP 10159766
(hereinafter “the Priority Applications”).
19 Such coding may, for example, be used in picture and video coding, for
example.
6
“extracting a significance map indicating positions of significant
transform coefficients within the transform coefficient block, and
then the values of the significant transform coefficients within the
transform coefficient block from a data stream, with, in extracting the
significance map, sequentially extracting first-type syntax
elements from the data stream by context-adaptive entropy
decoding, the first-type syntax elements indicating, for
associated positions within the transform coefficient block as
to whether at the respective position a significant or
insignificant transform coefficient is situated.”
23 Thus, the basic concept is to select a scan order that encounters the
significant coefficients early in the scan. The Patent describes the
purported advantages of this adaptive scan order at [0051] et seq. of the
Patent. In particular, the Patent therein in details explains, emphasis
added:
7
coefficient levels equal to zero have to be coded before the
last non-zero transform coefficient value is reached. This can
be avoided if the diagonal sub-scans are started at the side,
where the significant transform coefficient levels are
concentrated.”
26
27 This reverse zig-zag scan order for the sub-blocks may be selected, for
example, if there is a predicted concentration of transform coefficients in
the lower right quadrant of each sub-block.
8
near the beginning of the scan order, see in this regard for instance cited
document D7 at page 10, “zig-zag scanning pattern”, left column.
3. Technological Background
9
35 Therefore, video coding techniques are used to compress (or encode)
video files for more manageable storage and transmission. Such
compression is achieved by removing redundancy in and between
frames, thereby reducing the amount of information necessary to transmit
and reconstruct each frame.
3.1 Partitioning
3.2 Prediction
41 The predicted samples are then subtracted from the samples of the
current block, resulting in a difference signal (i.e., a residual). This
10
residual only contains information about the differences between the
images in the current block and the predictive reference block.
42 The residual contains significantly less information than the block being
encoded, reducing the amount of data to be compressed and transmitted.
Alternatively, regions that cannot be predicted with good efficiency are
coded using intraframe prediction.
46 Typically, the result is a block in which most or all of the coefficients are
zero, with a few non-zero coefficients, as displayed by a generic block
below:
47 Setting QP to a high value means that more coefficients are set to zero,
resulting in high compression at the expense of poor quality of the
decoded image.
11
48 Setting QP to a low value means that more non-zero coefficients remain
after quantization, resulting in better decoded image quality but lower
compression.
50 A significance map is a matrix of the same size as the transform unit, with
each entry being a coefficient flag corresponding to a quantized transform
coefficient in the transform unit, see page 11, left column of D7.
12
54 Each coefficient flag is either a “1” if it corresponds to a significant
coefficient or a “0” if it corresponds to an insignificant coefficient. A
significance map is illustrated below (from the “Lee” reference discussed
herein, e.g. Fig. 3A and 3B of D4):
55 During entropy coding, both the value of the transform coefficients that
result from the above-noted prediction, transform, and quantization
process as well as their significance map are encoded to generate the
bitstream.
13
IEEE H.264 at 568 (D13).
64 According to the selected context, values that appear more frequently are
encoded using the smallest code words to represent the information, and
values that appear less frequently can be encoded with larger code words
to represent the less frequently appearing information.
14
65 By using a context (or probability model) as a guide, the total length of
code words is reduced, and compression is further enhanced. As
suggested by the name, CABAC adapts the probability model used based
on information from nearby elements to optimize the probability estimate.
66 The person skilled in the art would have been a person having ordinary
skill in the art and would be a person having, as of April 13, 2010 at least
an undergraduate degree in computer science, computer engineering,
electrical engineering, or a similar technical field.
67 The person skilled in the art would have further gained a working
knowledge of video coding techniques; and two or more years of
experience (or with a graduate degree in the above-stated fields, one or
more years of experience) in analysis, design, or development related to
video encoding, with additional education substituting for experience and
vice-versa.
15
each of blocks (40) of the sample array, and reconstruct the
reconstructed version by performing an inverse transform on
each of the transform coefficient blocks,
16
3.1 Encoder for encoding an array of information samples into a data
stream in units of blocks, configured to encode blocks (40) of the
array of information samples by
3.6 the first-type syntax elements being sequentially coed into the
data stream in a predetermined scan order among the positions
of the transform coefficient block,
Annex M2.
17
C. Grounds for Opposition
I. Added subject matter – Articles 100 (c), 123 (2) EPC and Articles
100 (c), 76 EPC
73 Granted claim 1 differs from the originally filed claims in that granted claim
1 of the Patent does not contain the features regarding an associator as
disclosed in the originally filed and published claim 1 of the Patent
application.
74 Originally filed claim 1 of the Patent application further specifies that this
associator is sequentially associating the sequentially extracted first-type
syntax elements to the positions of the transform coefficient block in a
predetermined scan order among the positions of the transform coefficient
block.
76 At least for these reasons, granted claim 1 extends beyond the content of
the application as originally filed, contrary to Article 123(2) EPC.
18
II. Lack of novelty – Articles 100 (a), 52 EPC
81 The subject matter of each of claims 1 to 9 of the Patent does not meet
the requirement of novelty over numerous prior art references, as
discussed in detail in the following.
85 Significance maps for decoding and encoding of data streams are also
known from for instance cited document D2, see for instance cited
document D2, page 609, section 5.15 discussing the CREW method,
corresponding to feature 1.3 of claim 1 of the Patent.
87 With regard to feature 1.5 and feature 1.6 of claim 1 of the Patent, it is
submitted that cited document D2 also discusses raster order scan on
pages 31, 259, 261, 275, 352, 395, 411, 416, 425–427, 437–439, 656,
663, 676.
88 With regard to feature 1.7 of the claim 1 of the Patent it is submitted that
cited document D2 discusses in Section 4.11 EIDAC, a method for
19
compressing simple images. Its main goal is the use of two-part contexts.
The intra context of a pixel P consists of several of its near neighbors in
its bitplane. The inter context of P is made up of pixels that tend to be
correlated with P even though they are located in different bitplanes.
96 Therefore, any one of method claims 5 and 6 lacks novelty over cited
document D2.
20
“Computer readable digital storage medium having stored thereon
a computer program having a program code for performing, when
running on a computer, a method according to claims 5 or 6.
99 Therefore, any one of claims 7 and 9 lacks novelty over cited document
D2.
100 The subject matter of each of claims 1 to 9 of the Patent does not involve
inventive step over numerous prior art references, as discussed in detail
in the following.
102 To the extent of features 1.1, 1.1, 1.2, 1.3, and 1.4, cited document D4,
“Lee” is analogous art to the Patent. The field of endeavor of the Patent
relates to adaptive coding of significance maps and transform coefficient
blocks for use in picture and video coding. See, e.g., the Patent at
paragraph [0001], Claim 1.
103 Like the Patent, D4 (“Lee”) also relates to and describes adaptive coding
and decoding of significance maps and transform coefficients for video
coding, see D4 (“Lee”) at [0003]; [0027]; see also Lippman Decl. D17 at
¶ 63.
21
achievement of better coding efficiency.” See the Patent at paragraph
[0011].
105 Thus, the inventors sought to address the problem of increasing coding
efficiency. cited document D4 (“Lee”) highlights the same problem of the
need to increase coding efficiency by stating that “a method of more
efficiently performing context modeling is needed.” See cited document
D4 (“Lee”) at [0026]; see also Lippman Declaration D17 at ¶ 62. Both Lee
and the patent are directed to solutions for increasing coding efficiency
using adaptive coding techniques. Lippman Declaration D17 at ¶¶61-63.
106 Cited document D5, (“Ye”), was filed on January 4, 2008 and claims
priority to provisional application No. 60/979762, filed on October 12,
2007. Cited document D5, (“Ye”), was published on December 18, 2008
as US Pat. Pub. 2008/0310745. Accordingly, cited document D5, (“Ye”),
qualifies as prior art as to the Patent.
107 Cited document D5, (“Ye”), is analogous art to the Patent and discusses
the same technical field of video coding. Like the Patent, cited document
D5, (“Ye”) also relates to and describes adaptive coefficient scanning for
the purposes of video compression.
108 See cited document D5, (“Ye”), at Summary, paragraph [0012]; see also
Lippman Decl. (D17) at ¶ 64.
109 Further, cited document D5, (“Ye”) is reasonably pertinent to at least one
problem with which the inventors of the Patent were concerned. For
example, Ye provides for techniques that utilize “thresholds and threshold
adjustments that can reduce the frequency that the scanning order
adjustments occur, yet still achieve desired improvements in compression
due to such scanning order adjustments.” cited document D5, (“Ye”), at
Summary, paragraph [0012].
110 As such, both cited document D5 (“Ye”) and the Patent are aimed at
increasing the coding efficiency of videos by utilizing adaptive scanning
techniques, allowing a reduction of the number of code bits used to
transmit the significance maps. Lippman Decl. (D17) at ¶¶ 61-62, 64.
111 As further explained below, a person skilled in the art would have been
motivated to combine cited document D4 (“Lee”) and cited document D5
(“Ye”) because cited document D5 (“Ye”) expressly considers the
traditional scan orders in cited document D4 (“Lee”), i.e. zig-zag scanning,
and improves on them.
22
112 Cited document D4 (“Lee”) taught the use of conventional zig-zag
scanning for scanning the coefficients within a sub-block. See e.g., cited
document D4 (“Lee”), at Fig. 7.
114 Similarly, it was well-known before the priority date of the Patent that
grouping zero value and non-zero value coefficients together in the scan
order increases coding efficiency as described in the background section
above.
115 The differing features of claim 1 of the Patent with regard to the cited
documents could be summarized as in that in the prior art the scan order
is not predetermined.
117 For the person skilled in the art the solution to this problem would be
obvious starting for instance with cited document D5.
118 Regarding a predetermined scan order the person skilled in the art would
have considered the adaptive scan orders of D5.
119 Further, a person skilled in the art would have known the teachings of
cited document D5 (“Ye”) to incorporate this beneficial technique into the
system of cited document D4 (“Lee”) so that this well-known benefit could
be achieved in situations where significant coefficients were not
concentrated at the locations within blocks that are quickly reached and
grouped by conventional zig-zag scanning, as expressly described in
cited document D5 (“Ye”). Lippman Decl. D17 at ¶¶ 75-80.
120 Further, applying the adaptive scan order selection technique of cited
document D5 (“Ye”) to the coding method of cited document D4 (“Lee”)
would have been the combination of known prior art elements according
to known methods.
121 As a consequence, a person skilled in the art would have yielded the
predictable result of grouping together the zero and non-zero value
coefficients in a scan order to achieve the improved coding efficiency
23
described in cited document D5 (“Ye”). This is because scan orders, and
adjustments to scan orders, were known to take advantage of
concentrations of zero and non-zero coefficients in a transform block,
which in turn improves coding efficiency.
122 Such a combination would have been routine to implement and would not
have required undue experimentation because techniques involving the
use of various scan orders, and specifically adaptive selection of scan
orders, are implemented in software and would not have required any
changes to the system architecture or hardware.
123 Thus, a person skilled in the art with even minimal experience in coding
could implement the required changes with reasonable efforts.
124 Furthermore, because such techniques were well-known at the time of the
Patent, the coding of such a change would not have required undue
experimentation and would have required little more than the
implementation of already existing routines, providing a high expectation
of success.
125 Therefore, claim 1 lacks inventive step over cited document D4 (“Lee”),
for multiple reasons as stated above, taken alone, or over cited document
D4 (“Lee”) in combination with cited document D5 (“Ye”).
127 Therefore, the Patent is to be revoked in its entirety under Articles 100(a),
54, 56 EPC.
128 Further, also cited document D3 could be considered for CABAC, entropy
coding, context modeling, and binary arithmetic coding, could be derived
from cited document D3, see page 1 and 2 of D3, see Fig. 1 of D3.
129 Thus, at least features 1.1, 1.2, 1.3, 1.4 could be derived from D3 by the
person skilled in the art. Context-based Arithmetic Encoding of 2D Shape
Sequences is also motivated by cited document D11, see abstract of D11.
131 With regard to feature 1.5 and the scan order, such scan orders are known
for the person skilled in the art for instance, zig-zag scanning as scan
24
order, see in this regard for instance cited document D7 at page 10, “zig-
zag scanning pattern”, left column.
132 Cited document D8 provides hints for the person skilled in the art to
implement adaptive binary arithmetic coder as the in feature 1.6 of claim
1 of the Patent claimed context adaptively entropy decoding. See also
cited document 9, entropy of an information source, on page 45 of D9.
133 See also cited document D12 on page 16, D13 in particular the abstract
of D13, and D14, in particular introduction of D14, in this context.
135 This is because granted claim 3 of the Patent directed to the encoder
corresponds to granted claim 1 of the Patent directed to the decoder as
far as the features 3.1 to 3.8 are concerned. In other words, these features
3.1 to 3.8 of claim 3 of the Patent clearly correspond to features 1.1 to 1.7
of claim 1 of the Patent.
25
140 Therefore, any one of method claims 5 and 6 lacks inventive step over
cited document D4 (“Lee”), for multiple reasons as stated above, taken
alone, or in combination with cited document D5 (“Ye”).
26
145 Therefore, any one of claims 7 and 9 lacks inventive step over cited
document D4 (“Lee”), for multiple reasons as stated above, taken alone,
or in combination with cited document D5 (“Ye”).
146 All the dependent claims 2, 4, 8 of the Patent, all relating to a depth map
content, are anticipated either by cited document D4 (“Lee”) taken alone
or by cited document D4 (“Lee”) in combination with cited document D5
(“Ye”), as discussed in the following.
1.5.1 Claim 2
[0006]
149 spatial information signals are processed. Claim 2 of the Patent defines
that the transform coefficient block relates to a depth map content.
However, the description of the Patent defines in paragraph [0011] of the
Patent that in general sample arrays are pictures or sample arrays
representing other spatially sampled information signals such as depth
maps or the like. Thus, a depth map is nothing else than spatially sampled
information signals.
150 Therefore, claim 2 lacks inventive step over cited document D5 (“Ye”).
27
enabled to carry out the invention based on the disclosure of the European
patent application documents of the Patent (see also T 487/89, T 297/90,
T 541/97).
152 In particular, the objected features of granted claim 1 of the Patent recite
as follows:
153 The Patent is directed to techniques for encoding and decoding digital
video data.
155 However, claim 1 is silent on the use of contexts which are individually
selected for each of the significance map syntax elements depending on
a number of positions at which according to the previously extracted and
associated significance map syntax elements significant transform
coefficients are situated.
156 In T 409/91 (OJ 1994, 653; ex parte) and T 435/91 (OJ 1995, 188; inter
partes) it was pointed out that the protection conferred by a patent should
correspond to the technical contribution to the art made by the disclosure
of the invention described therein, which excludes the patent monopoly
being extended to subject-matter which, after reading the patent
specification, would still not be at the disposal of the skilled person.
28
158 The available information in paragraph [0069] of the Patent is too specific
and does not enable the skilled person to achieve the envisaged result
within the whole ambit of the claim containing the respective functional
definition without undue difficulty.
159 The technical teaching of paragraph [0069] of the Patent fails to provide
a fully self-sufficient technical concept as to how sequentially extracting
first-type syntax elements from the data stream by context-adaptive
entropy decoding as such is to be achieved.
160 Thus, in summary, the subject matter of each of the claims 1 and 3 and
corresponding methods claims 5 and 6 of the Patent fails to meet the
requirements of Article 100(b) EPC or Article 83 EPC, respectively.
161 It has been shown hereinabove that the claims as granted contain subject
matter extending beyond the content of application as originally filed as
well as the subject-matter of the European patent extends beyond the
content of the parent application as filed.
162 Therefore, the Patent is to be revoked in its entirety under Articles 100(c),
123(2) EPC and under Articles 100(c), 76 EPC.
163 Each one of the independent claims lacks novelty over the prior art.
Therefore, the Patent is to be revoked in its entirety under Articles 100(a)
EPC in conjunction with Articles 52(1) and 54(1),(2) EPC (lack of novelty)
164 In addition, each of the independent claims lacks inventive step over
numerous prior art documents.
165 Therefore, the Patent is to be revoked in its entirety under Articles 100(a),
54, 56 EPC.
29
166 In addition, it has been shown hereinabove that the claims as granted
claim define an invention, which the person skilled in the art cannot carry
out based on the disclosure of the originally filed patent application
documents.
167 Therefore, the Patent is to be revoked in its entirety under Articles 100(b),
83 EPC.
electronically signed
Dr. Simon Q. Lud
Patentanwalt | European Patent Attorney
Enclosures
Annex M1 (list of references);
Annex M2 (feature analysis of claims 1 and 3);
Documents D1 to D17
30