PHD Thesis
PHD Thesis
by
Ming Dong
Doctor of Philosophy
in
Energy Systems
                                              © Ming Dong
                                                Fall 2013
                                            Edmonton, Alberta
 Permission is hereby granted to the University of Alberta Libraries to reproduce single copies of this thesis
and to lend or sell such copies for private, scholarly or scientific research purposes only. Where the thesis is
converted to, or otherwise made available in digital form, the University of Alberta will advise potential users
                                          of the thesis of these terms.
 The author reserves all other publication and other rights in association with the copyright in the thesis and,
  except as herein before provided, neither the thesis nor any substantial portion thereof may be printed or
    otherwise reproduced in any material form whatsoever without the author's prior written permission.
                                Abstract
technique that can extract detailed sub-load information from compound load
residential house into specific sub-load levels such as different home appliances
by using only the aggregated metering data of the compound load. Through load
decomposition, users can better understand the usage patterns of individual loads
or load groups and therefore decide on how to save energy. On the utility side,
load decomposition can be very helpful for load forecast, demand response
In the past, traditional methods are either too costly or inaccurate. Therefore,
can identify and track major sub-loads based on only the total signal collected
from the meter-side with acceptable error. Recently, the vast deployment of smart
meters has raised considerable interests in this approach. However, many critical
problems still need to be solved before it truly becomes technically available for
ordinary end-users.
To solve the above problems, at the beginning, this thesis presents a novel
NILM method based on event detection and load signature studies. The key idea
is to model the entire operating cycle of a load and make identification based on
event-window candidates. The proposed technique makes NILM more applicable
for complex loads, more robust for load inventory change and can also simplify
the training process; on the other hand, the thesis addresses a new and critical
which can automatically extract load signatures by using the meter-side data and
requires almost zero effort from users. This thesis also discusses how to estimate
the energy for several key components in a residential house such as the NILM
price.
                      Acknowledgement
Chapter 1
Introduction
   This chapter clarifies several basic questions regarding the thesis’s research
subject: decomposition techniques for power system load analysis. Firstly, the
background and importance of this research are introduced. Secondly, the
research problem is properly defined. Thirdly, three existing research directions
that may solve the defined problem are individually reviewed by investigating the
literature details. Finally, this chapter discusses the scope of the thesis and
presents the thesis outline.
1.1 Background
   For residential customers, a critical link to address the above need is smart
meter. According to [2], smart meter is defined as an electric meter that records
the consumption of electric energy in very short intervals such as an hour or less
and communicates that information at least daily back to the utility for monitoring
and billing purposes. The above two main features of smart meter enable a deeper
and clearer view of home electricity usage by the end-users. However, the smart
meters currently available in the market can provide only the compound load
                                                                                 1
Chapter 1
information or the information about an entire residential house. They cannot tell
which appliance loads in the household consume the most energy or are least
efficient. If households can also understand their usage patterns of concrete
appliances, the following benefits can be achieved:
   Besides, even with flat electric pricing, breaking compound load down to
individual component level can still help customers control and save their energy
usage better. For example, customers can compare the efficiency of a certain load
with his neighbors in the same geographic area. Also, they will naturally pay
more attention to heavy power consumers after identifying them. A preliminary
study from a pilot program between IBM and the City of Dubuque, Iowa has
indicated strong engagement by residents and energy savings of up to 11% by
making comparison among residential profiles [20].
   For the utility side, understanding the decomposed and detailed usage patterns
can be very helpful for load forecast, load settlement and other load studies [10].
Also, it can be used for electricity rate design and residential demand response
                                                                                  2
Chapter 1
Active Power
10000
                   8000
 Power(W)
6000
4000
2000
                     0
                      0   1   2   3   4   5   6    7   8    9   10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25
                                                                         Hours(h)
Reactive Power
1500
day1000
     is shown. The compound power is acquired from the meter-side, and as can
be seen,
    500
         all different appliances’ powers aggregate together. This is because for
any circuit,
     0
             power is physically additive regardless of the individual component
                      0   1   2   3   4   5   6    7   8    9   10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25
being resistive, inductive or capacitive Hours(h)
                                         [98].The problem to be solved can be
stated as follows: individual loads’ signals aggregate at the entry point of a house
as P (t ) and the intent of this research is to do the reverse or to decode the overall
signal into various components Pi (t ) that are attributed to specific loads
(appliances) i.
                   Assuming each load has its specific operation state, the above problem is
equivalent to the following equiation:
                                      P(t )  P1 (t )  P2 (t ) .  Pn (t )
                                                                                                                        (1.2)
                                               f1 ( S1 (t ), p1 )  f 2 ( S2 (t ), p2 )  ...  f n ( S n (t ), pn )
                                                                                                                           3
Chapter 1
where Sn (t ) is the operation state of individual load n and pn is the power drawn
from a certain state of load n. Equation (1.2) actually implies the problem of load
decomposition can be converted to the identification of activities or states of
individual loads based on the aggregated signal. For example, an incandescent
light bulb has only two states: ON and OFF. For a given instant, once its state is
known, the instant power the bulb is drawing can also be determined. This is a
very important conclusion because many previous methods that will be reviewed
in this chapter are actually based on equation (1.2).
where t0 and t1 are the beginning and ending time of a specific time period and
  Equation (1.3) is especially suitable when dealing with certain load groups or
long-term energy components such as the energy consumed by all the in-home
lights in a day and the stand-by energy. In these cases, the instant power of the
sub-load is not of interest whereas the energy consumed during this period as a
whole is of more concern.
   This research direction aims to provide the estimate of sub-level loads based
on statistical samples and certain given parameters of compound loads. The
common procedure is as follows: As prior knowledge, statistical data such as
house occupant behavior, demographic characteristic, proclivity characteristic and
appliance power characteristics have already been collected from extensive
surveys and measurements done by previous organizations and researchers [21]-
[30]. Secondly, for a new compound load of interest, specific parameters such as
the number and type of occupants and building structure information are also
                                                                                           4
Chapter 1
given as calibration inputs. Then, the above two parts of information are both fed
into a simulation module. Finally, its decomposed energy information can be
estimated [31]-[35].
   [23] is the UK 2000 Time Use Survey which collected 20,981 1-day diaries
recorded at a 10-min resolution. 19,898 of them were considered qualified for
statistical analysis. This survey includes detailed questions regarding how people
spend their time at home. For example, for home cooking category, the related
questions involve “unspecified food management”, “food preparation”, “baking”,
“dishwashing”, “preserving” and “other unspecified food management”. Based on
this survey, researchers constructed high-resolution daily activity profiles [31] and
these profiles for different appliances can be downloaded online [24]. One
example is shown in Figure 1.2.
                                                                                    5
Chapter 1
   Besides statistical time use profiles of loads, other statistical data such as
appliance duration characteristics can be found or derived from Canadian Center
for Housing Technology [25] and standard appliance test methods of the
Canadian Standards Association [28]-[29] .Appliance power characteristics can
be obtained from available measurement data [30],[102].
   After statistical data are obtained as inputs, concrete compound load inputs are
also provided to “calibrate” the statistical models. For example, in [31]-[33],
inputs such as the number and type of household members, availability of
household member during a day, appliance ownership are taken into consideration.
   There are already a few commercial programs based on the above procedure.
For example, DOE-2 is a widely used building energy simulation analysis
program [34]. Its flowchart is illustrated as Figure 1.3. Its library is a huge
database which stores information of specific sub-loads, compound loads and
relevant statistical parameters such as regions, weathers, building structure,
occupant behavior and even materials of enclosures. With user input, specific
building descriptions of compound load can be generated. Along with statistical
weather data, complicated and dynamic simulations can be performed. This
program is especially useful for energy estimation of heating, ventilating, and air-
conditioning (HVAC) equipment. Finally, the output of different sub-loads and
other detailed energy predictions can be obtained.
                                                                                  6
Chapter 1
   From the above examples, the advantages and disadvantages of the method
based on survey data can be seen:
                                                                                 7
Chapter 1
                                                                                     8
Chapter 1
However, such appliances are not commonly seen in the market yet at present. In
addition, the direct measurement based method can also be applied to specific
loads such as HVAC and VFD in commercial and industry buildings.
  [50]-[51] are also “indirect” sensors but they measure only the state transitions
or ON/OFF changes of appliances. To estimate the energy, the state recordings
need to be associated with the meter-side power data. The working principle can
be explained by using formula (1.2). [3] uses a clamp based magnetic
sensor ,which can be clamped around the supply cable of an appliance. Although
the summation of the cable’s flow-in and flow-out currents is equal to zero, after
signal amplification, a minor magnetic difference can still be observed. In [51],
                                                                                 9
Chapter 1
the concept “binary sensor” is proposed and it can be used to detect ON/OFF
states of appliances.
   Unlike the measurement based method, a NILM system uses only the
information acquired from the main breaker level or meter-side. This system is a
viable alternative to the Home Automation Network.
   Recently, with the fast development and vast deployment of smart meters,
more research attention has been drawn into this area. Smart meter’s high-
resolution data acquisition capability, communication capability along with its
computation capability [52]-[54] can provide sufficient support for the
implementation of NILM in residential houses. On the other hand, NILM can add
                                                                                10
Chapter 1
great intelligence and value to the smart meters and make smart meters a truly
“smart” solution for residential energy management.
                          Aggregated
                        meter-side signal
                      Decomposed signal of
                  individual loads or load groups
Output
Applications
   Generally, all the NILM studies can be divided into the categories shown in
Figure 1.6. Detailed literature reviews will be presented in the overview sections
of relevant chapters. Here, a rough division is given:
                                                                                 11
Chapter 1
                                              NILM
                                            Researches
  Compared with the survey based method and the measurement based method,
the characteristics of NILM based method are as follows:
                                                                                  12
Chapter 1
           High accuracy: the accuracy may be lower than that from the direct
            measurement based methods but is much higher than that from the
            survey based method and indirect measurement based methods.
   Overall, the NILM based method provides good balance between cost and
accuracy and is therefore the most promising load decomposition technique so far.
   The purpose of this thesis research is to solve the remaining but critical
challenges and limitations related to the existing NILM based methods:
                                                                                13
Chapter 1
   The thesis is organized to present different studies to tackle the above listed
problems. The outline is below:
                                                                                 14
Chapter 1
                                                                                  15
Chapter 2
Chapter 2
Event detection
   In addition, this chapter also addresses the special problems of event detection-
--double-phase event detection, adjacent event handling and event overlap. Some
contents in this chapter have been submitted as publication [103].
2.1 Overview
             Power/w
               900
800
700
600
500
10:30 11:00 11:30 12:00 12:30 13:00 13:30 14:00 14:30 15:00 15:30
                                                         Time/Clock
            Figure 2.1: Real-time power data acquired from meter-side
                                                                                                          16
Chapter 2
                                                                                  17
Chapter 2
exceeded according to the following equation (Pi , Pi+1 and Qi , Qi+1 are successive
samples). The process is shown in Figure 2.2.
                                                                            | ( Pi  Pi 1 ) / 2  Pi 3 | dPtres
                                                                            
                                                                            | (Qi  Qi 1 ) / 2  Qi 3 | dQtres
                                                                                                                                                                         (2.1)
                                                                            | Pi  4  Pi 3 | dPtol
                                                                            | Qi  4  Qi 3 | dQtol
   The above step-change methods can deal with simple events. However, there
are some remaining challenges that need to be coped with.
1000
                      800
       Power(W)
600
400
200
                             0
                                                      2.295                        2.3                          2.305                  2.31                2.315
                                                                                                Time(sec)                                                      x 10
                                                                                                                                                                      4
2000
                             1500
                  Power(W)
1000
500
                                 0
                                            3.143                   3.144                   3.145                       3.146          3.147           3.148
                                                                                                    Time(sec)                                             x 10
                                                                                                                                                                 4
450
400
                       350
           Power(W)
300
250
200
                       150
                                     5.08      5.09           5.1           5.11         5.12          5.13             5.14    5.15     5.16   5.17           5.18
                                                                                                    Time(sec)                                                        4
                                                                                                                                                          x 10
                                                                                                                                                                            18
Chapter 2
   As can be seen from Figure 2.3, (a) is a spike-type event that can be often seen
when a motor device is started. This event is composed of a sharp positive step
change and a smaller negative step change that follows right after the positive
change. The traditional step-change based methods may consider this event as two
independent ON/OFF events; (b) shows a slow ON event from a microwave.
However, it actually has two small step changes followed by a big step change.
The       traditional   method   such    as    [58]    will   discard     this   event
because | Pi  4  Pi 3 | dPtol ; (c) shows the mis-detection case---these quick step
changes are actually caused by signal noises and should not be considered as
events.
   To deal with the above challenges, this chapter proposes two novel data-
segmentation based methods. As shown in Figure 2.4, the new philosophy behind
is that instead of looking for step change like the previous methods do, the
proposed methods look for all the continuous data segments and then the data
portions between two neighboring segments are considered as the events no
matter how complicated they are. Besides, the continuous data segment should be
capable to include limited abnormal points as long as the continuity of the data
segment is not significantly broken by them.
                                                                                    19
Chapter 2
   Two methods that can detect the above defined data segments are proposed.
The data portions between two neighboring segments are considered as events.
They are explained as follows.
Output
                                                                                20
Chapter 2
   Step 1: Divide the entire dataset X into small continuous data segments {X1,
X2, X3…} with a preset minimal data length such as 5 points per segment. Since
the data is acquired every 1 second, 5 points represents 5 seconds.
   Examples of good and bad segments are shown in Figure 2.6. Each circle
represents a data point. Compared with (a), in (b) the variation of five points is
too big or in other words, its SD exceeds the predefined threshold, hence this
segment is not considered acceptable.
                                                                                21
Chapter 2
  Figure 2.7: Merge good segments together based on SD and range calculation
   For example, assume X3,X4,X5,X6, X7 segments are “good” segments and
they are continuous. If SD (Ave3, Ave4, Ave5) <2.0 then {X3,X4,X5}=LX1.
LX1 is the merged longer segment. Similarly, if SD(Ave3, Ave4, Ave5,
Ave6)>2.0, X6 is excluded from LX1 and LX1 is finalized as a whole segment
that includes X3,X4 and X5.
                                     dx x
                                                                             (2.2)
                                     dt t
   Figure 2.8 below shows the slope xn for a specific data period. Here
xn  xn  1  xn .Power curve is shown in dash line and the slope line below it
shows the corresponding the slope values of each point.
                                                                                 22
Chapter 2
            2000
                                                                                                        Slope
                                                                                                        Power
            1500
            1000
  Powe(W)
            500
                                       Data segment                 Data segment
-500
                    Event                                                                Event
                   2.6      2.65             2.7                2.75               2.8           2.85
                                                        Time(sec)                                           x 10
                                                                                                                4
The flow chart of the proposed slope algorithm is shown in Figure 2.9.
                                                          Calculate slopes:
                                                        xn  xn  1  xn
                                             N
                                                           Reach the end?
Output
                                                                                                                    23
Chapter 2
   Step 1: Subtract each data point from the next data point following it.
xn  xn  1  xn . In other words, the slope values of all data points are calculated.
   Step 2: From a starting location, try to join the following points with small
slopes. A special concern here is the occasional noisy points shown in Figure 2.10.
The connection stops when both of the following two conditions are met:
            The difference of the average of the steady points after this point and
             before the point is also beyond the threshold
   Step 3: After the interruption point, start the processing of a new segment from
the first point that has a close-to-zero slope value after the interruption point.
Redo step 2. The whole iteration ends until all points of the dataset have been
processed.
Step 4: Output.
                                                                                      24
Chapter 2
700
600
                    500
         Power(W)
400
300
200
100
                      0
                       4.9       5          5.1        5.2      5.3       5.4      5.5 4
                                                  Time(sec)                         x 10
600
                   500
        Power(W)
400
300
200
100
   First of all, the performance of the two proposed methods on noisy point
handling is compared. Figure 2.12 shows four consecutive fridge’s operations.
They are all polluted by signal noises from the meter-side. The two proposed
methods are applied to the four cycles respectively. As can be seen from (a), the
data segments (in red color) captured by using the segment-conjunction method
are interrupted many times by noises; while in (b), there are only 8 big data
segments that represent the ON and OFF states of fridge. Accordingly, 8 events
                                                                                           25
Chapter 2
are captured (in blue color) between neighboring data segments. Hence, the slope
method seems much more powerful when dealing with noisy segments.
  Referring to the other challenges mentioned in 2.1, both of the two methods
can effectively deal with the spike-type event and slow event that contains more
data snapshots. Examples are shown in Figure 2.12.
1400
1200
                           1000
                 Powe(W)
800
600
400
200
2000
                       1500
            Power(W)
1000
500
  The two methods have also been applied to the data acquired from a single hot
phase of a local residential house on a typical day. The histograms of captured
data segments using the two proposed methods are shown in Figure 2.13. As can
be seen, the number of segments captured from the slope method is smaller but
the average length of data segment is longer. Again, it proves that the slope
method can catch longer segments containing noises. Accordingly, the number of
                                                                                                                                         26
Chapter 2
events captured by the slope method will also be smaller but much more
reasonable.
2000
                                1500
        Number of segments
1000
500
                                          0
                                           0   200       400     600    800          1000        1200   1400   1600   1800   2000
                                                                              Length of segments
600
                                        500
                   Number of segments
400
300
200
100
                                          0
                                           0   200       400     600    800          1000        1200   1400   1600   1800   2000
                                                                              Length of segments
                                                                                                                                      27
Chapter 2
currents. Typical values of Thresmin and Thresmax are 0.9 and 1.1.
                                                      Phase A
           5000
           4000
Power(W)
3000
2000
1000
              0
              4.4      4.45   4.5       4.55    4.6     4.65      4.7   4.75   4.8    4.85      4.9
                                                      Time(sec)                              x 10
                                                                                                    4
                                                      Phase B
           3500
3000
           2500
Power(W)
2000
1500
1000
500
              0
              4.4      4.45   4.5       4.55    4.6     4.65      4.7   4.75   4.8    4.85      4.9
                                                      Time(sec)                              x 10
                                                                                                    4
                                                                                       Actual
                                                                                     power curve
                                                                                       Acquired
                                                                                      data points
dP
                                                                                                        28
Chapter 2
   One solution is to increase the sampling rate. For example, doubling the
sampling the rate will result in different results as shown in Figure 2.16. In this
case, three segments can be captured and thus two independent events dP1 and
dP2 can be found.
                                                                   Actual
                                                                 power curve
                                                                    Acquired
                                                                   data points
                                        dP2
dP1
                                                                                 29
Chapter 2
                                                                                                                                                               Actual
                                                                                                                                                             power curve
                                                                                                                                                               Acquired
                                                                                                                                                              data points
                                                                                                           dP2
dP1
                         1000
              Power(W)
                          800
                                                                Fridge
                          600
400 2000
                                                                                                                                          Overlapping
                                                                                                             Power(W)
                          200
                             0        50         100          150         200    250     300         350
                                                                                                                        1500
                                                       Time (Second)
                         1200                                                                                           1000
                         1000
                                                              Furnace                                                    500
            Power(W)
800
200
Time (Second)
                                                                                                                                                                               30
Chapter 2
                                               AMN K
                                    P  1  (     )
                                                MN
                                                                              (2.4)
                                    AN        M!
                                     M
                                            ( M  N )!
   In practice, for a specific hour, assume 3600 points are captured, 5 appliances
are involved and each appliance shows up 3 times on average. According to (2.4),
the probability of event-overlap occurrence in this hour is
                                          5
                                         A3600
                             Ph  1  (       5
                                                )3  0.84%                     (2.5)
                                        3600
   The above results may be higher or lower than the ones in reality because
appliances events are not uniformly distributed along the time line. In some hours,
different appliance operations are more expected to be seen while in some hours
are not. Also the event frequency of some particular appliances such as stove and
washer within certain hours could be extremely high.
   It is also found that increasing sampling rate can also help solve the problem.
For example, now assuming the sampling rate is increased to 0.5sec/snapshot,
there will be 7200 snapshots in an hour. Therefore, we have:
                                                                                 31
Chapter 2
                                           5
                                          A7200
                            Ph  1  (          )3  0.42%
                                        72005                                 (2.7)
                            Pday    1  (1  Ph ) 24  9.5%
2.4 Summary
   In addition, specific common issues with event detection were also discussed:
a simple method to detect double-phase event was proposed; misidentification due
to adjacent events was also discussed. Both hardware and software solutions were
proposed and explained; in the end, the problem of event overlap was also
introduced and the method to calculate its probability was addressed. Based on a
hypothetical calculation, it was also found higher sampling rate can effectively
reduce the probability of overlap.
                                                                                32
Chapter 3
Chapter 3
   The aim of this chapter is to establish a technical foundation for the proposed
event-window based load identification method which will be further discussed
afterwards. Some contents in this chapter have been submitted as publication
[104].
3.1 Overview
   Most of the existing non-intrusive load monitoring methods treat all the
appliances as a single-state model which has a pair of identical ON/OFF edges
and a constant power demand between them [55]-[60]. In other words, it assumes
the operation of appliance has a constant steady point which does not change its
electric status such as real power, reactive power and harmonic content
dramatically. This assumption simplifies the load models but unfortunately, it
cannot accurately reflex the reality--- many complex loads cannot be represented
by this model and thus large errors could occur for the future identification steps.
                                                                                   33
Chapter 3
1. When signatures are only compared in a short time scale such as within a
second. In this case, 10 ms is only half cycle (50Hz system).
2. When the studied loads have only single operation state. In Figure 2-1, only
water boiler can fall in this category. Air conditioner, TV and induction cooker
are all complex loads which have multiple or continuous varying states. They
should not to be treated as single-state appliances.
                                                                                 34
Chapter 3
   1. Some loads do not have significant turn-on transients such as resistive loads.
The loads with observable turn-on transients are limited to florescent lights,
motors and some electronic appliances. Some major loads such as stove and
incandescent lights may not contain observable transients.
   2. The NILM based on turn-on transient models can only determine the
operating schedule of loads but cannot track the whole process of loads. It is
because a large portion of load information is not considered except for the turn-
on transient. For example, turn-off events usually do not contain sufficient
transient information as turn-on events and cannot be easily identified in this way.
                                                                                 35
Chapter 3
   To solve the drawbacks of steady-state load model and transient load model,
event-window load model is proposed and its signatures are studied.
            Power
                                      Continuous       Multi-state
                       Single-state    varying
Time
                                                                                  36
Chapter 3
                                                                                  37
Chapter 3
Power
Fridge
                      Fridge
                                                      Light
                  Non-overlapping       Overlapping
                    window 1             window 2
Time
     Event signatures
     Event pattern signatures
     Power trend signatures
     Time/Duration signatures
     Phase signature
   Comprehensively, these signatures are able to describe the load operation from
a more complete and accurate perspective. They allow the load model to contain
multiple steady states because different events’ signatures and event pattern
signatures are now included. In the meanwhile, power variation which can be
often seen in continuous-varying loads is also described. Besides, time/duration
signatures are proposed because they can reflect the usage behaviors of appliances.
Finally, phase signature is also explained to describe the electric “location” of the
load in a residential house. These signatures are respectively discussed as below.
   An event refers to the change of the operating state of an appliance, which can
be often seen as a step change or edge in its power, reactive power or harmonic
content versus time plot. The edge can be either rising or falling. Each edge can
be characterized by the changes in power (P), reactive power (Q) and current
waveform (W) as shown in Figure 3.5. For a certain appliance, its attributes stay
generally fixed for each time operation if the system voltage does not change
sharply. In single-state model, only one set of P-Q-W is considered since the OFF
                                                                                  38
Chapter 3
Δ:P,Q,W
   As shown in Figure 3.6, a load is connected to the power system and it can be
represented by using a two-port network [98].
 Figure 3.6: Two-port network representation of load and its voltage, current and
                                    power
                                                                                39
Chapter 3
   Real power is defined as the average power absorbed by the above network
within a cycle. Its value is:
                                1 T
                         P        pdt  UI cos(u  i )  UI cos 
                                T 0
                                                                               (3.1)
   Heavily affected by the designed functions, different loads can have very
distinct real powers. For example, a stove usually has a large power since it is a
cooking device and a lot of heat is converted from electricity; a LED light on the
other hand normally consumes a little power since it only generates a little light.
Typical values of real powers of different residential load types are shown in
Table 3-2.
                                                                                 40
Chapter 3
   In order to introduce the concept of reactive power of loads, the equation (3.1)
can be rewritten as [98]:
                P  UI cos(u  i )  UI cos(2t  u  i )
                   UI cos(u  i )  UI [cos(u  i ) cos(2t  2i )
                   sin(u  i ) sin(2t  2i )                               (3.2)
                   UI cos [1  cos((2t  2i )]  UI sin  sin(2t  2i )
                   p1  p2
   In the above equation, p1 does not change its direction of power transferring
and thus represent the actual power consumed by the two-port network. On the
other hand, p2 ’s frequency is 2 and in a cycle under the actual frequency , its
average value is zero. In reality, this part of power is transferring back and forth
between the network source and network load. Its magnitude UI sin  is defined
as the reactive power of load in power system and its unit is “var”:
Q  UI sin  (3.3)
   As can be seen from (3.3), when  >0, Q  UI sin  >0 and it means the load is
inductive and can absorb reactive power; when  <0, Q  UI sin  <0 and it
means the load is capacitive and can generate reactive power. cos  is defined as
the power factor of load.
                               S  UI  P 2  Q 2
                              
                                   P  S cos                                  (3.4)
                                   Q  S sin 
                               
                                                                                  41
Chapter 3
   In power system, many resistive loads such as incandescent light, stove, oven
and kettle has a power factor which is close to 1. It also means they have
negligible Q or p2 . For motor based loads such as fridge, freezer, washer,
dishwasher and dryer, their equivalent circuits are similar to inductors. Thus they
have a lower power factor and have a notable Q or p2 . Hence, reactive power
levels can be very useful signatures when distinguishing between active loads and
reactive loads.
                                                                                42
Chapter 3
  Figure 3.8: Example of a power electronic circuit--- Three phase SCR rectifier
   Circuits like the above can generate distorted current waveforms which can be
described by a function of f(t) with period T and fundamental frequency of f0=1/T.
f(t) can be expressed by a Fourier series:
                                                        
                                           f (t )  c0   ck cos(2 hf 0t  h )   (3.5)
                                                        h 1
                     T
                     2
                 1
where ck h 
                 T      T
                             f (t )e  j 2 hf0t dt .
                     
                         2
                                                                                      43
Chapter 3
                                                                                 44
Chapter 3
(a) Linear-load
                                                                                45
Chapter 3
   Stoves, dryers or some coffee makers are typical multi-state appliance with
repetitive sequence due to their internal integer-cycle controllers [96]. Figure 3.12
shows an operation process of stove.
                                                                                  46
Chapter 3
                                                                                   47
Chapter 3
Power 900
(W) 800
700
600
500
400
300
200
100
                               0
                               4300    4400   4500   4600   4700        4800   4900   5000    5100   5200
                                                             Time(sec)
               Figure 3.15: Trend signature 1---rising spike (Fridge)
  After start, as shown in in Figure 3.16, with the motor speed gradually
increases, its current drawn may sometimes decrease and form a gradual falling
curve.
Power 2200
(W) 2000
1800
1600
1400
1200
1000
800
600
                               400
                                  35           40            45                50            55             60
Time(sec)
(W) 700
600
500
400
300
200
100
                       0
                           0             20          40            60           80           100       120
                                                            Time(sec)
                Figure 3.17: Trend signature 3---Falling spike (TV)
                                                                                                                 48
Chapter 3
  Pulses are usually caused by electronic switches. A lot of stoves have pulses
because they have integer-cycle controllers inside [96]. It prevents itself from
overheating. Another example is an inverter based motor device that adjusts its
frequency/speed all the time. An example can be seen from Figure 3.18.
Power 1400
             (W)
                    1300
1200
1100
1000
900
                     800
                                1650   1700   1750     1800   1850    1900   1950   2000
                                                     Time(sec)
                    Figure 3.18: Trend signature 4---Pulses (Washer)
  A lot of appliances have negligible transient characteristics and present almost
flat power curves during operation. Actually, this is the only type of appliance
which can be ideally represented by the old single-state load model used in
previous researches. An example can be seen as below:
Power 800
(W) 700
600
500
400
300
200
100
                        0
                            0            5            10         15          20
                                                     Time(sec)
                     Figure 3.19: Trend signature 5---Flat (Kettle)
  In contrast, as can be seen from below, some appliances may have continuous
low-frequency fluctuations all the time instead of a steady state. An example can
be seen from Figure 3.20.
                                                                                           49
Chapter 3
Power 400
                  (W)
                           350
300
250
200
150
100
                            50
                                 0   200     400         600         800     1000   1200
                                                    Time(sec)
                Figure 3.20: Trend signature 6---Fluctuation (Freezer)
   In addition, some audio and video electronic devices such as audio box, laptop
and desktop PC may have high-frequency noises which are shown in Figure 3.21.
It is because their power consumptions constantly vary with the sound, visual
display or other tasks their internal analogue or digital circuits are processing.
Power 80
                    (W)
                            70
60
50
40
30
20
10
                             0
                                 0   50    100     150         200     250    300   350
                                                    Time(sec)
            Figure 3.21: Trend signature 7---High frequency noise (Laptop)
   It should be noted some appliances such as fridge may have more than one
type of trend signatures. Also, some trends are not only found in one type of
appliance but usually in several types of appliances due to their common electrical
characteristics.
                                                                                           50
Chapter 3
   Since continuous power points are being measured from smart meter, trend
signatures can be          further represented and defined by several                         slope
(P / t ) variation modes described in Table 3-4. Also, these slope modes can be
  The time of load window appearance relates closely to its function. There are
some statistical studies on residential load modeling which present typical load
on-hours such as shown in Figure 3.22 [31],[97].
Microwave
Kettle
Toaster
Fridge
Lighting
PC
0 2 4 6 8 10 12 14 16 18 20 22 24 hr
                                                                                                51
Chapter 3
   There are two 120V hot wires installed in a typical North American residential
house as shown in Figure 3.23. Hereby, the two wires can be named as A and B.
Most appliances are connected between A or B and neutral. However, some heavy
appliances such as stove and dryer are connected between A and B to gain a 240V
                                                                               52
Chapter 3
             Meter side
                                                             Phase-A
               CT-A
                                                             Phase-B
               CT-B
                                                             Neutral
3.7 Summary
                                                                                53
Chapter 3
loads was explained. Besides, based on the behavior statistics of different types of
appliances, time/duration signatures were presented. Finally, phase signature was
introduced to label the electric “location” of the load in a North American
residential house.
   The proposed event-window model along with the above load signature
studies established a solid foundation for proposed event-window based load
identification method that will be discussed in Chapter 4.
                                                                                 54
Chapter 4
Chapter 4
4.1 Overview
   Load identification is the most essential task for NILM algorithm. Researchers
have attempted to solve the problem using different algorithms. In [55], the
authors suggest dividing these algorithms into two categories---signal-
combination based algorithm and change of appliance state or event based
algorithm. This chapter intends to introduce a new category---event-window
based algorithm. The structure of NILM algorithms is shown in Figure 4.1. The
                                                                                55
Chapter 4
   One example is shown in Figure 4.2, in an enclosed system with limited and
fixed number of appliances, specific appliances are turned on and their aggregated
signals at the meter-side are respectively recorded and labeled. In other words,
training sets can be generated this way and each training set is relevant to a
specific combination. Since each appliance has an ON and OFF state (if not
considering the middle states), for a system that consists of N appliances, there
are in total 2 N combinations. It is therefore impossible to try all the possible of
combinations when N is big. However, limited number of tries may be enough
                                                                                 56
Chapter 4
already to set up the classification model. Then the classification model should be
able to predict future combinations.
                                                                                              2                                                                                             40
         3
                                                   10
                                                    5
                                                                                             1.5
                                                                                              1
                                                                                                                                                                              Generate      30
                                                                                                                                                                                            20
         1                                                                                   0.5
                                                                                                                                                                                            10
         0                                          0                                         0
                                                                                                                                                                                             0
                                                                                             -0.5
         -1
                                                    -5                                                                                                                                      -10
                                                                                              -1
         -2
                                                                                             -1.5                                                                                           -20
         -3
                                                   -10
                                                                                              -2
                                                                                             -2.5
                                                                                                                                                                             Training set   -30
        20
                    Freezer                         1
                                                               Computer                            0.5
                                                                                                                      TV                                                                    50
                                                                                                                                                                                                     Meter side
        15                                         0.8                                                                                                                                      40
        10
                                                   0.6
                                                   0.4
                                                                                                    0
                                                                                                  -0.5
                                                                                                                                                                              Generate      30
         5                                                                                                                                                                                  20
                                                   0.2
                                                                                                    -1
         0                                                                                                                                                                                  10
                                                    0
         -5                                                                                       -1.5                                                                                       0
                                                   -0.2
        -10
                                                   -0.4                                             -2
                                                                                                                                                                                            -10
-15
        -20
                                                   -0.6
                                                   -0.8
                                                                                                  -2.5                                                                   Training set       -20
-30
        -25                                         -1                                              -3
           0   50   100   150   200   250   300       0   50   100   150   200   250   300            0         50     100         150         200         250         300                  -40
                                                                                                                                                                                               0    50   100   150   200   250   300
……
                                                                                                                                                                                                                                       57
Chapter 4
                                                                               58
Chapter 4
   1. Since this stream directly deals with aggregated or combined signal, all
loads in the system have to be covered. For example, in a 20-appliance house,
even if only 3 appliances are interested to be identified, the other 17 appliances
still need to participate in the training process because they can significantly
affect the aggregated signal when they are turned on together with the interested 3.
   Generally speaking, the signal-combination based methods only work well for
system consisting of ON/OFF appliances. In the meanwhile, the studied system
should have a fixed load inventory. Besides, the users have to be knowledgeable
and patient enough to go through such a long initial process. In order to more
clearly reveal the above drawbacks, a signal-combination based algorithm based
on [69] is implemented and compared with the proposed method in Section 4.5.
                                                                                   59
Chapter 4
is based on this approach. As shown in Figure 4.6, only real power and reactive
power are considered as the event signatures due to the hardware limit and
processing speed at that time. When an event is detected, it is compared to the P-
Q map and identified as the name of the closest “circle” in the map.
                                                                               60
Chapter 4
   Agreed with the comments given in [94], this research stream is more robust
and practical. The event based algorithms only focus on interested target loads
and do not need to cover all the loads in the system. It can also resist the change
of load inventory. However, it has the following drawbacks that need to be
improved:
   2. Since events are separately identified, the whole operation process and
energy of load cannot be re-organized and tracked. In [62], it is claimed that only
operating schedule of loads can be obtained. Also, it cannot effectively deal with
multi-state and continuous varying loads.
   Its procedure is shown in Figure 4.8. The events of interested loads are
detected, identified and finally re-organized as output. Firstly, meter-side current
and voltage are acquired and transferred to event detector. Related data
acquisition will be explained in Section 4.3.1. Secondly, as explained in Chapter 2,
event detector detects all the events of all the loads. Thirdly, through specific data
preprocessing, real power, active power, harmonic of all events are retrieved and
organized. Also trend signatures are scanned along the power curves. This will be
addressed in Section 4.3.2. The next steps are the core of load identification
procedure--- through signal split, window candidate selection and evaluation,
                                                                                   61
Chapter 4
decision of identification can finally be made with respect to the prior collected
appliance signatures. These signatures are used as benchmarks and they are
named as appliance candidates. The details of this core step will be discussed in
Section 4.2. Finally, output results are organized and displayed on user interface.
This will also be presented in Section 4.3.3.
Meter-side signal
                                              Event
                                            detector
                                              Data
                                         preprocessing
                                            Appliance
                                            identifed        Decision making
Output
                                                                                      62
Chapter 4
   Two CTs inside meter naturally separate the aggregated signal acquired by
smart meter into signals of two phases: phase-A signal and phase-B signal.
Accordingly, to deal with phase-A signal, only phase-A loads will remain as
candidates. So is for phase-B signal. Normally, a light bulb connected to Phase B
will never be seen from CT-A. This is a very important step because many
candidates can be ruled out very easily.
   Two exceptions should be addressed: for phase A-B appliance, since any of its
edges shows up simultaneously at both phases, it will be treated as appliance
candidates only if two CTs can detect two identical edges at the same time. Since
the processes at both phases are the same, any phase signal can be chosen for
identification purpose; for portable appliance, on the other hand, since it has an
uncertain phase signature, it will be left as candidates for both phase signals.
                  Power/w
                 380
                 350
                                    2
                 270
                                                3
                 150
                            1
                                                         4
                  50
                             0       10    12       20
                                                             Time/min
                                                                                   63
Chapter 4
                 Light
                Furnace
   Firstly, applying power slope based event detection to the data, 2 rising edges
and 2 falling edges can be located (two large positive slopes and two large
negative slopes) and labeled. As discussed in section II, signal collection between
any pair of rising and falling edges is considered as a window candidate. In Fig.9,
there are in total 4 window candidates: 1-3, 2-4, 1-4, 2-3. Those window
candidates are waited to be compared with appliance candidates one by one.
   Typically, a residential house may have more than 500 rising and falling edges
per day. It indicates the potential number of window candidates per day can be
250,000 in maximum. This will cause too much computing burden. One way to
reduce the window number is to discard some window candidates by using
appliances’ possible window lengths.
   According to Table 4-1, window candidate 2-3 is too short to be possible for
kettle, fridge and furnace. Candidate 1-4 is also too long for kettle. Those
windows are firstly thrown away even before they enter into next evaluation step.
In fact, this window length limit has stronger effect on refining longer period data.
For a day period, only 120-200 window candidates will be left based on multi-
case studies.
   This is the core step of identification. In this step, the collected signatures of
appliance candidates in prior are set as initial benchmarks. Then each of the left
window candidates will be compared to these benchmarks and obtain their
individual scores on event, event pattern, trend and time signatures
( Sevt , S ptn , Strd , Stime ). Then those 4 scores will be synthetically considered to get an
overall score. Afterwards, the overall score is used to judge if this window
candidate is matching the benchmark---the appliance candidate. The mathematical
judgment is completed by using the following linear equation.
                                                                                            64
Chapter 4
g ( x)   T x   (4.1)
with
                                    Sevt        evt 
                                   S            
                               x   ptn  ,    ptn                        (4.2)
                                    Strd        trd 
                                                     
                                    Stime      time 
   x includes   the scores of each signature with respect to the prior benchmarks.
The determination on individual scores will be elaborated in Section 4.2.2.  is
the weight vector since for different types of appliances, different signature sub-
scores are not equally important. T x is therefore the overall score.  is the
qualification threshold. The result g(x) has two scenarios: when g ( x)  0 , this
window candidate is determined as this appliance; otherwise not.
                                                                                 65
Chapter 4
   The weights  are firstly estimated based on the observation and analysis for
different appliance types. For example, knowing furnace and washer have unique
event pattern signatures, S ptn will be emphasized; knowing microwave is often
   Usually weight vector  does not change much for the same type of loads from
a house to another. This is because the same type of loads normally share very
similar working principles and follow a certain standard in a particular region.
The row “Average” in Table 4-2 gives a rough setting when the load type is not
known. This can be used to cope with unfamiliar loads whose weights have not be
studied and optimized. The advantage of the weight vector is that when
comparing, there is no absolutely dominant signature---various signatures are
bonded together to ensure fairness and accuracy.
D. Make decision
                                                                                66
Chapter 4
    In the end, Table 4-3 is calculated according to equation (4.1) and the g ( x)
values are filled in Table 4-3.
    From the signs of classifier values, window 1-3 is determined as a kettle while
window 2-4 is a fridge since their values are greater than 0. If no positive value is
found, it means the events could be due to mis-match of events or an unknown
appliance not registered in the database yet (maybe not interested by users either).
         From the signature database, an appliance candidate only includes its own P-
Q-W event sets. In contrast, a window candidate may include other events caused
by overlapped appliances. The comparison is trying to answer if this window
candidate includes all of the appliance candidate’s events. Thus the process is like
                                                                                    67
Chapter 4
this: each of registered events will be compared throughout all the events in
window candidate one by one. Then:
                                                         Ne'
                                                Sevt                                                      (4.3)
                                                         Ne
         As shown in Figure 4.10, both of the two registered events B-D are found in
the window ( Ne  Ne'  2 However, if only B exists, it is very likely the window
candidate is only one part of the appliance process and its Sevt  0.5 ( Ne  2, Ne'  1 ).
Power/w Power/w
   380
   350
                       B          Ne’=2
                                                                                     Ne=2
   270
                              C
               A                                                                    Candidate
   150                                                            110               Appliance
                                            D
                    Candidate Window                              80
   50                                                                       B                        D
                0       10   12        20                                       0               10
                                                       Time/min                                          Time/min
                                                                                                             68
Chapter 4
   For ON/OFF type appliance, it has a fixed sequence of events; for multi-state
appliance, as discussed in section II, fixed sequence and repetitive sequence may
either be found.
   For fixed sequence events, they always follow a certain order pattern. For a
window candidate, its event order should comply with the order pattern defined in
appliance candidate. For example, as shown in Figure 4.11, a space heater has 5
events in the order of A-B-C-D-E. It is expected to find A-B-C-D-E in the
window. On the other hand, an A-B-D-C-E sequence may imply a different
appliance process and B-C-A-D-E is even more different.
Power/w Power/w
                                                               300
                                                                                  C          A:+100
                        D                                      250
                                                                                             B:+200
             B                             C   A
                                                   D                     B                   C: -50
                                       B                                                 D   D:-150
                               C                                                             E:-100
                   Window                   Window
       A         candidate 1
                                   E       candidate 2    E    100
                                                                             Appliance
                                                                     A       candidate       E
Time Time
                                                                                                             69
Chapter 4
   It is easily known that B-C-A-D-E and C-B-A-D-E are more disordered than
A-B-D-C-E compared to the original sequence A-B-C-D-E based on their lengths
of changed positions. For a given sequence composed of n letters/events, the
maximum possible length of changed position is:
                                 L                       n 1
                           M   [n  (2k  1)] , L                       (4.4)
                                k 0                       2
Based on the discussion above, Strd for appliance with fixed sequence can be
quantified as
                                                  Nf
                                     Strd  1                             (4.5)
                                                  M
Table 4-4.
                                                                             70
Chapter 4
repetitive event.
                                                 N r'
                                       Strdr                                           (4.6)
                                                 Nr
    As for a combination sequence load such as furnace, Strd can be decided based
on its two sub-indices fixed pattern score Strdf and repetitive pattern score Strdr .
    As discussed in Section 3.4, power slope based scanning can effectively scan
the window candidate and further determine the existences of trend signatures
with respect to appliance candidate.
                                                Nt'
                                       Strd                                            (4.7)
                                                Nt
                                                                                          71
Chapter 4
                                        1, t  T
                                Stime                                         (4.8)
                                        0, t  T
   As explained in Section 4.2.1, for different loads, their weights on event, event
pattern, trend and time are different. In addition, their events also have different
weights on real power, reactive power and harmonic contents. Rough ranges of
weights are pre-defined based on the experience and understanding of the
electrical/physical attributes of different loads, but their values should be further
fine-tuned and verified through a specific program. Once the weights are
determined, they do not need to be changed from one house to another. This
advantage also implies no local training process is necessary when the proposed
NILM is applied to a new system.
                                                                                  72
Chapter 4
   Similar to the “face recognition” process, for a specific load, this optimization
program generates numerous pre-labeled event-windows based on the signatures
of this load and also other types of loads that can mimic different “backgrounds”
(other appliances). Besides, certain amount of signal noises (<10%) are added
onto these windows. By doing this, the large number of pre-labeled testing event-
windows includes not only the actual objective loads but also the other loads.
   The process of the above optimization is based on empirical initial ranges and
method of exhaustive search. Since the accuracy needed for  and  is not
extremely high, for each try, the step change is set as 0.02. By using this simple
optimization method, optimal  and  values can be found for each load type.
                                                                                              73
Chapter 4
                                                                                         74
Chapter 4
   The setup of the data acquisition system is shown in the above picture:
Electricity panel of a residential house is first opened; two current transducers are
respectively connected to two phases. For each phase, the current transducer is
clamped around its live wire; also a voltage transducer is connected between one
of the live wires and the neutral wire; currents and voltages are being transferred
to the National Instrument (NI)-DAQ system. The NI-DAQ system supports
simultaneous inputs from multiple channels and has a high-resolution A/D
converter in it; finally, digital signals of voltage and current are sent to the
connected laptop via a USB port. Generally speaking, this data-acquisition system
behaves like a smart meter.
   The data is acquired at per second basis. For each second, a snapshot
composed of 6 cycles of synchronous currents and voltage are recorded. For each
cycle, 256 points are acquired. Example of a snapshot is shown in Figure 4.14.
                          3
                Phase A
               Current(A) 2
                          1
-1
-2
                          -3
                            0        200     400       600    800    1000    1200    1400   1600
                 Phase B
                           6
                Current(A)4
                           2
-2
-4
                           -6
                             0       200     400       600    800    1000    1200    1400   1600
               Voltage(V) 200
                           150
100
50
-50
-100
-150
                          -200
                              0        200     400      600    800    1000    1200   1400   1600
                                                                                                   75
Chapter 4
connected to the live or neutral wire of the load supply cable. The voltage
transducer is connected to a vacant electric outlet adjacent to the outlet the load is
plugged into. The data format and sampling rate stay the same as the meter-side.
   The amount of data acquired from the previous step is usually very large and
not convenient for processing by the identification module. Hence, a data
preprocessing step that can compress the data but keep the important information
becomes necessary. The purpose of this step is to generate event matrix and trend
scan table.
                                                                                   76
Chapter 4
   An example of event matrix is shown in Table 4-5: event No., time points,
rising or falling type and electric signatures of all the events are calculated and
recorded for the data of a given period such as a day. As discussed in Chapter 2,
events can be divided into 3 groups: phase-A events, phase-B events and phase A-
B events or double phase events. Accordingly, 3 event matrices are generated
respectively for the above three groups. Later on, in the identification step,
different event matrices can be selected according to the phase signature of the
objective load.
   It should also be especially noted that since only 6-cycles snapshots are
acquired within each second, P,Q and Ih from different snapshots have to be
aligned up before subtraction. For alignment, voltage can be used as reference
since the phase angle of voltage stays constant before and after a load state
transition. Two approaches are proposed for the calculations of P,Q and Ih. The
snapshot shown in Figure 4.14 is used as an example.
                                           I 1  I 1 ( I1  V1 )
                                          
                                           I h  I h ( h  h   1 )
                                                           I        V
                                           1
                                          V  V V
                                                     1     1
                                                                                           (4.11)
                                          
                                           P  I V cos(V   I )
                                                   1   1         1     1
                                          
                                          Q  I 1 V 1 sin(V1   I1 )
                                                                                               77
Chapter 4
   Compared with Approach 1, Approach 2 is much faster since there is only one
time Fourier transform on 5-cycle currents. However, it is less accurate since not
                                                                                 78
Chapter 4
all cycles are fully utilized and the assumption that voltage has no harmonic
content is not always true.
                                  I 1  I 1  I1
                                 
                                  I h  I h  h
                                                 I
                                                                             (4.12)
                                  P  I V cos( I )
                                          1         1
                                 
                                 Q  I 1 V sin( I1 )
   The above 3-phase event matrices have included event, event pattern,
time/duration and phase connection signatures. However, as discussed in Chapter
3, trend signatures should also be fed into load identification module for judgment.
The other task in the data preprocessing stage is to generate a trend scan table
shown as below:
   As can be seen from Table 4-6, for a given window, different trend slope
signatures are recognized according to the slope characteristics listed in Table 3-4.
The above process is conducted throughout all types of phase connections for the
studied period. Finally, 3 trend scan tables that each represents one type of phase
connection can be generated.
                                                                                  79
Chapter 4
   The results are updated every half an hour and displayed in the interface shown
in Figure 4.17. As can be seen, appliance electricity consumption information is
formatted into the table and charts. The table summarizes the total energy counted
from a certain date and converted expenses with respect to local electricity rates
(say,7.4¢/KWH). The pie chart presents the percentage composition of individual
appliance so users can be aware of the significance of reducing a certain
appliance’s consumption. Finally, from the time distribution of energy use chart,
user can understand his identified appliance activities statistically with respect to
hours. This information is quite essential for residential house owners to adopt
proper demand response strategies such as load shifting according to utility’s
TOU rates. More discussion can be found in Chapter 5.
   The above algorithms and devices were tested in several real residential houses
in Edmonton, Canada for several weeks. A laptop based data acquisition system
was hooked to the electricity panel and behaved like a smart meter. A Zig-bee
transceiver was connected to its USB port to bridge the communication with the
appliance register. After registration was finished, a computer program based on
the proposed algorithms was launched and started to process. Interested appliance
                                                                                  80
Chapter 4
   The results for house #1 are listed in Table 4-7 (this house has no furnace). The
definitions of all the table columns are explained as below:
       Actual operation times (AOT): the actual observed and inferred operation
        times by manual identification.
                                                                                 81
Chapter 4
                                        CIT  FIT
                           Accuracy              100%                     (4.13)
                                          AOT
  Therefore, using equation (4.13), false identified time has a negative impact on
identification accuracy and thus the index can be more objective. In order to more
clearly present the results and discussions, examples of identified event-windows
are automatically marked in red by the algorithm and some failures are circled in
green. These examples are shown as below:
                                                                               82
Chapter 4
                                                                                 83
Chapter 4
(b) Failure
                                                                             84
Chapter 4
Figure 4.22: Examples of identification for stove elements using low power
Figure 4.23: Examples of identification for stove elements using high power
                                                                                  85
Chapter 4
                                      (b) Failure
                 Figure 4.25: Examples of identification for kettle
                                                                        86
Chapter 4
   Similar to the fridge, sometimes when the events of kettle overlap with other
frequent appliances, identification failure may occur due to event detection failure.
                                                                                    87
Chapter 4
  Microwave            72              71                  0                 99
     Kettle            14              14                  1                 93
    Washer              4               4                  0                100
     Dryer              5               5                  0                100
Stove elements          8               8                  0                100
 (low power)
Stove elements          5              5                   0                100
 (high power)
   Average             ---             ---                 ---              96
   As can be seen, even if the furnace’s operations sometimes overlap with the
fridge’s operations, identification is almost not affected. This is because fridge is
                                                                                  88
Chapter 4
   Also, it is found the proposed algorithm can withstand certain level of signal
noises such as the figure shown as below. As can be seen, even if the steady state
of two later fridge operations have significant noises, they can still be correctly
identified as long as their events ON/OFF are not heavily polluted.
                                                                                 89
Chapter 4
   The results are shown in Table 4-9. It is found TV has a lower identification
rate. This is because TV has a flexible duration. The duration can be very long
                                                                                 90
Chapter 4
such as hours but can also be shorter than half an hour. The identification rate is
therefore reduced since many more window candidates will be included which
lead to potential interference.
   REDD, a public dataset available for NILM research was released by MIT in
2011 [95]. The data was acquired from the greater Boston area in US. The dataset
can be used to validate the NILM algorithm since several appliances have been
labeled using extra data-logging devices that are directly connected to the target
appliances. The No.3 house from the dataset is selected for the validation purpose.
In general, the dataset contains 1427284 seconds or roughly 16 days and the
dataset is not continuous. Many appliances are pre-labeled. For some unknown
reason, the washer is not pre-labeled by MIT, however, it is labeled by us through
realistic inference with respect to dryer activities. Several major appliances have
been identified and the results have been listed in Table 4-10.
                                                                                91
Chapter 4
           When the load window duration is not changing significantly or not too
            long;
                                                                                  92
Chapter 4
   Firstly, specific appliances were measured in the lab and their harmonic
signatures were collected. Not like the proposed approach, no process related
signatures is considered by neural networks. Harmonic contents of aggregated
signal are used as input layer while appliance composition list as output layer.
Since both magnitude and phase of a certain harmonic order are considered, the
input layer has 16 nodes of up to 15th harmonic content (only odd ones). Hidden
nodes are set to be 20. As shown in Figure 4.2, according to [69], numerous
training sets are generated mathematically by adding up harmonic contents
(waveforms) of designated individual appliances. Also, to make training more
reliable, a less than 10% deviation is added to original magnitude as noises.
  For test stage, a bottom-up based aggregating program turns on/off each load
according to a certain operation probability. The aggregated meter-side signal is
formed this way. Then both of the two approaches were tested to decode the
overall meter signal and their performances are discussed as below.
                                                                                93
Chapter 4
   Comparison is firstly conducted when there are only single-state type loads.
This is because single-state type loads only have a steady-state harmonic content.
The results are shown in Table 4-11.
                                                                               94
Chapter 4
   As can be seen, for a system composed of only single-state type loads, the
proposed approach has a performance similar to NN based approach. This is
because there is no change in each appliance’s operation process. However,
results are heavily changed when complex loads are brought in.
                                                                                 95
Chapter 4
                       bulb
              Fluorescent light bulb         38.6            95.8
                      Fridge                 51.3            96.1
                      Freezer                56.3            90.6
                      Washer                 45.4            92.7
                     Furnace                 44.3            95.6
                       Stove                  ---            ---
   Another obvious advantage of proposed approach is it only identifies loads that
users are interested in and willing to register. In contrast, NN based approach’s
training process has to cover all major appliances. Also, once user purchases
another heavy load such as a stove, the accuracy of identification will become not
reliable at all. This is because the neural network model needs to be changed due
to the newly added element. As shown in Table 4-13, any aggregated signal that
has stove on at the same time will become unidentifiable (this is especially severe
for other kitchen appliances). However, stove hardly has any impact to the
identification of registered loads using proposed approach because the proposed
approach is based on searching relevant events of registered appliances only.
Those non-relevant events from stove will be excluded from the window
candidates of the registered loads.
                                                                                 96
Chapter 4
           Training process does not need to cover all major appliances any more.
            Users only need to register their interested loads they want to track
            down;
4.6 Summary
           It does not require a local training process since the weights of different
            signatures for different loads can be optimized in the lab before taking
            to the field. Only simple optimization process is needed;
           The average identification rates in the four tested houses are all above
            90%. The overall accuracy is very satisfactory;
                                                                                    97
Chapter 4
   However, the proposed algorithm does have some disadvantages for the
following scenarios:
           If the appliance has a very long and flexible duration, the number of
            window candidates will increase dramatically which could lead to
            larger identification error.
   Overall speaking, the proposed algorithm makes a good balance between being
effective and being practical.
                                                                               98
Chapter 5
Chapter 5
Loads
   The results can have various applications. One is to create signature databases
for the NILM applications. Another is for load condition monitoring. Validation
results conducted on the data collected from three actual houses and a laboratory
experiment have shown that the proposed method is a promising solution to the
problem of non-intrusive load signature collection. Some contents in this chapter
have been submitted as publication in [105].
5.1 Overview
   To perform load decomposition, all NILM techniques must rely on the unique
signatures of individual loads. In order to obtain these signatures, the existing
                                                                                   99
Chapter 5
                                                                                 100
Chapter 5
Figure 5.2: Relations of magnetic field, electric field and the EMF event detector
   In [83], a smart phone based signature extraction system is developed. It
requires manual confirmations from human experts or house residents to label the
events of appliances. The process is shown in Figure 5.3.
   Figure 5.3: Smart phone and human confirmation based signature extraction
                                   system
                                                                               101
Chapter 5
meter (or the device which does appliance identification).Smart meter does two
things: capture the event window of this appliance and determine the signatures
of event window.
   In fact, the above signature extraction processes discussed in 5.3.1 and 5.1.2
have the following common disadvantages:
   1. They are intrusive methods. They all require additional hardware devices to
either directly measure load signatures or label load events. Fundamentally, this
                                                                               102
Chapter 5
intrusive step is opposite to the basic principle and intention of non-intrusive load
monitoring research. In other words, it makes NILM not purely non-intrusive.
   3. They are hardware based methods. Extra cost will be added into the
solutions. Unfortunately, so far, there is no other pure-software based approach on
signature extraction for NILM.
 It is clear that the above intrusive ways of signature collection are not desirable
by ordinary house owners. Therefore, there is a need for methods that can collect
the appliance signatures non-intrusively. If successful, unique signatures that are
specific to an appliance in a particular home can be extracted and archived. To
solve the above drawbacks, this chapter proposes a novel unsupervised non-
intrusive signatures extraction (NISE) approach which does not require any
measurement or direct input from users and is purely software based. Given the
meter-side data of a certain days, specific major appliance events can be located,
associated and then their complete operation cycles can be reconstructed
automatically. For the proposed NISE method in this chapter, the key is to
intelligently study these events and establish knowledge of appliances
automatically.
   As shown in Figure 5.5, firstly, all appliance events are captured through the
event detection method discussed in Chapter 2. Secondly, event filtration is
adopted to identify the suspect events of interested appliances. Thirdly, event
clustering is utilized as a tool to pick out the appliance’s authentic events from its
suspect events. Fourthly, based on the authentic events, event association
algorithm is further applied to reconstruct complete operation cycles of appliances.
Finally, both electric and event pattern signatures of appliances can be extracted
from the reconstructed cycles to fit any existing signal combination based NILM
                                                                                  103
Chapter 5
                     Proposed
                                                  Data flow
                     flowchart
                       Start                        Data
                                                Representative
                 Event Association
                                                operation cycle
                                                 Knowledge
                        End
                                                 (signatures)
NILM
   One operation cycle of each appliance may have two or more events depending
on if it is an ON/OFF type appliance or a multi-state appliance. An ON/OFF type
appliance such as a light bulb has only a pair of ON and OFF events while multi-
state appliance such as furnace may have a series of operation state changes in the
middle. An appliance’s events are heavily characterized by its function and
physical electric attributes and can be roughly located. For example, a fridge
usually has an ON event with real power of 70-300 W and a reactive power of 30
to 200 Var. Due to the cooling function of a fridge, the ON events can be
                                                                               104
Chapter 5
observed during 24 hours even when users are sleeping; A microwave usually has
an ON event that has an real power of 800 to 2500 W and a heavy third harmonic
content. As a cooking device, a microwave is likely to be observed before
mealtime.
   In this chapter, the events that match the specific conditions of a certain
appliance are defined as the “suspect events” of this appliance. The aim of event
filtration is to locate the suspect events of a given appliance that may lead to the
reconstruction of its entire operation cycle by carrying out future steps. To
implement filtration, the conditions which can restrict the captured events are the:
real power range, reactive power range harmonic content range, with or without
spike, single phase or double-phase and searching time.
   The real power, reactive power and harmonic content ranges are closely
determined by the electric attributes of specific appliances. Residential loads can
be roughly divided into four categories based on their linearity and reactivity:
linear/active appliance, linear/reactive appliance, non-linear/active appliance and
non-linear/reactive appliance. Depending on the type of category a certain
appliance belongs to and its designed function, the numeric ranges of the above
conditions can be quantified. For example, as a linear/active appliance, a resistive
kettle has a very low reactive power and almost zero harmonic contents. Also, due
to its water-heating purpose, its real power may statistically range from 1300 to
3000 W. Motor based appliances such as fridges can be viewed as inductors
which lead to large reactive power. Another category is the switch-mode power
supply based electronic appliances such as TVs and computers. They are neither
inductive nor capacitive, but produce a large amount of harmonic contents. In
addition, some appliances are both non-linear and reactive. Table 5-1 lists some
examples of the above four appliance categories.
                                                                                105
Chapter 5
   A Spike can be another ancillary condition that helps locate some induction-
motor based appliance events. A large inrush current occurs at the first moment of
operation when the rotor is triggered from the station into movement. This unique
feature is accompanied by an ON event, which can be reflected as a sharp edge on
the appliance’s real power curve.
   The phase condition can easily separate some events from other appliances’
events. In North America, some appliances are connected between two hot phases
to gain a 240V voltage while others are connected between a single hot phase and
a neutral to gain a 120V voltage. From identification perspective, the events of
double-phase appliances can be observed simultaneously at both hot phases,
whereas the events of single-phase appliances occur at only one of them. In a
residential house, only a few heavy-consumption appliances are connected to
double-phase electric outlets. These appliances are the stoves, ovens and clothes
dryers.
   The search window is another very important restriction that greatly reduces
the searching space of suspect events. Some statistical studies are available on
residential load behaviors which present typical appliance runtimes [97].For
example, microwave’s operations are more expected to be seen before breakfast,
lunch and supper; lights are usually turned on in the early morning or after dark;
fridges and furnaces are likely to run day and night. These occurrences show that,
to locate the suspect events of specific appliances easily, it is best to search their
less overlapped time ranges. For example, for a fridge, it is best to search from
2:00 AM-5:00 AM while many other appliances are generally inactive. For
                                                                                  106
Chapter 5
microwave, it’s best to search the periods before meals. By doing so, the
interference from other appliances can also be minimized.
Hours
                       Day
                                       Kettle’s joint data piece
                                                                         With     Phase
 Appliance         P(W)             Q(Var)             THD (%)
                                                                        spike?   Condition
   Fridge          70-300           30-200              0-20             Yes      Single
  Furnace         120-800           200-800             0-20             Yes      Single
Microwave        800-2500           80-500              20-50             No      Single
 Stove (big
                 1800-3000           0-30                0-5              No      Double
  element)
                                                                                           107
Chapter 5
  element)
   Oven         2200-3600          0-30         0-5          No        Double
   Kettle       1300-3000          0-30         0-5          No         Single
Clothe dryer    3000-6000         60-250        0-5         Yes        Double
  Washer
                  80-300          <100         65-95        Yes         Single
(Front-load)
  Washer
                300-1000         300-1200      0-20         Yes         Single
 (Top-load)
   Appliances may have different working modes. For example, a stove usually
has 4 heating elements (two big ones and two small ones) on its panel. The small
elements of a stove consume only approximately half the power of the big
elements. Since their signatures are quite different, they should be treated as two
different types of appliances.
                                                                                 108
Chapter 5
   It should also be noted that some appliances may have different working
principles which result very different signatures. For example, most modern front-
load washers are controlled by a variable-speed drive which outputs significant
harmonic contents. In contrast, an old top-load washer behaves more like a
regular big motor which outputs higher P/ Q and little THD. These two washers
should also be treated as two different types of appliances.
                    THD 
                           ik2
                                , k  3,5, 7...
                               i1                                          (5.1)
                                                                              109
Chapter 5
   One basic assumption is that inside a suspect event group of appliance X, the
number of events belonging to appliance X should be much larger than the
number of events belonging to other appliances. The following examples below
explain this assumption: suppose 50 suspect events of a fridge are located from a
data piece for 2:00-5:00 AM of a whole week. It is possible that the user wakes up
between 2:00-5:00 AM on Tuesday and for an unknown reason, uses an appliance
such as a ceiling fan that accidentally meets the same filtration conditions as those
of the fridge. Thus this fan’s event will be mis-recognized as the fridge’s suspect
event and included in the fridge’s 50-suspect-event group. This scenario might
occur occasionally on certain days; however, this scenario is highly unlikely to
occur as frequently as the fridge since such operations would be abnormal---he
would have to turn on/off the fan not only every day but also as frequently as the
fridge’s kickins during sleeping time. Another example is a microwave. The user
might turn on/off an unknown appliance that draws similar P/Q/Harmonic signals
as a microwave does before mealtime, but it is unlikely to occur as often as a
microwave. For any of these abnormal scenarios, the proposed unsupervised
approach is not intended for and not able to deal with.
   Furthermore, in some cases, corrupted events may occur that are usually due to
simultaneous occurrences of more than one appliance event. This rarely happens
                                                                                 110
Chapter 5
but is possible. For example, the accidental overlapping of two single events
belonging to a fridge and furnace will result in a power jump that is roughly the
summation of the two events. Normally, the frequency of these corrupted events
is much smaller than the authentic events.
   Based on the above assumption, if the number of groups of events and the
number of events each group possesses are known, the authentic events can be
determined as those in the event group with the maximum number of members.
Clustering is an effective tool to obtain the information. A clustering algorithm
determines which events are roughly the same and can be grouped together as one
cluster. After event filtration, since the clustering space is greatly reduced, many
uncertainties and noises can be ruled out and clustering can be done much faster
and more accurately compared to applying clustering to all the events of a day in
the beginning. For instance, as Table 5-3 shows, suppose 88 suspect events of a
fridge are located and after applying clustering algorithm, only four clusters are
found. Then from the number of events each cluster owns, the dominant cluster
which has 75 event members can be identified as having been caused by the
fridge and these 75 event members can be labeled as the fridge’s authentic events.
On the other hand, smaller clusters are discarded because they are likely to be
from other appliances or noises and cannot represent the objective appliance. The
clusters and their mean values are listed in Table 5-3. Two types of clustering
methods are applicable to our study.
                                                                                111
Chapter 5
   Our proposed approach selects the mean-shift algorithm for clustering. Before
introducing mean-shift clustering, it is necessary to explain another related
clustering method named K-means clustering. Comparison of the two will also be
given afterwards.
(a) (b)
(c) (d)
                                                                               112
Chapter 5
   As shown in Figure 5.7, before the procedure begins, the number of clusters K
must be initialized. In this example, K is set as three. Accordingly, in step (a),
three “means” are randomly chosen from the given dataset. In (b), three clusters
are created by grouping every sample with its nearest mean. In (c), based on the
three clusters that are determined from (b), centroids of the three clusters are
calculated and the three old means are now “shifted” towards the three new
centroids. In the end, three new clusters are formed in (d). After this, (b) and (c)
are repeated until means are not shifted (convergence has been reached).
                                                                                113
Chapter 5
   For mean-shift clustering, assuming the sample data is a finite set S in the n-
dimensional Euclidean space, X, its kernel function is shown as below:
                                                                              114
Chapter 5
                                         1, x  
                                K ( x)                                      (5.2)
                                          0, x  
                              m( x ) 
                                          sS K ( s  x)s                    (5.3)
                                          sS K (s  x)
   Step 3: For all the clusters, shift the cluster centers towards the new means
calculated in step 2;
   Step 4: Repeat step 2 and 3 until the centers of all the clusters are not moving
anymore. In other words, the means of all the clusters are not updated anymore;
   In our study, P/Q/THD are selected as three attributes used for clustering
purpose. The “mean” linkage type is selected to determine the distance between
two clusters. All features are firstly normalized to the range of 1 to 100 through
min-max scaling [113]. For mean-shift clustering, its bandwidth parameter is set
from 5-20 due to the estimated variation of the electric signatures caused by the
local system voltage fluctuations. Generally, for the above two clustering methods,
when the power variance inside the suspect event group is large, the bandwidth of
mean-shift clustering can be relatively lowered in order to strengthen the cluster
differentiation; when the power variance inside the suspect event group is small,
the bandwidth of mean-shift clustering can be relatively increased to strengthen
the cluster fusion.
                                                                               115
Chapter 5
   The above standard mean-shift clustering has one drawback due to the
Euclidean space it uses where different attributes will be treated as equally
important. However, as explained in Table 5-1, depending on which category the
appliance type belongs to, different electric features should not be treated equally
because some features may be almost noise features. For example, for a fridge, as
a linear/reactive appliance, it produces considerable active/reactive powers. But it
barely produces any harmonic content and thus the THD feature is actually a
noise feature that might be caused by misreading from the meter-side; for a stove,
both of its reactive power and harmonic contents should be disabled because it is
basically a resistor.
   To cope with the above noise feature problem, P/Q/THD features can be easily
selected according to Table 5-1. This is because for the proposed non-intrusive
signature extraction algorithm, the type of appliance is already known in advance
and the selection of features can be therefore determined based on the category
the appliance type falls into. The effect of feature selection is similar to using a
Minkowski metric weighted space explained in [112] where the “relative
distance” on low-weight features contributes little to the total distance.
                                                                                 116
Chapter 5
2 events caused by the fan. Generally, mean-shift with feature selection has a
better performance than mean-shift clustering in the proposed application.
                                                                             117
Chapter 5
   After the event clustering, the knowledge of the appliance is still incomplete
since the other events except for the ON event in an appliance’s operation cycle
are still unknown. For accurate energy-tracking purposes, all the events and even
the event pattern signatures [104] of an appliance have to be obtained. For
example, for a fridge, its OFF event needs to be known; for multi-stage appliance
such as a furnace, it is even more important since the ON event is only a small
part of the furnace’s full operation cycle. Its middle events and OFF event also
need to be found. Knowing how these middle stage events occur in a particular
pattern is also valuable. The overall purpose of event association is to determine
the other events which also belong to the objective appliance and hence
reconstruct its full representative operation cycles. .
   The proposed method counts the number of event occurrences and calculates
the frequency to judge whether an event is associated with a specific appliance
and determines the event’s association type. An operation study of actual
appliances [104] defines three association types:
                                                                              118
Chapter 5
                                                                              119
Chapter 5
  The way to locate associated events and determine their association types can
be determined by examining the data segments starting from the authentic ON
events (determined by the event clustering in the last step). Each data segment is
defined as a piece of data that has an authentic ON event in its beginning. The
                                                                               120
Chapter 5
length of the piece is properly defined so it will not be too short (may fail to
include all the associated events of the appliance) while it will also not be too
long (may include too many unrelated events as interferences). The reasonable
lengths can be defined as 1.5 times the average durations of the appliance
operation. Table 5-4 shows the average durations of typical appliance operation
and their segment lengths.
   Suppose there are M authentic events, there will also be M corresponding data
segments which may include not only associated events of appliance but other
irrelevant events. Applying event clustering methods discussed in Section III to
all events in all segments together, clusters of events can be determined.
According to the definitions of three association types, theoretically, the criteria in
Table 5-5 can be used to examine association of those clusters individually.
   For single event, they should appear in each data segment and thus event
number N should be equal to data segment number M; for repetitive event, since
in some data segments, multiple events may appear, event number N should be
larger than M; for occasional event, its occurrence should be more frequent than a
threshold ruled by b. b is a factor to differentiate if an event is related to objective
appliance or not. Considering possible mistaken clustering or missing events due
to improper data segmentation, the above theoretical criteria should be corrected
using confidence c and additional conditions shown in Table 5-6. Normally,
values of b,c can be set as 0.3 and 0.8.
                                                                                    121
Chapter 5
   An example of how event association works is shown in Figure 5.14 and Table
5-7. For the given appliance, ON-event “1” is the initial authentic event
determined by the event clustering step previously. Accordingly, 4 data segments
of this appliance are located and each segment has the same length determined
referring to Table 5-4. At the beginning of event association, clustering is applied
again to all the events included in these segments. As a result, each event can be
labeled by the index of the cluster it individually belongs to. After the occurrences
of each cluster are counted, the association type can be determined by using
criteria in Table 5-6. The results are presented in Table 5-7 .
Power/w Power/w
3 4 3 4 6 3 4
                  1                    2                                2
                                                       1
                                           Time                             Time
            Power/w                               Power/w
                                                                5
                                                                    4
                      3    43      4                        3
                                                                        2
                                       2
                  1                                    1
                                           Time                             Time
       Figure 5.14: An example of event association from 4 data segments
                                                                                   122
Chapter 5
above association accuracy is to have more data segments, which may require
more data pieces from more days, say a week. The more data segments being used,
the less likely an unrelated event is to be incorrectly judged as a related one due to
the error of the confidence factors b and c; also, the impact of a possible
clustering error from the previous step or the missing events due to improper data
segmentation can be reduced to a minimum.
   The above algorithms were tested by using data acquired from a real
residential house in Edmonton, Canada for a week with no special attention from
the house owner. A laptop-based data acquisition system was hooked the house’s
electricity panel and continuously collected all the voltages and currents from the
two hot phases (A and B) inside. It behaves just like a smart meter. The data were
sampled every second. In each second, 6 consecutive cycles are acquired and each
cycle has 256 points. Data acquired was then processed using the proposed
                                                                                  123
Chapter 5
   Table 5-8 shows the search window and authentic events determined in the
step of event clustering. As can be seen, for each appliance, the number of
authentic events (the largest no. of events in a single cluster) is larger than the
average number of events per cluster. This result supports the assumption in
Section III that authentic events will dominate inside suspect events.
                                                                                    124
Chapter 5
      (PhaseA)       5:00 AM
       Fridge        2:00AM-           31         2        7(22.6%)
      (Phase B)      5:00 AM
                     2:00AM-            4         1         4(100%)
      Furnace        7:00AM
                     8:00PM-
                     0:00AM
                     6:00AM-
                     9:00AM;
       Micro-       11:00AM-           54         7         27(50%)
       Wave          2:00PM;
                     4:00PM-
                     8:00PM
      Stove(big      4:00PM-           517        9       505(97.7%)
      element)       8:00PM        (Repetitive)
    Stove (small     4:00PM-           77         5        60(77.9%)
      element)       8:00PM        (Repetitive)
        Oven         8:00AM-            7         5        3(42.9%)
                      8:00PM
                     6:00AM-
                     9:00AM;
       Kettle       11:00AM-           11         4        8(72.7%)
                     2:00PM;
                     4:00PM-
                     8:00PM
                     8:00AM-           92         5           61
    Clothes dryer    11:00PM       (Repetitive)             (66.3%)
                    (Weekend)
       Washer       3 hrs before       380        10      189(49.7%)
                       dryer       (Repetitive)
  As for the reconstructed operation cycles, the following two verification
methods are conducted:
                                                                       125
Chapter 5
(a) Furnace
(b) Microwave
                                       126
Chapter 5
(e) Oven
(g) Kettle
                                      127
Chapter 5
(h) Dryer
(i) Washer
   Figure 5.16.Reconstructed cycles (red) vs. Labeled real cycles (blue) in house
                                       #1
                                                                              128
Chapter 5
with an event is needed, the corresponding event sample can be simply loaded for
study.
   Table 5-9 shows acceptable accuracy between the reconstructed cycles and
labeled cycles. Please note the electric signatures of appliances can change within
+/- 5% due to system voltage fluctuation. As
   Table 5-9 shows, most of the real power errors are lower than 5%. Some errors
of reactive power or harmonic THD are a little higher, because the corresponding
appliances produce little reactive power or harmonic contents. The true values of
these attributes are comparable to signal noises and can greatly fluctuate between
measurements, especially when they are small. For example, even if several
measurements are directly performed for the kettle’s harmonic THD, differences
at the listed level may still be observed. Another example is the washer. Since it is
controlled by a variable speed driver, which continuously generates many
repetitive event pairs with heavy noises (as shown in Fig.7), the inherent signature
consistency of these events is lower than that of other appliances. Thus, the above
errors are quite normal. Furthermore, they will not affect NILM’s identification
significantly. Very small or zero weights will be given to these unstable attributes
when making identification [104]. Similarly, classifiers such as neural networks
[69] will “ignore” these signatures automatically during their training stages
because these signatures in their training sets vary a lot too. The mean values
being used can also balance out the variations of abnormal events inside clusters.
Generally speaking, the electrical signature accuracy is very satisfactory.
                                                                                 129
Chapter 5
  Table 5-9 Electric signature error between reconstructed cycles and reference
                                cycles for house #1
    Also, the speed of the proposed algorithm was found to be fast. Based on a
regular desktop PC (Intel 2-Quad CPU 2.33GHz, 4GB memory), the 3-stage time
required to process the appliances are recorded. The most time-consuming
appliances “Stove(big element)” and “Washer” in house #1 is listed in Table 5-10.
                                                                              130
Chapter 5
Table 5-10: 3-stage time required for the most time-consuming appliances
         Figure 5.17: The total power data on a typical day from house #2
   Similarly to house 1, another real residential house with a smaller family size
in Edmonton was also measured for a week. The house contained 1451 events
(>50W) in Phase A, 1698 ones in Phase B and 564 ones in Phase A-B. The
house’s total power on a typical day is shown in Figure 5.17. It should be noted in
this house an old top-load washer is used. Its reconstructed cycle referring to a
real labeled cycle is shown in Figure 5.18, which shows that between ON and
OFF, its power gradually decreases and becomes stable after a few minutes.
   By using the proposed approach, 8 major appliances were found (the house has
only one fridge). As Table 5-11 shows, the electric signature is quite accurate
with respect to the signatures contained in the labeled reference cycles.
                                                                                 131
Chapter 5
 Figure 5.18: Reconstructed cycles (red) vs. l with labeled real cycles (blue dash)
                      for the top -load washer in house #2
 Table 5-11 Electric signature error between reconstructed cycles and reference
                               cycles for house #2
                                                                                132
Chapter 5
    Figure 5.19. The total power data of the first 86400 points from house #3
   To further verify the proposed approach, the data from house 3 in the REDD
dataset [95] was used. REDD, a public dataset available for NILM research was
released by MIT in 2011. The data was acquired from the greater Boston area in
US. Please note: 1. In this dataset, many middle hours in a day are missing. For
example, activities of the stove, oven and kettle were not recorded. 2. Although
REDD labels specific appliances but unfortunately, some appliances are not
clearly labeled such as the washer. In general, the dataset contains 1427284
seconds or roughly 16 days. The sampling rate was 275 points/cycle. The house
contains 1160 events (>50W) in Phase A, 2497 in Phase B and 523 in Phase A-B.
As an example, the total power of the first 86400 seconds (the length of a day) is
plotted in Figure 5.19.
     In spite of the above issues in the dataset, 5 major appliances can still be
extracted using proposed approach. Surprisingly, even when the washer activities
were not labeled in the REDD dataset, they were still able to be located and
“mined” by using the proposed method. With the help of the proposed method,
                                                                                133
Chapter 5
the washer’s cycles were re-labeled. Its reconstructed cycles verses real cycles are
plotted in Figure 5.20. Like the washer used in house 1, this is a front-load washer
with a variable-speed drive inside.
 Figure 5.20: Reconstructed cycles (red) vs. real cycles (blue dash) for washer in
                                    house #3
 Table 5-12 Electric signature error between reconstructed cycles and reference
                               cycles for house #3
             Appliance           Power      Reactive Power Harmonic THD
               Event            Error %       Error %            Error %
         Fridge        ON        -1.04          -1.23             -4.16
         Fridge        ON        -0.20          -0.55             0.25
        Furnace        ON        -2.36          -2.49             -1.92
        Furnace     Middle 1     -1.47          -5.46             5.39
        Furnace     Middle 2      7.21          -1.81             -0.96
        Furnace     Middle 3      2.46          3.88              -7.12
        Furnace        OFF        0.61          0.81              -2.68
                                                                                134
Chapter 5
   Scenario 1: The heater ran with only its heating function on; this scenario was
conducted 5 times.
                                                                              135
Chapter 5
                                                                            136
Chapter 5
                                                                              137
Chapter 5
   Overall, the above verification procedures and discussions show that the
proposed signature extraction approach is capable of providing accurate electric
and event pattern signatures for specific appliances automatically as long as the
amount of feeding data is sufficient(more than a week). With this knowledge,
most non-intrusive load identification algorithms can be adopted without intrusive
measurements for the training or registration stages. In other words, by combining
proposed NISE with a NILM method such as [104], a truly non-intrusive load
monitoring solution can be provided.
                                                                                 138
Chapter 5
5.6 Summary
                                                                                139
Chapter 5
   This chapter focused on major appliances. Future research could study the
smaller or unique appliances. Meanwhile, event filtration conditions can be
updated with vast measurements in different geographic regions.
                                                                               140
Chapter 7
Chapter 6
6.1 Overview
           Ordinary appliance activities are usually the ones that can be tracked
            based on NILM’s identification results. Thus their energy can be
            estimated accordingly. This chapter firstly discusses energy estimation
            methods for this type of energy component.
                                                                                141
Chapter 7
   This method takes the operations times of appliances and its average energy
consumption into consideration. The calculation is pretty simple but less accurate.
It can be presented in the following formula:
                                                                                 142
Chapter 7
E  nEave (6.1)
Where Eave is the average energy consumption of one time operation and n is
the operation times that are detected. Eave can be averaged based on several
operation samples.
  For a certain appliance, this method takes the ON/OFF time and the average
power of each operation into consideration. It can be mathematically presented as
below:
                                      N
                                E   (tnoff  tnon ) Pave                   (6.2)
                                     n 1
Where tnon and tnoff are the ON/OFF time of a certain time operation; N is the
total operation times for energy estimation; Pave is the average power and it can
be calculated based on several operation samples.
                                                                               143
Chapter 7
   For single-state appliance, Pave can be chosen from the real power signature
of the steady-state or the ON/OFF event; for a continuous varying ON/OFF
appliance such as a fridge, Pave can be approximated as:
                                            ( Pon  Poff )
                                 Pave                                      (6.3)
                                                      2
where Pon and Poff are the real power signatures of ON and OFF events.
   As for the multi-state appliances, the following universal equation can be used
no matter what type of appliance it is:
                                               N
                                                En
                                               n 1
                               Pave    N
                                                                             (6.4)
                                         (t  t )
                                                off
                                                n
                                                          on
                                                          n
                                        n 1
   where N is the number of operation samples that are taken into consideration
in order to obtain Pave ; tnon and tnoff are the ON/OFF times of each operation
sample.
   Generally, this method has a good performance for all types of appliances as
long as enough samples N are considered in (6.4). However, this method may be
less accurate than the third method as follows.
   Compared to method B, this method uses all the events belonging to a certain
appliance, which includes not only the ON/OFF events but also the middle events.
In addition, this method uses integration of power points to accurately track the
power variations between events. This method requires the time information of all
events inside an event-window and is only applicable to specific NILM methods
such as the proposed event-window based method.
   As shown in Figure 6.1, the objective appliance has two related events #2 and
# 4 while #1 and #3 are events coming from other appliances. The energy of this
                                                                              144
Chapter 7
appliance is therefore the area summation of A1 and A2. In this case, even if the
power curve between event #2 and event #3 is not constant, it can still be
calculated by adding all the acquired power points between event #2 and event #3
together with the 270W base power before event #2 subtracted.
                            m1
                    N M 1 tn
              E    [  P(t )  Pref (tnm )(tnm1  tnm )]                     (6.5)
                   n 1 m 1 t tnm
Where:
   N is the total number of operations that are taken into energy calculation;
   M is the number of events that are included in an event-window. In the
example shown in Figure 6.1, it is three;
   P(t) is the acquired total power at a given instant t;
                                                                                  145
Chapter 7
value of initial power plus this event’s power. For example, Pref(10) to Pref(12) is
270W; after 12th min, Pref(12) to Pref(20) is 270+(-220)=50W.
   To summarize, method A is the simplest and fastest but also the least accurate
method. It is especially suitable when the target load has a fixed operation
process or when only its schedule information is known; method B is applicable
to all types of appliances and it makes a balance between accuracy and
simplicity; method C is the most accurate method but it requires the time instant
information of all window events. Also, it is the most complicated method.
                                       Proposed
                                       flowchart
                                          Start
Edge detection
Event filtration
Event clustering
ON-OFF match
Energy calculation
End
                                                                                  146
Chapter 7
treating each light individually. The main technique the method bases on is the
mean-shift clustering. The proposed method consists of 4 core steps: event
filtration, event clustering, ON-OFF match and energy calculation Some ideas
used in this method can be referred back to Chapter 4. The flow chart is shown in
Figure 6.2. Energy estimation for other load groups may also be referred to the
incandescent light that is explained as follows.
   All the events in a house are firstly captured. Then according to the unique
features that most incandescent lights have in common, suspect events of all ILs
are picked out. This step is implemented through a filtering step---only the
specific events, which can satisfy a certain set of filtration conditions, are further
determined as the suspect events of IL. An example of filtration conditions used
for IL is listed as below:
                                                                                  147
Chapter 7
                                The model used for IL is a single-state load model which has the same absolute
electric characteristics for both its ON event and OFF event. The only difference
is that their signs are opposite to each other. It implies in an event cluster that is
caused by an IL, the number of its OFF members should be approximately equal
to the number of its ON members.
20
10
-10
-20
                               -30
                                     0      10      20       30             40            50             60             70   80   90              100
                                                                            Normalized Active Power(P)
                                Mean-shift clustering is applied to all the ON events and OFF events together
and only their absolute event signatures are considered. In other words, both ON
and OFF events of a certain appliance should be found inside a single cluster.
During clustering, only active power feature is considered. This is because its
reactive power and harmonic contents are too small and can be noises. To rule out
the noisy impact from Q, the nominal values of Q of all events are set to zero. A
typical clustering map for one phase is shown in Figure 6.3.
                                                                                                                                         148
Chapter 7
   It can be seen from the clusters in different colors, there are likely 10 (kinds of)
incandescent lights at this phase. Among these clusters, only the ones which
satisfy:
are selected and taken into the following steps. This is because as a lighting
device, it is expected the frequency of light activities is sufficient; also, as a
single-state load, it is expected to see both ON and OFF events.
   This step digs into a certain cluster obtained from the above step and examines
its inside ON-OFF events. The purpose is to check if they have roughly the same
number of appearances. The philosophy behind this is that theoretically, most of
incandescent lights (simply resistors) are single state appliances of which an
ON event should always match an OFF event. In reality, exceptions may occur
due to the following reasons:
1. Not all the ON and OFF events are successfully captured. Some may be lost;
   2. Events coming from other appliances are mistakenly included in. For
example, for cluster A, we find 10 ON events and 12 OFF events. It can still be an
incandescent light, but two (12-10=2) OFF events could result from a third
appliance, for example the middle events of a furnace.
   When one of the above is true, mismatch on numbers of ON and OFF will
happen.
   Hypothetically, if all true ON and OFF events can be identified and selected,
accurate energy calculation can be done. However, it is impossible to do so.
Alternatively, as a statistic estimation method, all possible combinations of ON
                                                                                   149
Chapter 7
and OFF events can be calculated and an energy distribution can be obtained. An
example is shown as below:
  The above 3 matches will result different energy results. Thus, an energy
distribution can be formed.
  Generally speaking, for a cluster which has m OFF events and n ON events
(m>n), the total number of possible combinations can be calculated using factorial:
                                           m!
                              N cmb                                         (6.6)
                                        n!(m  n)!
                                                                              150
Chapter 7
It should be noted, when (m-n) is large, N cmb can be a very large number and
obtaining all the N cmb combinations can result in a heavy computation burden.
Alternatively, in actual implementation, 10000 different ON-OFF matches are
randomly generated and they can roughly represent all the possible combinations.
   In addition, some ON-OFF match can result in a negative energy such as the
#3 match shown in Table 6-1. It is because the selected OFF events are before the
selected ON events. This kind of matches should be ruled out from the possible
ON/OFF matches because the energy is not allowed to be negative.
   It should be noted, for a given ON-OFF match, paring of ON and OFF event
will not make a change on the energy calculation result. Still taking the above
light A as example, for match#1---(1,2)à(1’,2’), we have:
Power
                    1         1’    2 2’
                            A1   A2  A3                                 3
                                                                               time
                        Figure 6.5: Energy blocks of light A
   Scenario 1: if 1 is paired with 1’ and 2 is paired with 2’, its energy is
E1  A1  A3 (6.7)
E  ( A1  A2 )  ( A3  A2 )  A1  A3  E1 (6.8)
                                                                                  151
Chapter 7
same. In this example, match #1’s energy is always E1 no matter which ON pairs
with which OFF. But it can be different from match #2 and match #3 since events
from different matches can happen at different times.
                  1. For each qualified cluster, calculate the energy consumption for all possible
matches;
                  2. Add the energy of all the qualified clusters together to get overall IL energy
values in this house. In other words, an estimation of energy distribution can be
obtained. This is shown in the following section.
6.3.5 Results
A. House #1
2.5
                 2.0
Probability(%)
1.5
1.0
0.5
                  0
                   6             8             10          12                  14   16    18          20
                                                                Energy (kWh)
                                                                                               152
     Chapter 7
                                          Emin  Emin
                                                     phA
                                                          Emin
                                                            phB
                                                                  1.24  3.45  4.69 kWh
                                         
                                           Emean  Emean  Emean  6.52  5.60  12.12kWh
                                                      phA     phB
                                                                                               (6.9)
                                          
                                           Emax  Emax  Emax  11.66  10.95  22.61kWh
                                                     phA     phB
                   The mean energy for IL is 12.12 kWh. This value is 10.54% of the total energy
     of this house in the 6 days, which is about 115 kWh.
B. House #2
                   In house #2, 7 days in June are processed. The energy distribution that is due to
     different possible ON-OFF matches is shown in Figure 6.6. In total, 10000
     random generated ON-OFF matches are taken into calculation. In this histogram,
     the width of per unit interval is 0.1kWh.
1.2
1.0
                 0.8
Probability(%)
0.6
0.4
0.2
                  0
                  10             15             20            25            30          35      40
                                                              Energy(kWh)
                                       Emin  Emin
                                                  phA
                                                       Emin
                                                         phB
                                                               5.40  5.02  10.42kWh
                                      
                                        Emean  Emean  Emean  10.61  12.68  23.29kWh
                                                   phA     phB
                                                                                             (6.10)
                                       
                                        Emax  Emax  Emax  20.14  23.42  43.56kWh
                                                  phA     phB
                   The mean energy for IL is 23.29 kWh. This value is 16.88% of the total energy
     of this house in the 7 days, which is about 138 kWh.
                                                                                                153
Chapter 7
   Generally, the above results comply with the statistic results obtained in
[114],[115]. Unfortunately, there is no feasible measurement method to accurately
validate the above results.
6.4.2 Results
   The above method is applied to two houses and each for 7 days. The extracted
background power is shown as below:
                                                                             154
Chapter 7
(a) House #1
(b) House #2
                                                                        155
Chapter 7
6.5.1 House #1
  After all the energy components are estimated using the above introduced
methods, the total energy of a house can be broken down into different parts. A
pie-chart of energy consumption for a week in a residential house is plotted and
shown in Figure 6.10.This residential house is located in Edmonton, Canada and
the processed week is in June. Please note, other lights such as fluorescent lights
and compact fluorescent lights are roughly estimated as ¼ of incandescent light
according to their energy efficiency [116]-[118].
                                                                                156
Chapter 7
   As can be seen, a single fridge takes a large portion of energy which is 8.23%.
Although the fridge’s power in this house is only about 140W, it is running
throughout 24 hours a day and the accumulated energy is significant.
   Some kitchen appliances like microwave, kettle and waffle iron (in total 3.97%)
do not consume a lot although they have a large power (>1000W). This is because
for each time operation, it only lasts for minutes or sometimes even shorter.
   Other kitchen appliance such as stove (15.67%) contributes a lot to the total
energy consumption because it is connected to double live phases and its cooking
time lasts much longer, for example, 30 minutes.
   For laundry activities, washer does not consume a lot while dryer consumes an
important portion of energy (5.59%).
                                                                                157
Chapter 7
   “Background energy” and “others” take almost a half of the house’s total
consumption including stand-by power, always-on appliances and all the other
appliances such as TVs, computers and dishwasher which have not been
registered or processed by NILM.
6.5.2 House #2
                                                                             158
Chapter 7
   As can be seen, in May, furnace is still used in this house due to the cold
temperature in Edmonton. It consumes 4.59% of the total energy. Compared to
fridge and freezer (10.51%), it consumes less because its frequency of operation is
lower than the frequency of fridge and freezer.
   Similar results for kitchen appliances and lighting can be found compared to
house #1. The consumption by dryer is smaller than house #1 and it is possibly
due to smaller washing load in this house. Also, some clothes may be dried
outside in the sun.
   A week in late October from House #2 was also processed in comparison with
spring data to reveal the seasonal changes. The results are shown as below:
                                                                               159
Chapter 7
                                                                                160
Chapter 7
  Generally, the total energy consumed in fall is higher than the energy
consumer in spring. This is partly due to the significant increase of the furnace
operation. The energy consumed by furnace increases from 5.72 kWh to 14.38
kWh. This is easy to be understood since the house temperature in October is
much lower than spring. In the meanwhile, the energy of fridge and freezer
increases from 13.1 kWh to 15.98 kWh. Also the dryer increases significantly
from 2.87 kWh to 10.74 kWh. This is probably because the washing load in fall
are generally higher than spring (more clothes). Also in late October, clothes are
not likely to be dried outside. The lighting and stove/oven energy consumed by
this particular week is smaller than spring and this is believed to be caused by
some unknown change of house owner’s behaviors such as less time in the house
and more dining in restaurants during the studied week.
  2. Small power loads can still consume a lot of energy due to energy
accumulative effect. Examples are fridge and freezer.
                                                                              161
Chapter 7
                                                                                162
Chapter 7
tools to encourage load shifting and energy saving for residential customers.
According to [16], the current TOU price is shown in Figure 6.13.
   In Table 6-3, the only available appliances for load-shifting are microwave,
washer, dryer, coffeemaker, oven, stove and waffle iron. Hypothetically, changing
all of them from On-peak price to Off-peak price can save only
19.82  (11.8  6.3)  4( weeks)  4.36 dollars per month.
   However, even for this little amount of money, the required load-shift can
greatly affect customers’ comfortableness and sometimes is even impossible. For
example, cooking is normally always before mealtime and cannot be shifted.
   On the other hand, based on Table 6-3, replacing all the incandescent lights
with fluorescent/compact fluorescent lights may be able to reduce the energy by
about 12%. But this requires additional money investment since fluorescent lights
are much more expensive than incandescent lights [118].
   In addition, a typical electricity bill from Hydro One [119] is shown in Figure
6.14. It can be seen that electricity charge ($81.40) is only 49% of the electricity
bill. The delivery charge, regulatory charge and retirement charge are relatively
fixed charges that do not change with the actual amount of electricity spent by the
residential customers.
                                                                                163
Chapter 7
6.7 Summary
  In this chapter, different energy estimation methods were proposed for three
major energy components in a residential house --- ordinary appliances, appliance
group (incandescent lights) and background energy.
  Using these methods, meter-side data for several weeks which were acquired
from two local houses were processed. The energy estimation results were
presented. Also, common characteristics were revealed and summarized.
                                                                              164
Chapter 7
Chapter 7
   This thesis has approached topics related to event detection, load signature
studies, non-intrusive load monitoring algorithm, non-intrusive signature
extraction algorithm, energy estimate methods and the characteristics of
residential house. The thesis has presented a complete solution for power system
load decomposition, especially for residential end-users. Two major closely
related issues—non-intrusive load monitoring and non-intrusive signature
extraction were discussed in detail. But not limited to this, some other questions
related to event detection, load modeling, load group energy estimation, and
residential energy characteristics were also answered in the thesis.
                                                                               165
Chapter 7
                                                                                 166
Chapter 7
  As with any study, something can always be done to extend and improve the
research. Several extensions and modifications of this thesis can be explored as
follows:
                                                                              167
Chapter 7
                                                                                168
Chapter 8
Chapter 8
References
  [4] H. Farhangi, “The Path of the Smart Grid”, IEEE Power & Energy
       Magazine, Jan/Feb 2010, pp. 18-28.
  [7] S. Rahman, “Smart Grid Expectations”, IEEE Power & Energy Magazine,
       Sept/Oct 2009, pp. 83-85.
  [9] ACEEE, “Study: Smart Meters Alone Not Enough to Significantly Reduce
       Power Use”, available online at:
http://tdworld.com/customer_service/aceee-smart-meter-study-0610/
                                                                           169
Chapter 8
http://www.hydroone.com/TOU/Documents/Introducing_TOU_Prices.pdf
 [18] Guelph Hydro , “Time-of-Use Rates for Electricity”, available online at:
       http://www.guelphhydro.com/en/residential/timeofuserates.asp
 [20] Milind Naphade and et al., “Smarter Electricity Portal Pilot Study Report”,
       IBM            Research,            available          online             at:
       http://www.cityofdubuque.org/DocumentCenter/View/15984
                                                                             170
Chapter 8
 [21] ISTAT, Time Use Survey, Notiziarion. 10, 1990 (in Italian).
 [22] ISTAT, Household consumption: year 1987, vol. 14, 1989 (in Italian).
 [23] Ipsos-RSL and Office for National Statistics, United Kingdom Time Use
       Survey, 3rd ed., UK Data Archive, Colchester, Essex, September 2003
 [29] CSA, “Test Method for Measuring Energy Consumption and Drum
       Volume of Electrically Heated Household Tumble-Type Clothes Dryers”,
       CSA Standard CAN/CSA-C361-92, 1992.
                                                                             171
Chapter 8
 [33] Capasso, A.; Grattieri, W.; Lamedica, R.; Prudenzi, A., "A bottom-up
       approach    to   residential   load   modeling," Power   Systems,    IEEE
       Transactions on , vol.9, no.2, pp.957,964, May 1994
 [36] John N. Sharood, Graham Bailey, D. Mitchell Carr, James Turner and
       David Peachey. “Appliance retrofit monitoring device with a memory
       storing an electronic signature”, U.S. Patent 6 934 862, Aug 23, 2005.
 [37] Digi Smart Plug, “XBee Smart Plug” , available online at:
       http://www.digi.com/wiki/developer/index.php/XBee_Smart_Plug
http://www.blueline innovations.com/Products/PowerCost-Monitor/
 [40] Energy Inc., “TED The Energy Detective”, available online at:
       http://www.theenergydetective.com/
                                                                                172
Chapter 8
 [42] Engadget, “LG Thinq linqs your smart appliances with WiFi and
       smartphone apps“, available online at:
       http://www.engadget.com/2011/01/04/lg-thinq-linqs-your-smart-
       appliances-with-wifi-and-smartphone-ap/
 [43] TriplePundit, "Study Shows Bright Green Future for Smart Appliances",
       available online at: http://www.triplepundit.com/2010/03/smart-
       appliances-could-be-the-next-big-thing/
 [47] W. Qiu, E.Skafidas, and P. Hao, “Enhanced tree routing for wireless
       sensor networks,” Elsevier on Ad Hoc Networks, 2009, vol. 7, pp.638-650.
                                                                              173
Chapter 8
                                                                              174
Chapter 8
 [59] Jian Liang, Simon K. K. Ng, Gail Kendall, and John W. M. Cheng, "Load
       Signature Study—Part I: Basic Concept, Structure, and Methodology",
       IEEE Trans. Power Delivery, vol. 25, no. 2, pp.551-560, 2010
 [60] Jian Liang, Simon K. K. Ng, Gail Kendall, and John W. M. Cheng, "Load
       Signature Study—Part I: Basic Concept, Structure, and Methodology",
       IEEE Trans. Power Delivery, vol. 25, no. 2, pp.551-560, 2010
 [62] S.B. Leeb, S.R. Shaw and J.L. Kirtley, "Transient event detection in
       spectral envelope estimates for nonintrusive load monitoring," Power
       Delivery, IEEE Transactions on , vol.10, no.3, pp.1200,1210, Jul 1995
 [63] S.B. Leeb, S.R. Shaw and S.R.Shaw, “Multiprocessing Transient Event
       Detector For Use In A Nonintrusive Electrical Load Monitoring System”,
       United States Patent, US 5,717,325, Feb. 10,1998.
 [65] S.N. Patel, T. Robertson, J.A. Kientz, M.S. Reynolds, and G.D. Abowd,
       "At the Flick of a Switch: Detecting and Classifying Unique Electrical
       Events on the Residential Power Line", In: UbiComp 2007, pp. 271-288
                                                                               175
Chapter 8
 [70] Duan, J.; Czarkowski, D.; Zabar, Z., "Neural network approach for
       estimation of load composition," Circuits and Systems, 2004. ISCAS '04.
       Proceedings of the 2004 International Symposium on , vol.5, no., pp.V-
       988,V-991 Vol.5, 23-26 May 2004
 [72] Suzuki, K.; Inagaki, S.; Suzuki, T.; Nakamura, H.; Ito, K., "Nonintrusive
       appliance load monitoring based on integer programming", SICE Annual
       Conference, 2008, pp. 2742 – 2747, 2008
 [73] Baranski, M.; Voss, J., "Genetic algorithm for pattern detection in NIALM
       systems," Systems, Man and Cybernetics, 2004 IEEE International
       Conference on , vol.4, no., pp.3462,3468 vol.4, 10-13 Oct. 2004
                                                                               176
Chapter 8
 [75] Prudenzi, "A Neuron Nets Based Procedure for Identifying Domestic
       Appliances Pattern-of-use from Energy Recordings at Meter Panel", IEEE
       Power Engineering Society Winter Meeting, pp. 941-946, 2002
 [76] HH Chang; Kun-Long Chen; Yuan-Pin Tsai; Wei-Jen Lee; , "A New
       Measurement Method for Power Signatures of Nonintrusive Demand
       Monitoring and Load Identification," Industry Applications, IEEE
       Transactions on , vol.48, no.2, pp.764-771, March-April 2012
                                                                              177
Chapter 8
 [84] K. D. Lee, S.B. Leeb, L.K. Norford, P.R. Armstrong, J. Holloway, S.R.
       Shaw, "Estimation of Variable-Speed-Drive Power Consumption From
       Harmonic Content," , IEEE Trans. Energy Conversion, vol.20, no.3, pp.
       566- 574, 2005
 [85] W. Wichakool, A.T. Avestruz, R.W. Cox, S.B. Leeb, "Resolving Power
       Consumption of Variable Power Electronic Loads Using Nonintrusive
       Monitoring," Power Electronics Specialists Conference, IEEE PESC 2007,
       pp.2765-2771, 2007
 [86] W. Wichakool, A.T. Avestruz, R.W. Cox, S.B. Leeb, "Modeling and
       Estimating Current Harmonics of Variable Electronic Loads", , IEEE
       Trans. Power Electronics, vol.24, no.12, pp.2803-2811, 2009
 [87] S.R. Shaw, S.B. Leeb, L.K. Norford, R.W. Cox, "Nonintrusive Load
       Monitoring    and   Diagnostics    in   Power   Systems",   IEEE     Trans.
       Instrumentation and Measurement, vol.57, no.7, pp.1445-1454, 2008
                                                                              178
Chapter 8
 [91] D.C. Bergman, Jin Dong , J.P. Juen, N. Tanaka, C.A. Gunter, A.K. Wright,
       "Nonintrusive Load-Shed Verification," Pervasive Computing, IEEE ,
       vol.10, no.1, pp.49-57, 2011
 [95] J. Zico Kolter and Matthew J. Johnson. REDD: A public data set for
       energy disaggregation research. In proceedings of the SustKDD workshop
       on Data Mining Applications in Sustainability, 2011
 [98] Chenzhi Fan, Dun Sun and Mei Tong, “Circuit Theory” , China Machine
       Press, Edition 2, 2008
                                                                           179
Chapter 8
                                                                           180
Chapter 8
                                                                                    181
Chapter 8
                                                                              182
Appendix
Appendix
   The proposed technique aims to break down the total commercial building
power usage into the level of individual load groups such as all lighting devices
and all computing devices in the building.
         From the statistical perspective, a certain load group which has many
          individuals will follow a generally stable signature pattern and individual
          appliance difference amongst it is negligible when treated collectively.
   For example, an office building has 500 computers, there are two brands A and
B---300 computers with brand A and 200 with brand B. Brand A computer has a
total harmonic distortion (THD) of 30% while brand B has a THD of 40%.
However, since the mixed ratio of brand A and B generally stays the same, as a
group, all computer devices approximately always produce a stable aggregated
THD, say 37%. The prerequisite of this assumption is that brand A and brand B
computers have almost the same behavior characteristics (People do not use them
                                                                                     183
Appendix
differently). Statistically, for any random given time, the number of brand A
computers being used vs. the number of brand B computers being used are
generally still 3:2. Thus, the aggregated THD signature can stay approximately
the same for this load group “computer devices”. However, if for example, brand
A is only used in night-time and brand B is only used in day-time, they have to be
treated as two independent groups. In reality, this rarely happens since for most
cases the same function oriented appliances behave consistently.
   Based on the above assumptions, some group signatures such as THD and
individual harmonic distortion (IHD) as well as power factor can be used to label
specific certain load groups.
   In total, its 1500 loads can be classified into the 7 groups under 4 categories---
non-linear, reactive (linear), resistive (linear) and background. Each of these
categories has their own unique collective group signatures such as IHD ratio for
non-linear load category and power factor for reactive load category. Based on the
collective group signatures, aggregated profile acquired from the building’s
                                                                                 184
Appendix
A. Background load
B. Non-linear load
 CFL lighting
                                                                               185
 Appendix
 can be randomly chosen and put together. Then their collective IHDs and phase
 angles can be measured. This measured representative harmonic pattern can
 approximately represent the aggregated pattern of all computer devices in this
 building at any given time. The table below is an example of representative
 spectrums of all four groups (CFL /Computer /HVAC-A / HVAC-B).
    In the meanwhile, total harmonic signal of the commercial building after the
 background load deducted can also be analyzed and contents of harmonic orders
 can be extracted, which should be equal to the summation of the four non-linear
 groups. The following equations can be listed:
                                                                                                /135.42
            I3    I CFL  I CFL 3( p.u.)  I CMP  I CMP 3( p.u.)  I HA  I HA3( p.u.)  I HB  I HB 3( p.u.)
           
            I5    I CFL  I CFL 5( p.u.)  I CMP  I CMP 5( p.u.)  I HA  I HA5( p.u.)  I HB  I HB5( p.u.)
           
           I7     I CFL  I CFL 7( p.u.)  I CMP  I CMP 7( p.u.)  I HA  I HA7( p.u.)  I HB  I HB 7( p.u.)
           
            I9    I CFL  I CFL 9( p.u.)  I CMP  I CMP 9( p.u.)  I HA  I HA9( p.u.)  I HB  I HB 9( p.u.)
           ...
                                                                                                                  (A.1)
    where the left side are the harmonic contents of total aggregated signal. The
 per-unit value can be read from Table A-1. For example, I CFL3( p.u.) is complex
 value 0.8154 . ICFL , ICMP , I HA , I HB are unknown variables that need to be solved. If
 there are only four groups, harmonic contents of four orders ( I 3 , I 5 , I 7 , I 9 ) are
 sufficient to solve the variables. However, more harmonic contents can also be
 used to convert the problem to an over-determined problem in which case some
                                                                                                                    186
Appendix
optimization methods such as least square method can be applied to improve the
accuracy. Once ICFL , ICMP , I HA , I HB are solved, their power and energy profiles can
be immediately obtained.
   Based on the above discussions, the summation of reactive power from the
non-linear load category can also be calculated. Apart from non-linear load and
background load, the rest aggregated reactive power all comes from the reactive
(linear) loads. Similar to the measurement taken to get representative harmonic
spectrum, representative power factor can also be measured. For example, a
couple of refrigerators and fans can be measured together according to a mixing
ratio. Thus power profile of reactive load can be obtained. Also, based on the
mixing ratio, refrigerators and fans can be further separated.
   In the end, the residual power apart from background load, non-linear load and
reactive load belongs to resistive load.
   Due to limited resources, two simulations are conducted based on the bottom-
up model and program developed for multiple residential houses in [102].
However, it is still sufficient to verify the basic working principle for commercial
buildings and reveal some interesting findings. In this case, a commercial building
is assumed to include 6 groups of loads--- 800 CFL lights, 300 computers, 100
furnaces, 100 inductive motors, 2400 incandescent lights and background power.
Their powers and mixing ratios are listed in Table A-2.
   The program uses Monte Carlo based simulation to generate the aggregated
signal of all the loads according to certain statistical time of use probability
profiles for different loads [102].
                                                                                    187
Appendix
   Simulation 1: At any given moment of a day, the mixing ratios are kept fixed
and the same with the above table’s values. Also, the background power is
removed. This is a very ideal case.
   Using the proposed method explained in A.1, the power curve of individual
load group can be separated from the aggregated signal. In simulation 1, it is
found the separated load group power has exactly the same power has the actual
one. An example is shown in Figure A-2.This is because the mixing ratios are
constant. When solving the problem using equations such as (A.1), the exact
results can be obtained.
                                                                             188
Appendix
                                 4
                              x 10               Actual power of furnace group
                        2.5
             Power(W)
                        1.5
0.5
                         0
                             0            500                                    1000   1500
                                                          Minutes
                                 4
                              x 10              Decomposed power of furnace group
                        2.5
                         2
             Power(W)
1.5
0.5
                         0
                             0            500                                    1000   1500
                                                          Minutes
Power(W)
Minutes
                                                                                               189
Appendix
                        Power(W)
                                       Decomposed inductive motor group
Minutes
  As can be seen from Figure A-2, the error on power curves becomes larger,
especially for reactive load groups. Quantitative comparison can be seen in Table
A-3.
                                                                                          190
Appendix
   As can be seen from Table A-3, the above decomposed results are close to the
actual results. The error for non-linear load category such as CFL group and
computing devices is very low. However, for reactive and resistive loads, errors
are larger. This is because they do not have as plentiful information as the
harmonic spectrum. In other words, they cannot be accurately solved or optimized
by using equations like (A.1). But the results can still be used as reasonable
references. Besides, it is found that the trend shapes of all types of load groups are
very similar to the actual ones.
   Some suggestions are given for future work on load disaggregation research
for commercial buildings:
         More realistic simulations that consider the actual time of use probability
          profiles in commercial buildings [121] can be done.
         Field data based verification should also be performed. This is the most
          challenging part since the number of loads in a building is massive and
          difficult to be measured directly. Energy auditing based methods may be
          considered.
191