Mobility-Aware Vehicle-to-Grid Control Algorithm in Microgrids

Abstract— In a vehicle-to-grid (V2G) system, electric vehicles (EVs) can be efficiently used as power consumers and suppliers to achieve microgrid (MG) autonomy. Since EVs can act as energy transporters among different regions (i.e., MGs), deciding where and when EVs are charged or discharged is an important issue for achieving optimal performance in a V2G system. In this paper, we propose a mobility-aware V2G control algorithm (MACA) that considers the mobility of EVs, the states of charge (SOCs) of EVs, and the estimated/actual demands of MGs, and then determines charging and discharging schedules for EVs. To optimize the performance of MACA, a Markov decision process (MDP) problem is formulated and the optimal charging/discharging policy is obtained by a value iteration algorithm. Since the mobility of EVs and the estimated/actual demand profiles of MGs may not be easily obtained, a reinforcement learning approach is also introduced. Evaluation results demonstrate that MACA with the optimal and learning-based policies can effectively achieve MG autonomy and provide higher satisfaction on the charging.

Index Terms— Vehicle-to-grid (V2G), electric vehicle (EV), microgrid, Markov decision process (MDP), reinforcement learning (RL).

I. INTRODUCTION

IN TRADITIONAL power systems, electricity can be generated according to the demands of consumers. That is, a day-ahead schedule can be generated by predicting consumers' load profiles [1]. Nowadays, renewable energy sources (RESs) have received high attention [2] and microgrids (MGs) have been developed as localized groupings of electricity generation, energy storage, and loads [3], [4]. Even with these new trends, demand response management (DRM)1 is still challenging owing to the properties of RES production (e.g., variability, discontinuity, and poor predictability) and of MGs (e.g., distributed generation).

To achieve efficient DRM in MGs, there is increasing interest in vehicle-to-grid (V2G) technology, which provides reliability by supplying power from the energy storage in electric vehicles (EVs) [6], [7]. Specifically, EVs can adjust their charging or discharging behaviors depending on the load profile and their current states of charge (SOC) [8], [9]. That is, EVs can play a dual role in the electricity market [10]: 1) power consumer when their batteries are charged and 2) power supplier when they sell excess energy from their batteries. Since most EVs are parked for a long time (up to 22 hours per day) [11], they can effectively perform both roles. However, since each EV has different conditions (e.g., arrival/departure time and SOC), an efficient V2G control algorithm is needed to determine the operation (i.e., charging or discharging) for each EV. To address this issue, several works have been reported in the literature [12]–[15]. Shi and Wong [12] proposed a V2G control algorithm based on a Markov decision process (MDP), where price uncertainty is considered by exploiting a Q-learning algorithm. Deilami et al. [13] suggested a real-time smart load management control strategy to minimize the total cost of generating energy and the associated grid energy losses. Chen and Duan [14] introduced a two-stage solution algorithm based on a genetic algorithm to find the optimal number of parking spaces under the optimal scheduling of EVs in MGs. Liu et al. [15] proposed a V2G control algorithm to achieve frequency regulation and maintain the battery energy above a certain level.

Even though these works can improve the performance of the V2G control algorithm, they cannot effectively exploit a salient feature of EVs: EVs can travel across different regions and thus act as energy transporters among different MGs. In particular, when MGs are isolated from the main grid, this feature of EVs can be exploited in a more efficient manner. For example, most EVs move to working regions in the morning, and thus higher electric demand is observed in the working regions than in residential regions. In such a situation, EVs can transport energy from the residential regions to the working ones to satisfy the high demand of the working regions. To balance the difference in power demand among regions, the mobility of EVs was exploited in [10] and energy transport problems were investigated. However, how to optimize the performance of the V2G control algorithm under mobility was not studied.

Manuscript received April 25, 2017; revised October 19, 2017; accepted March 14, 2018. This work was supported in part by the Korean Government (MSIP) through the National Research Foundation (NRF) of Korea under Grant 2017R1E1A1A01073742 and in part by the Basic Science Research Program through the NRF of Korea supported by the Ministry of Education under Grant 2017R1A6A3A03006846. The Associate Editor for this paper was C. Sommer. (Corresponding author: Sangheon Pack.)

H. Ko is with the Smart Quantum Communication Research Center, Korea University, Seoul 02841, South Korea, and also with the Department of Electrical and Computer Engineering, University of British Columbia, Vancouver, BC V6T 1Z4, Canada (e-mail: st_basket@korea.ac.kr).

S. Pack is with the School of Electrical Engineering, Korea University, Seoul 02841, South Korea (e-mail: shpack@korea.ac.kr).

V. C. M. Leung is with the Department of Electrical and Computer Engineering, University of British Columbia, Vancouver, BC V6T 1Z4, Canada (e-mail: vleung@ece.ubc.ca).

Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org.

Digital Object Identifier 10.1109/TITS.2018.2816935

1 DRM aims to shape the load profile to balance energy demand and supply [5].
to be charged or discharged following the policy table at the current time (Steps 4-5). In this example, since MG 1 has surplus energy that can be used to charge only one EV, and EVs 1 and 2 are estimated to move to energy-scarce MG 3 and energy-abundant MG 2, respectively, aggregator 1 commands a charging operation to EV 1 (Step 4(a)) while EV 2 is not charged (Step 4(b)). Since the actual demand of MG 3 is higher than its estimated demand, aggregator 3 commands a discharging operation to EV 3 in Step 4(c). However, in Step 5, since the EVs are expected to move soon,4 the SOCs of the EVs should not be decreased below a certain level. Therefore, aggregator 3 does not command any discharging operation to EV 3 (Step 5(c)). After the EVs' movements, they report their SOCs to the aggregators, and the aggregators forward the EVs' SOCs and mobility profiles to the controller (Steps 6-7). Based on the updated information, the controller can reconstruct its policy table. If there is any update in the policy table, the controller sends the updated one to the aggregators (Step 8). After that, in Step 9, since EV 1 is in energy-scarce MG 3, EV 1 is discharged (Step 9(a)). On the contrary, EV 2 and EV 3, which are now in energy-abundant MGs, are charged (Steps 9(b) and (c)).

III. MDP FORMULATION

To achieve the autonomy of MGs, EVs should be charged or discharged with consideration of the estimated/actual demand profiles of MGs and the EVs' mobility. To this end, we formulate an MDP model5 with five elements: 1) decision epoch; 2) action; 3) state; 4) transition probability; and 5) reward and cost functions [19], [20]. We also introduce an optimality equation and a value iteration algorithm to solve the equation. Then, an RL approach is presented for the case in which the transition probabilities cannot be easily obtained.

4 At this time, the policy table is constructed based on the high probability that EVs will move to other MGs.

5 The MDP model represents a mathematical framework for modeling decision-making in situations in which outcomes are partially random and partially under the control of the decision maker [18]. Therefore, the MDP model is suitable for deciding the charging and discharging schedules of EVs.
C is denoted by

\[ \mathcal{C} = \{C_1, C_2, \ldots, C_{N_{P,C}}\} \tag{2} \]
\[ C_k = [c_1, c_2, \ldots, c_{N_{EV}}] \tag{3} \]

and G is denoted by

\[ \mathcal{G} = \{G_1, G_2, \ldots, G_{N_{P,G}}\} \tag{4} \]
\[ G_k = [g_1, g_2, \ldots, g_{N_{EV}}] \tag{5} \]

where C means the vector set that describes the EVs' SOCs. In addition, G represents the vector set of the MG identifications where the EVs are located (e.g., MG 1, 2, or 3 in Figure 1), and H denotes the vector set that represents the movement phases of the EVs. D describes the vector set for the difference between the estimated demand and the actual demand (excluding EV demands) of the MGs. D_k is given by

\[ D_k = [d_1, d_2, \ldots, d_{N_{MG}}] \tag{9} \]

where d_l describes the difference of the lth MG and N_MG is the total number of MGs in the system. That is, d_l = ID_l − AD_l, where ID_l and AD_l denote the estimated demand and the actual demand of the lth MG, respectively. Hence, if ID_l is larger than AD_l (i.e., when d_l > 0), |d_l| is the surplus volume of electricity in the lth MG. Otherwise (i.e., when d_l < 0), |d_l| represents the shortage volume of electricity in the lth MG.
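To make the formulation concrete, the following Python sketch (ours, not the authors' implementation; the field names and example values are illustrative) shows one way to represent a single decision-epoch state S = (C_k, G_k, H_k, D_k) and a joint action:

```python
from dataclasses import dataclass
from typing import List

@dataclass
class State:
    """One decision-epoch state S = (Ck, Gk, Hk, Dk)."""
    soc: List[float]    # c_j: SOC of the j-th EV; c_j = -1 encodes "insufficient SOC"
    mg: List[int]       # g_j: identifier of the MG where the j-th EV is located
    phase: List[int]    # h_j: movement phase (0 = parked, 1 = moving, 2 = just arrived)
    diff: List[float]   # d_l = ID_l - AD_l: estimated minus actual demand of the l-th MG

# A joint action assigns a_j in {-1, 0, +1} (discharge, idle, charge) to each EV.
Action = List[int]

# Example: two EVs, three MGs; EV 0 is charged, EV 1 is discharged.
s = State(soc=[0.6, 0.8], mg=[0, 2], phase=[0, 0], diff=[1.5, -0.4, -1.1])
a: Action = [1, -1]
```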
\[
P[c_j', g_j' \mid c_j, g_j, h_j = 0, a_j = 1] =
\begin{cases}
1, & \text{if } c_j' = c_j + \zeta\tau \text{ and } g_j' = g_j\\
0, & \text{otherwise}
\end{cases} \tag{13}
\]

\[
P[c_j', g_j' \mid c_j, g_j, h_j = 0, a_j = 0] =
\begin{cases}
1, & \text{if } c_j' = c_j \text{ and } g_j' = g_j\\
0, & \text{otherwise}
\end{cases} \tag{14}
\]

\[
P[c_j', g_j' \mid c_j, g_j, h_j = 0, a_j = -1] =
\begin{cases}
1, & \text{if } c_j' = c_j - \zeta\tau \text{ and } g_j' = g_j\\
0, & \text{otherwise}
\end{cases} \tag{15}
\]

\[
P[c_j', g_j' \mid c_j \ge \eta_{g_j g_k}, g_j, h_j = 1, a] =
\begin{cases}
p_{g_j g_k}, & \text{if } c_j' = c_j - \eta_{g_j g_k} \text{ and } g_j' = g_k\\
0, & \text{otherwise}
\end{cases} \tag{16}
\]

\[
P[c_j', g_j' \mid c_j < \eta_{g_j g_k}, g_j, h_j = 1, a] =
\begin{cases}
p_{g_j g_k}, & \text{if } c_j' = -1 \text{ and } g_j' = g_k\\
0, & \text{otherwise}
\end{cases} \tag{17}
\]
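The per-EV transition probabilities (13)–(17) can be sketched as follows (our illustration; eta and p_move are lookup tables for η_{g_j g_k} and p_{g_j g_k}, and the h = 2 case, which (13)–(17) do not cover, is assumed here to leave c and g unchanged):

```python
def ev_transition_prob(c_next, g_next, c, g, h, a, zeta_tau, eta, p_move):
    """P[c', g' | c, g, h, a] following (13)-(17).

    zeta_tau:      SOC change per decision epoch (charge/discharge volume)
    eta[g][gk]:    SOC consumed by a trip from MG g to MG gk (eta_{g gk})
    p_move[g][gk]: probability that an EV leaving MG g arrives at MG gk (p_{g gk})
    """
    if h == 0:
        # (13)-(15): parked, so the location is fixed and the SOC follows the action
        return 1.0 if (g_next == g and c_next == c + a * zeta_tau) else 0.0
    if h == 1:
        # (16)-(17): moving; the destination gk is reached with probability p_{g gk}
        gk = g_next
        if c >= eta[g][gk]:
            return p_move[g][gk] if c_next == c - eta[g][gk] else 0.0
        # insufficient SOC for the trip: the SOC state collapses to c' = -1
        return p_move[g][gk] if c_next == -1 else 0.0
    # h == 2 is not covered by (13)-(17); we assume c and g stay unchanged
    return 1.0 if (c_next == c and g_next == g) else 0.0
```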
Since EVs move independently, P[H_k' | H_k, G_k] = ∏_j P[h_j' | h_j, g_j]. The transition probability of h_j can be defined as follows. We assume that the residence time of the jth EV in g_j (i.e., the MG where the jth EV is currently located) follows an exponential distribution with mean 1/μ_{j,g_j}. Then, the transition probability from h_j = 0 to h_j' = 1 is given by μ_{j,g_j}τ [24]. Therefore, when h_j = 0, the transition probability of h_j can be derived as

\[
P[h_j' \mid h_j = 0, g_j] =
\begin{cases}
1 - \mu_{j,g_j}\tau, & \text{if } h_j' = 0\\
\mu_{j,g_j}\tau, & \text{if } h_j' = 1\\
0, & \text{otherwise.}
\end{cases} \tag{20}
\]

Note that a different μ_{j,g_j} is used depending on the time to reflect the mobility variance. We also assume that the MGs are sufficiently close to each other, so an EV can move to another MG within the duration of a decision epoch. That is, when an EV is in the movement phase (i.e., h_j = 1), h_j' is always 2. Since consecutive movements do not generally occur, h_j' should always be 0 when h_j = 2. Therefore, P[h_j' | h_j = 1, g_j] and P[h_j' | h_j = 2, g_j] can be represented as

\[
P[h_j' \mid h_j = 1, g_j] =
\begin{cases}
1, & \text{if } h_j' = 2\\
0, & \text{otherwise}
\end{cases} \tag{21}
\]

and

\[
P[h_j' \mid h_j = 2, g_j] =
\begin{cases}
1, & \text{if } h_j' = 0\\
0, & \text{otherwise.}
\end{cases} \tag{22}
\]
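A small sketch of (20)–(22) (ours; mu and tau are passed in for the EV's current MG and the decision-epoch duration):

```python
def phase_transition_prob(h_next, h, mu, tau):
    """P[h' | h, g] following (20)-(22); mu is mu_{j,g_j} for the current MG."""
    if h == 0:
        # (20): the EV starts moving within the epoch with probability mu * tau
        if h_next == 1:
            return mu * tau
        return 1.0 - mu * tau if h_next == 0 else 0.0
    if h == 1:
        # (21): a movement always completes within one decision epoch
        return 1.0 if h_next == 2 else 0.0
    # (22): consecutive movements do not occur, so the EV parks again
    return 1.0 if h_next == 0 else 0.0
```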
Since the estimated/actual demands of the MGs change independently, the differences between their estimated and actual demands also change independently. Therefore, P[D_k' | D_k] = ∏_l P[d_l' | d_l], where P[d_l' | d_l] can be defined in a statistical manner.

E. Reward and Cost Functions

To define the reward and cost functions, we consider both the grid perspective and the EV perspective. In terms of the grid perspective, the autonomy of the MGs can be taken into account. In terms of the EV perspective, the price of electricity and the satisfaction on the charging are considered. Therefore, the total reward function, r(S, A), is defined as

\[ r(S, A) = \omega_1 r_G(S, A) - (1 - \omega_1)\, r_{EV}(S, A) \tag{23} \]

where r_G(S, A) and r_EV(S, A) are the reward functions with respect to the grid and EV perspectives, respectively. In addition, ω1 (0 ≤ ω1 ≤ 1) is a weighting factor that balances r_G(S, A) and r_EV(S, A). The relative importance of r_G(S, A) and r_EV(S, A) can be changed depending on the perspective of either the grid or the EV. For example, if the autonomy of the MG is more important than the satisfaction of the EV owing to a low electricity price, a larger value of ω1 can be set. Meanwhile, if the autonomy of the MG can be achieved without any V2G control, a small value of ω1 can be used.

When the actual and estimated demands of an MG are comparable, high autonomy of the MG can be achieved. Therefore, when the actual demand of the lth MG is lower than its estimated demand (i.e., d_l > 0), charging the EVs located in the lth MG helps improve the autonomy of the MG. In contrast, if the estimated demand of the lth MG is lower than its actual demand (i.e., d_l < 0), the EVs located in the lth MG should be discharged to supply electricity to the MG. Specifically, when the total sum of the charging and discharging volumes in the lth MG is close to d_l, high autonomy of the lth MG can be obtained. Since the total sum of the charging and discharging volumes in the lth MG can be calculated by ∑_j a_j δ(g_j = l) ζτ, where δ(g_j = l) is a delta function that returns 1 if the condition g_j = l is true,6 r_G(S, A) can be defined as

\[ r_G(S, A) = \frac{1}{N_{MG}} \sum_l \exp\left(-\left|\, d_l - \sum_j a_j\, \delta(g_j = l)\, \zeta\tau \,\right|\right) \tag{24} \]

where exp(·) returns a higher value as its argument becomes closer to 0. Note that EVs can be charged or discharged by ζτ over the duration of a decision epoch.

Since the price of electricity and the satisfaction on the charging are considered to define the reward function with respect to the EV perspective, r_EV(S, A) can be expressed as

\[ r_{EV}(S, A) = \omega_2 f_P(S, A) + (1 - \omega_2) f_L(S, A) \tag{25} \]

where f_P(S, A) and f_L(S, A) are the reward functions for the electricity price and the satisfaction on the charging, respectively. Also, ω2 (0 ≤ ω2 ≤ 1) is the weighting factor between f_P(S, A) and f_L(S, A). Note that ω2 can be decided based on the driver's preference. For example, if the driver is sensitive to the price of electricity, a large ω2 is set to weight f_P(S, A). Otherwise, a small ω2 can be used to maximize the satisfaction on the charging.

The electricity price is influenced by the difference between the estimated demand and the actual demand (i.e., d). That is, the electricity price is a non-increasing function of d. Moreover, when an EV is charged (discharged), the EV owner pays (receives) an electricity fee. That is, the electricity price is affected by d_{g_j} and a_j. For example, an EV owner should pay an expensive electricity fee to charge its EV in an energy-scarce jth MG. Meanwhile, if an EV located in the jth MG is discharged, the EV owner can receive a high profit. Then, f_P(S, A) can be described by

\[ f_P(S, A) = \frac{1}{N_{EV}} \sum_j PM(d_{g_j}, a_j) \tag{26} \]

where PM(d, a) is the price model, which can be defined according to the policy of the grid operators.

6 If the condition g_j = l is not true, the delta function returns 0.
If an EV is sufficiently charged while parked, the driver does not need to stop by any charging station during the trip. Otherwise, the driver should visit a charging station to recharge the EV. Since charging is a time-consuming procedure, the latter case degrades the satisfaction of the driver in the V2G system. f_L(S, A) is defined by considering this situation. We assume that the jth EV stops by a charging station and is charged as much as necessary to move to an MG if it does not have sufficient SOC (i.e., c_j = −1). In such a situation, the satisfaction of the jth EV's user is degraded. Therefore, f_L(S, A) can be represented by

\[ f_L(S, A) = \frac{1}{N_{EV}} \sum_j L_j\, \delta(c_j = -1) \tag{27} \]

where L_j denotes the satisfaction degradation degree of the jth EV.
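Putting (23)–(27) together, the total reward can be sketched in Python as follows (ours; it reuses the State container sketched earlier, and price_model stands in for the operator-defined PM(d, a)):

```python
import math

def total_reward(state, action, omega1, omega2, zeta_tau, price_model, L):
    """r(S, A) = w1 * rG - (1 - w1) * rEV, following (23)-(27).

    price_model(d, a): the grid operator's price model PM(d, a) in (26)
    L[j]:              satisfaction degradation degree of the j-th EV in (27)
    """
    n_ev = len(state.soc)
    n_mg = len(state.diff)

    # (24): exp(-|.|) approaches 1 as the net charging volume in MG l approaches d_l
    r_g = sum(
        math.exp(-abs(state.diff[l]
                      - sum(a_j * zeta_tau
                            for a_j, g_j in zip(action, state.mg) if g_j == l)))
        for l in range(n_mg)) / n_mg

    # (26): average electricity price paid (or earned) by the EV owners
    f_p = sum(price_model(state.diff[g_j], a_j)
              for g_j, a_j in zip(state.mg, action)) / n_ev

    # (27): average satisfaction degradation over EVs without sufficient SOC
    f_l = sum(L[j] for j, c_j in enumerate(state.soc) if c_j == -1) / n_ev

    r_ev = omega2 * f_p + (1.0 - omega2) * f_l      # (25)
    return omega1 * r_g - (1.0 - omega1) * r_ev     # (23)
```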
F. Optimality Equation

The optimality equation is given by

\[ v(S) = \max_{A \in \mathcal{A}} \Big[ r(S, A) + \lambda \sum_{S' \in \mathcal{S}} P[S' \mid S, A]\, v(S') \Big] \]

where λ is a discount factor in the MDP model; λ closer to 1 gives more weight to future rewards. The solution of the optimality equation corresponds to the maximum expected total reward and the optimal policy. To solve the optimality equation and obtain the optimal policy, δ, we use a value iteration algorithm, as shown in Algorithm 1, where ‖v‖ = max_{S∈S} v(S).
Algorithm 1 Value Iteration Algorithm

1: Set v^0(S) = 0 for each state S. Specify ε > 0, and set k = 0.
2: For each state S, compute v^{k+1}(S) by
\[ v^{k+1}(S) = \max_{A \in \mathcal{A}} \Big[ r(S, A) + \lambda \sum_{S' \in \mathcal{S}} P[S' \mid S, A]\, v^k(S') \Big] \]
3: If ‖v^{k+1} − v^k‖ < ε(1 − λ)/2λ, go to step 4. Otherwise, increase k by 1 and return to step 2.
4: For each state S ∈ S, compute the stationary optimal policy
\[ \delta(S) = \arg\max_{A \in \mathcal{A}} \Big[ r(S, A) + \lambda \sum_{S' \in \mathcal{S}} P[S' \mid S, A]\, v^{k+1}(S') \Big] \]
and stop.
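For concreteness, a compact Python transcription of Algorithm 1 (ours; it assumes the state and action sets are small enough to enumerate, and that r(S, A) and P[S' | S, A] are available as functions):

```python
def value_iteration(states, actions, reward, trans_prob, lam, eps):
    """Algorithm 1: returns the stationary optimal policy delta and values v.

    reward(s, a)         -> r(S, A)
    trans_prob(s2, s, a) -> P[S' | S, A]
    """
    def backup(s, a, v):
        return reward(s, a) + lam * sum(trans_prob(s2, s, a) * v[s2]
                                        for s2 in states)

    v = {s: 0.0 for s in states}                                   # step 1
    while True:
        v_new = {s: max(backup(s, a, v) for a in actions)          # step 2
                 for s in states}
        gap = max(abs(v_new[s] - v[s]) for s in states)
        v = v_new
        if gap < eps * (1.0 - lam) / (2.0 * lam):                  # step 3
            break
    delta = {s: max(actions, key=lambda a: backup(s, a, v))        # step 4
             for s in states}
    return delta, v
```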
G. Reinforcement Learning (RL)

If P[S' | S, A] cannot be easily obtained for some reason,7 an RL approach, in which an agent (i.e., the controller) learns what action to take by trial and error, can be used; it turns out to be a good solution for deriving a near-optimal policy [27]. In this paper, Q-learning [28] is exploited owing to its simplicity. The agent interacts with its environment over the sequence T_e. The quality function (i.e., Q-value) of a state-action pair, Q(S, A), is defined as the expected long-term discounted reward of state S under policy π. The objective of the Q-learning algorithm is to find an optimal policy π_opt that maximizes the Q-value of each state S, i.e., π_opt = arg max_{A∈A} Q(S, A). To this end, the agent iteratively learns the optimal Q-values without knowledge of P[S' | S, A]. That is, when the agent in state S conducts action A, the agent receives reward r and updates the Q-value of the state-action pair (S, A) as

\[ Q(S, A) \leftarrow (1 - \alpha)\, Q(S, A) + \alpha \Big( r + \lambda \max_{A' \in \mathcal{A}} Q(S', A') \Big) \]

where α (0 < α ≤ 1) is the learning rate and S' is the observed next state.

7 Some EVs may not want to submit their mobility profiles to the controller, and some MGs (aggregators) may not provide their actual/estimated demands due to privacy issues. In these situations, it is not easy to derive P[S' | S, A]. In addition, the exact transition probability P[S' | S, A] cannot be obtained if sufficient statistics are not collected.
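A minimal sketch of the Q-learning loop described above (ours, not the authors' code; the env object with reset()/step() methods, the learning rate alpha, and the epsilon-greedy exploration schedule are assumptions for the sketch; the paper specifies only the update rule):

```python
import random
from collections import defaultdict

def q_learning(env, actions, lam, alpha=0.1, epsilon=0.1, episodes=10_000):
    """Model-free Q-learning: learns Q(S, A) without knowing P[S' | S, A].

    env.reset() -> S and env.step(A) -> (S', r, done) are assumed;
    lam is the MDP discount factor, alpha the learning rate.
    """
    q = defaultdict(float)
    for _ in range(episodes):
        s = env.reset()
        done = False
        while not done:
            # epsilon-greedy exploration: mostly exploit, occasionally explore
            if random.random() < epsilon:
                a = random.choice(actions)
            else:
                a = max(actions, key=lambda x: q[(s, x)])
            s_next, r, done = env.step(a)
            # temporal-difference update toward r + lam * max_a' Q(S', a')
            target = r + lam * max(q[(s_next, x)] for x in actions)
            q[(s, a)] += alpha * (target - q[(s, a)])
            s = s_next
    return q  # the greedy policy is pi_opt(S) = argmax_A Q(S, A)
```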
IV. EVALUATION RESULTS

For the performance evaluation, we compare the proposed scheme, S_MACA, with the following three schemes: 1) S_FC, where EVs are fully charged and do not conduct any discharging actions; 2) S_MC, where the SOCs of EVs are maintained at the minimum level (i.e., EVs are charged or discharged to the minimum level); and 3) S_DBC, where EVs are charged based on the difference between the estimated demand and the actual demand, i.e., EVs located in MGs whose actual demands are lower than their estimated demands are charged, while EVs located in MGs whose actual demands are higher than their estimated demands are discharged (see the sketch below).

The number of MGs is set to three. We assume that the estimated and actual demands change dynamically. The price is proportional to the difference d_{g_j} between the estimated demand and the actual demand in the MG where the jth EV is located. The other default parameter settings are summarized in Table II.
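A sketch of the per-EV decision rules of the three baselines, under our reading of the descriptions above (soc_min and soc_max are illustrative bounds not specified in the excerpt):

```python
def baseline_action(scheme, soc, d_mg, soc_min, soc_max):
    """Per-EV action of the comparison schemes: +1 charge, 0 idle, -1 discharge."""
    if scheme == "FC":   # S_FC: charge toward full, never discharge
        return 1 if soc < soc_max else 0
    if scheme == "MC":   # S_MC: pin the SOC at the minimum level
        return 1 if soc < soc_min else (-1 if soc > soc_min else 0)
    if scheme == "DBC":  # S_DBC: follow the sign of d = ID - AD in the EV's MG
        return 1 if d_mg > 0 else (-1 if d_mg < 0 else 0)
    raise ValueError(f"unknown scheme: {scheme}")
```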
Fig. 3. Effect of ω1 to balance the grid perspective reward function and the EV perspective reward. (a) Expected total reward. (b) Autonomy degree.

TABLE II
DEFAULT PARAMETER SETTINGS
A. Effect of ω1

The effect of the weighting factor ω1, which balances the reward functions with respect to the grid perspective and the EV perspective, on the expected total reward is shown in Figure 3(a). From Figure 3(a), it can be found that S_MACA has the highest expected total reward regardless of ω1. This can be explained as follows. S_MACA chooses the most appropriate action by comprehensively considering the estimated and actual demands of the MGs, the electricity price, and the satisfaction on the charging. In other words, S_MACA actively commands charging actions to EVs located in an energy-abundant MG with a low electricity price. In addition, in S_MACA, EVs are discharged when they are in an energy-scarce MG while the satisfaction on the charging is still considered (i.e., drivers need not stop by a charging station during their trips, because S_MACA does not command excessive discharging actions even when EVs are in an energy-scarce MG). Meanwhile, the other comparison schemes follow fixed actions without consideration of these parameters. Specifically, in S_DBC, EVs located in MGs whose actual demands are lower than their estimated demands are always charged, while EVs located in MGs whose actual demands are higher than their estimated demands are always discharged. This operation does not consider any EV perspectives such as the electricity price and the satisfaction on the charging. Therefore, when the reward with respect to the EV perspective is important (i.e., ω1 is small), the expected total reward of S_DBC can be very low.

Figure 3(b) shows the autonomy degree as a function of ω1. Note that the autonomy degree of S_DBC is normalized to 1. In Figure 3(b), it can be found that S_MACA operates adaptively even when ω1 is changed. Specifically, when ω1 is large (i.e., 0.7 ∼ 0.9), the autonomy degree of S_MACA is larger than 1. This can be explained as follows. A large ω1 represents that the grid perspective reward (i.e., the autonomy of the MG) is more important than the EV perspective reward. In such a situation, S_MACA preferentially commands charging (or discharging) operations to EVs in energy-abundant (or energy-scarce) MGs. Moreover, EVs expected to move to energy-scarce (or energy-abundant) MGs are aggressively charged (or discharged) to efficiently act as energy transporters among different MGs. In so doing, energy can be naturally transported from energy-abundant MGs to energy-scarce MGs, and therefore S_MACA can achieve high MG autonomy. On the other hand, S_MC has the lowest autonomy degree among the comparison schemes. This can be explained as follows. In S_MC, the SOCs of EVs are maintained at the minimum level (i.e., EVs are charged or discharged to the minimum level). Therefore, EVs in S_MC cannot efficiently play the role of energy transporters.

Fig. 4. Effect of satisfaction degradation degree L on the expected number of situations where sufficient SOC is not supported.

B. Effect of L

Figure 4 shows the expected number of situations where sufficient SOC is not supported, E[δ(c = −1)], as a function of L, which represents the satisfaction degradation degree when sufficient SOC is not supported. In this result, ω1 is set to 0.2. From Figure 4, it can be found that E[δ(c = −1)]
[15] H. Liu, Z. Hu, Y. Song, and J. Lin, "Decentralized vehicle-to-grid control for primary frequency regulation considering charging demands," IEEE Trans. Power Syst., vol. 28, no. 3, pp. 3480–3489, Aug. 2013.
[16] J. A. Jardini, C. M. V. Tahan, M. R. Gouvea, S. U. Ahn, and F. M. Figueiredo, "Daily load profiles for residential, commercial and industrial low voltage consumers," IEEE Trans. Power Del., vol. 15, no. 1, pp. 375–380, Jan. 2000.
[17] W. Kempton and J. Tomić, "Vehicle-to-grid power implementation: From stabilizing the grid to supporting large-scale renewable energy," J. Power Sources, vol. 144, no. 1, pp. 268–279, Jun. 2005.
[18] H. Ko, G. Lee, D. Suh, S. Pack, and X. Shen, "An optimized and distributed data packet forwarding in LTE/LTE-A networks," IEEE Trans. Veh. Technol., vol. 65, no. 5, pp. 3462–3473, May 2016.
[19] M. Puterman, Markov Decision Processes: Discrete Stochastic Dynamic Programming. Hoboken, NJ, USA: Wiley, 1994.
[20] E. A. Feinberg and A. Shwartz, Handbook of Markov Decision Processes: Methods and Applications. Norwell, MA, USA: Kluwer, 2002.
[21] X. Hu, S. E. Li, and Y. Yang, "Advanced machine learning approach for lithium-ion battery state estimation in electric vehicles," IEEE Trans. Transport. Electrific., vol. 2, no. 2, pp. 140–149, Jun. 2016.
[22] X. Wu, D. Freese, A. Cabrera, and W. Kitch, "Electric vehicles' energy consumption measurement and estimation," Transp. Res. D, Transp. Environ., vol. 34, no. 1, pp. 52–67, Jan. 2015.
[23] T. Hyodo, D. Watanabe, and M. Wu, "Estimation of energy consumption equation for electric vehicle and its implementation," in Proc. World Conf. Transp. Res. (WCTR), Jul. 2013, pp. 1–12.
[24] T. Guo, A. Ul Quddus, N. Wang, and R. Tafazolli, "Local mobility management for networked femtocells based on X2 traffic forwarding," IEEE Trans. Veh. Technol., vol. 62, no. 1, pp. 326–340, Jan. 2013.
[25] J. Pan and W. Zhang, "An MDP-based handover decision algorithm in hierarchical LTE networks," in Proc. IEEE Veh. Technol. Conf. (VTC-Fall), Sep. 2012, pp. 1–5.
[26] H. Tabrizi, G. Farhadi, and J. Cioffi, "A learning-based network selection method in heterogeneous wireless systems," in Proc. IEEE Global Telecommun. Conf. (GLOBECOM), Dec. 2011, pp. 1–5.
[27] M. E. Helou, M. Ibrahim, S. Lahoud, K. Khawam, D. Mezher, and B. Cousin, "A network-assisted approach for RAT selection in heterogeneous cellular networks," IEEE J. Sel. Areas Commun., vol. 33, no. 6, pp. 1055–1067, Jun. 2015.
[28] C. Watkins and P. Dayan, "Technical note: Q-learning," Mach. Learn., vol. 8, no. 3, pp. 279–292, May 1992.
[29] H. Ko, J. Lee, and S. Pack, "MALM: Mobility-aware location management scheme in femto/macrocell networks," IEEE Trans. Mobile Comput., vol. 16, no. 11, pp. 3115–3125, Nov. 2017.

Haneul Ko received the B.S. and Ph.D. degrees from the School of Electrical Engineering, Korea University, Seoul, South Korea, in 2011 and 2016, respectively. From 2016 to 2017, he was a Post-Doctoral Fellow with the Mobile Network and Communications Laboratory, Korea University. He is currently a Visiting Post-Doctoral Fellow with the University of British Columbia, Vancouver, BC, Canada. He is also with the Smart Quantum Communication Research Center, Korea University. His research interests include 5G networks, mobility management, mobile cloud computing, SDN/NFV, and Future Internet.

Sangheon Pack (SM'11) received the B.S. and Ph.D. degrees in computer engineering from Seoul National University, Seoul, South Korea, in 2000 and 2005, respectively. From 2005 to 2006, he was a Post-Doctoral Fellow with the Broadband Communications Research Group, University of Waterloo, Waterloo, ON, Canada. In 2007, he joined the faculty of Korea University, Seoul, South Korea, where he is currently a Full Professor with the School of Electrical Engineering. His current research interests include Future Internet, softwarized networking (SDN/NFV), information-centric networking/delay-tolerant networking, and vehicular networks. He was the recipient of the IEEE/Institute of Electronics and Information Engineers Joint Award for IT Young Engineers Award 2017, the Korean Institute of Information Scientists and Engineers Young Information Scientist Award 2017, the Korean Institute of Communications and Information Sciences Haedong Young Scholar Award 2013, the LG Yonam Foundation Overseas Research Professor Program in 2012, and the IEEE ComSoc APB Outstanding Young Researcher Award in 2009. He served as a Publicity Co-Chair of IEEE SECON 2012, a Co-Chair of the IEEE VTC 2010-Fall Transportation Track and the IEEE WCSP 2013 Wireless Networking Symposium, a Publication Co-Chair of IEEE INFOCOM 2014 and ACM MobiHoc 2015, and a TPC Chair of EAI Qshine 2016. He is an Editor of the Journal of Communications and Networks and IET Communications, and a Guest Editor of the IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING.

Victor C. M. Leung (S'75–M'89–SM'97–F'03) received the B.A.Sc. (Hons.) and Ph.D. degrees in electrical engineering from the University of British Columbia (UBC) in 1977 and 1982, respectively. He received the APEBC Gold Medal as the Head of the graduating class in the Faculty of Applied Science. He attended the Graduate School, UBC, on a Canadian Natural Sciences and Engineering Research Council Postgraduate Scholarship.

From 1981 to 1987, he was a Senior Member of Technical Staff and a Satellite System Specialist at MPR Teltech Ltd., Canada. In 1988, he was a Lecturer with the Department of Electronics, The Chinese University of Hong Kong. He returned to UBC as a Faculty Member in 1989, and currently holds the positions of a Professor and the TELUS Mobility Research Chair in Advanced Telecommunications Engineering with the Department of Electrical and Computer Engineering. He has co-authored over 1000 journal/conference papers and 39 book chapters, and co-edited 14 book titles. Several of his papers have been selected for best paper awards. His research interests include the broad areas of wireless networks and mobile systems.

Dr. Leung is a Fellow of the Royal Society of Canada, the Engineering Institute of Canada, and the Canadian Academy of Engineering. He is a registered Professional Engineer in the Province of British Columbia, Canada. He was a Distinguished Lecturer of the IEEE Communications Society. He received the IEEE Vancouver Section Centennial Award, the 2011 UBC Killam Research Prize, and the 2017 Canadian Award for Telecommunications Research. He has co-authored papers that received the 2017 IEEE ComSoc Fred W. Ellersick Prize and the 2017 IEEE Systems Journal Best Paper Award. He has served on the editorial boards of the IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS (Wireless Communications Series and Series on Green Communications and Networking), IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, IEEE TRANSACTIONS ON COMPUTERS, IEEE WIRELESS COMMUNICATIONS LETTERS, and the Journal of Communications and Networks. He has guest-edited many journal special issues, and provided leadership to the organizing and technical program committees of numerous conferences and workshops. He is serving on the editorial boards of the IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, IEEE TRANSACTIONS ON CLOUD COMPUTING, IEEE ACCESS, Computer Communications, and several other journals.