Ram Ji
Ram Ji
Abstract
Conventional relays face challenges for transmission lines connected to inverter-based
resources (IBRs). In this article, a single-ended intelligent protection of the transmis-
sion line in the zone between the grid and the PV farm is suggested. The method
employs a fuzzy logic and random forest (RF)-based hybrid system to detect faults
based on combined linear trend attributes of the 3-phase currents. The fault location
is determined and the faulty phase is detected. RF feature selection is used to obtain
the optimal linear trend feature. The performance of the methodology is examined for
abnormal events such as faults, capacitor and load-switching operations simulated in
PSCAD/EMTDC on IEEE 9-bus system obtained by varying various fault and switch-
ing parameters. Additionally, when validating the suggested strategy, consideration
is given to the effects of conditions such as the presence of double circuit lines, PV
capacity, sampling rate, data window length, noise, high impedance faults, CT satura-
tion, compensation devices, evolving and cross-country faults, and far-end and near-end
faults. The findings indicate that the suggested strategy can be used to deal with a
variety of system configurations and situations while still safeguarding such complex
power transmission networks.
Keywords: Linear Trend, Inverter-interfaced Renewable Energy Sources, Fault
Detection, High Impedance Faults, Feature Selection, Random Forest, Photovoltaic
Farms, TCSC
1. INTRODUCTION
The generation of electricity using renewable energy sources (RESs) has radically
grown in recent years. This trend is expected to grow as governments, businesses, and
individuals around the world recognize the environmental and economic advantages of
transitioning away from fossil fuels to cleaner energy sources. The predominant portion
of RESs — solar photovoltaic (PV), Type-III wind farm (WF), and Type-IV WF of-
ten integrate into the grid through high-voltage transmission lines (t-lines) to transmit
power generated at remote sites through a power electronic converter. Integration of
2
protection approach is explored in [18], which uses correlation focusing on the similarity
of the waveforms and the polarity of transient signals from line ends. A distance
protection system based on positive sequence networks independent of resistance and
plant parameters for the t-lines connected to the PV plants is proposed in [19]. A
directional relaying scheme relying on positive sequence components of the fault and
pre-fault voltages and currents for the t-line connected with the PV plant is proposed
in [20].
The Power System Relaying and Control Committee’s Oct. 2023 report emphasizes
that with increased availability of massive volumes of high-fidelity sampled data in tem-
poral and frequency domains, the development and implementation of machine learning
(ML)-based solutions for challenging protection tasks could be successful [21]. Research
articles have also suggested leveraging machine learning (ML) systems to enhance fault
detection and classification operation. The work presented in [21] introduces an in-
telligent protection technique which utilizes an adaptive neuro-fuzzy inference system
to detect, locate, and classify fault types occurring in large-scale grid-connected wind
farms (WFs). A fault identification method based on positive-sequence currents, cou-
pled with an empirical mode decomposition (EMD) accompanied random forest (RF)
is suggested for a TCSC compensated line in [22]. However, the above articles use very
few features and it’s improbable that a single feature will be able to capture every fault
characteristic concerning PVs. Therefore, before applying any learning technique, it’s
vital to assess different features and utilize a feature selection process. The effect of
PV-fed t-lines in infeed conditions on distance protection is analyzed and an impedance
calculation method using SVM is proposed for transmission systems with PV integra-
tion [23]. An ML technique to detect and classify faults in t-lines connected to PV
and WFs is proposed in [24]. However, various scenarios are not taken into account in
these works, including high impedance faults, double circuit line faults, evolving and
cross-country faults, CT saturation, and so on. Also, protection systems that rely on
communication and synchronized data to detect faults have limitations in the event of
communication failures [25].
In this work, a novel single-ended combined linear trend (CLT) based intelligent
protection technique is developed for the protection of lines connected to PV farms
taking into account multiple power system scenarios and a variety of features. The
study’s originality and main contributions are summed up as follows:
• It proposes a fuzzy logic and RF-based hybrid intelligent method to detect and
locate faults. The technique is tested against various conditions simulated by
taking into account several parameters that may affect the fault currents.
• The CLT-based protection is verified for the instances of noise, near-end and
far-end faults, high impedance faults, double circuit lines, CT saturation, cross-
country and evolving faults, TCSC, different sampling frequencies, window sizes,
and change in PV capacity and test system.
• Phasor estimation and sequence component analysis are not required for the sug-
3
gested approach.
• Being single-ended, it is fast, without any data loss or synchronization errors, and
there is no need for a communication medium.
• The fault and transient dataset having more than 28000 cases is uploaded to IEEE
Dataport[26].
4
Table 1: Specifications of the various system components
Fault Events
Fault location f1, f2, f3, f4, f5, f6, f7, f8 (8)
Fault resistance 0.01, 1, 10 Ω (3)
Fault inception angle 0◦ , 60◦ , 120◦ , 180◦ , 240◦ , 300◦ (6)
Fault type ag, ab, ac, abg, acg, abcg, bg, bcg, bc, cg (10)
Priority P and Q (2)
Total fault cases =8 × 3 × 6 × 10 × 2 = 2880
5
Table 3: Simulation parameters & values for other transients.
Non-Fault Events
Switching angle 0◦ to 360◦ in steps of 15◦ (25)
Generator (bus-3) Disconnected/Connected (2)
Location Bus-4, 8, 9 (3)
Load/Capacitor Rating 4
Priority P and Q (2)
Capacitor-switching events = 25 × 2 × 3 × 4 × 2 = 1200
Load-switching events = 25 × 2 × 3 × 4 × 2 = 1200
Total non-fault events = 2400
altering the priority mode, fault resistance, fault type, and fault inception angle. The
generator at bus-3 is turned on and off to examine the effect of infeed in the case of
capacitor and load-switching. Tables 2 and 3 provide the parameters and their values
for simulating faults, load, and capacitor-switching scenarios. A lg fault at location
f1 with fault resistance 0.1 Ω and consisting of the dc decaying and ac components is
shown in Fig. 2.
6
are used for fault detection. Third, line 3P V -9 is tripped if the fault locator identifies
the captured transient currents as an internal fault (location f4 & f5). Fourth, the
faulty phase is detected. Preprocessing the data and extracting features are part of the
first step. RF feature selection is used to rank the linear trend features.
where M = no. of samples in a cycle, and Iϕ = phase current. The ED filter records
the 3-phase current samples starting from the time instant x that satisfies the equation
(2).
ED(x) ≥ γ = 0.06 ∀ ϕ ∈ a, b, c (2)
The threshold γ is determined by applying grey wolf optimization, a metaheuristic
algorithm that explores for the optimal solution, drawing inspiration from the social
hierarchy and hunting behaviors observed in packs of wolves [28]. Wolves are rep-
resented as candidate solutions, and their positions are updated iteratively based on
fitness evaluations using an objective function. The γ value depends on the maximum
fault resistance considered (here 10 Ω). It decreases with an increase in fault resistance.
Steps for identifying the optimal γ involve:
I: Initializing positions randomly with population size=25, dimension of search space=1,
lower limit=0, upper limit=1, and maximum iteration=200
II: Assessing fitness values employing the objective function:
(1 − disturbances detected in 1 cycle
T otal disturbances detected
)
III: Iterating and updating wolf positions according to dominance hierarchy. Assess the
updated positions’ fitness.
IV: Tracking and returning the best solution and fitness value.
7
Gini(n) is Gini impurity at node n, C is number of classes, p(i|n) is probability of
class i at node n. RF feature selection chooses the most significant linear trend feature
(Table 4).
f = mg + b (4)
8
P P
fi − m gi
b= (9)
n
Once these values are obtained the linear equation f = mg + b is used to predict
the values of f for any given g. The linear trend line will represent the “best fit” line
through the data points, minimizing the overall distance between the actual values and
the expected values.
Pearson’s correlation coefficient (Pearson’s r) can then be obtained for the data
points using: P
(gi − g)(fi − f )
r = qP P (10)
(gi − g)2 (fi − f )2
where gi and fi are the individual data points , g and f are the means of the variables
g and f , respectively. The summation is over all data points in the dataset.
Combined linear trend (CLT): In addition to SLT attributes, the CLT attributes
are also evaluated. For time series values aggregated across segments, CLT performs
9
a linear least-squares regression. The same attributes: “p-value”, “correlation coeffi-
cient”, “intercept”, “slope”, and “standard error” are extracted with the segment size
varied between 5, 10, and 50. The number of time series values in each segment is
determined by the segment size. For a segment size of 10, the number of segments is
12.8 ≈ 13 (1 cycle = 128 data points), thus reducing the data points from 128 to 13.
Maximum, minimum, mean, or variance of time series values in a segment is used to
get the combined value.
The list of SLT and CLT features evaluated using RF feature selection is listed in
Table 4. CLTs with attribute: ‘Pearson’s r’, segment size: 50, and f : mean for the
3-phase currents are thus chosen after feature selection. The importance of SLT and
CLT are shown in Fig. 4. In determining the feature importance of (say SLT phase a)
from a tree, the process involves first computing the importance specifically for nodes
where the split occurred due to feature SLT phase a, then dividing it by the total
feature importance of all nodes. The RF feature importance is derived by averaging
the importances across all trees. It is apparent from the figure that the CLT is a more
effective feature for identifying faults in t-lines connecting PV farms.
The feature calculation steps for CLT are displayed in Fig. 5. First, the one-cycle
phase currents are split into N = 50 segments. Second, the data of each segment is
aggregated over the mean in a single data point per segment to reduce the number of
measurement points to the number of segments. Third, the acquired data points are
then used to create a linear least-squares regression line. Fourth, the Pearson correlation
coefficient is utilized to describe the data points. Here, the feature is named correlation
coefficient r which is dimensionless.
where q̂ is predicted class label for the input sample p, NT is total number of decision
trees, Tj (p) is predicted class label by the j-th decision tree. The indicator function
I returns 1 in the case that the condition included in parenthesis is true and 0 in the
other case.
10
Table 5: Fault Detection perfor-
mance with chosen five traditional
ML algorithms
11
(14,400)
(14,2)
0.96 0.96
0.94 0.94
0.92 0.92
Accuracy
Accuracy
0.9 0.90
0.88 0.88
0.86 0.86
0.84
0.98
(400,2)
0.98
0.95 0.95
0.92
Accuracy
0.92
0.89
0.86
0.89
0.86
fuzzy sets interact to produce outputs. Genetic Algorithm is used to fine-tune these pa-
rameters of the trapezoidal membership functions of inputs (CLTs of 3-phase currents)
and output for optimal performance (Fig. 6). The fuzzy inference system customized
using a data-driven methodology is used to find the faults.
In research articles on power system protection, RF, SVM, DT, kNN, and NB have
shown promising outcomes. The classifiers take as inputs the 3-phase CLTs that were
chosen using RF feature selection. Table 5 shows the η̄ for different classifiers with
SMOTE and without SMOTE analysis for fault detection. RF outperforms the SVM,
DT, kNN, and NB. The hyperparameters for these classifiers are optimized using grid
search. It is observed that the use of SMOTE didn’t influence the results considerably.
RF classifier gives the best η̄ of 98.0% without SMOTE (2880 faults and 2400 non-
faults) and η̄ of 98.2% with SMOTE (2880 faults and 2880 non-faults). The optimal
hyperparameters of RF are obtained with grid search on n estimators = [400, 800, 1200,
1600, 2000, 2400, 2800, 3200, 3600, 4000], min splits = [2, 3, 4, 5, 6, 7, 8, 9, 10], and
max depth =[2, 4, 6, 8, 10, 12, 14, 16]. The best hyperparameter (n estimators = 400,
min splits = 2, max depth = 14) obtained is depicted in the 3D surface plot in Fig.
7. The 3D surface plots help understand the relationship between any two out of the
three RF hyperparameters and model performance in the hyperparameter optimiza-
tion. The higher accuracy of RF compared to other algorithms can be attributed to
ensemble learning, feature importance, robustness to overfitting, and ability to handle
high-dimensional and non-linear data.
t-SNE [34] plot is used to visualize the high-dimensional data in a lower-dimensional
space while preserving the local structure and relationships between data points (cor-
relation coefficient of CLTs) of faults, load-switching, and capacitor-switching events
(Fig. 8). It reveals clusters of capacitor and load-switching transients with scattered
clusters of fault data having higher variability. The separation of fault and switching
transient clusters visible in the plot makes it easier for the ML classifiers to differentiate
12
Figure 9: Spread of CLT features for 0.1 Ω and 10 Ω fault resistances at location f4
on 60 fault cases each considering different fault types and fault times
them.
It’s essential that the suggested algorithm works well for various fault resistances.
Box plots offer a clear and detailed depiction of a dataset’s distribution encapsulating
key data points – the minimum, first quartile, median, third quartile, and maximum –
providing a comprehensive view of the data’s spread, central tendency, skewness, and
potential outliers. The side by side comparisons of each feature for 0.1Ω and 10Ω using
Violin plots which provide the shape of the distribution in addition to the information
from boxplots are shown in Fig. 9. It is evident that the faults with 0.1Ω resistance
have slightly more variability or spread (width of IQR and length of whiskers) than
10Ω. However, the density plots are similar. This overall similarity in density and box
plots for all three features suggests that a classifier would be effective in distinguishing
fault cases with different resistances from switching transients.
Classifier
SVM RF DT kNN NB
Accuracy(η̄)
Fault location (%) 80.5 92.3 88.3 89.2 69.5 Figure 10: Confusion/Error Matrix for
Phase selection (%) 97.0 97.2 85.3 88.5 68.4 phase selection
13
4.2. Fault location
Once a transient is identified as a fault, the proposed method ascertains the region
of fault. Eight places are used to simulate these faults, with internal faults at f4 and
f5. Faults at locations f1, f2, f3 are considered as backward external faults, and f6, f7,
and f8 as forward external faults. Table 6 shows the η̄ for different classifiers with RF
outperforming the others with a η̄ of 92.3% on 2880 (see Table 2) fault cases.
Figure 11: 3D tSNE plot showing the CLT coefficients for faults with CT saturation
and switching transients
14
Figure 12: Pair plot for the 3-phase CLT features
15
5.2. Effect of Change in Farm Capacity
By changing the PV units from 400 (base value) to 300 and 500, the PV system’s
capacity is altered, and the suggested method is evaluated. The technique recognized
the faults and switching transients with η̄ of 96.0% and 96.4% on 300 and 500 units
respectively using 2400 no-fault transients and 2160 faults simulated by changing pri-
ority (P and Q), fault resistances (0.01, 1, and 10 ohms), fault types (10), and fault
inception angles (6) at locations f2, f5, and f6.
Table 7: Effect of noise in the 3- Table 8: Effect of sampling rate of
phase currents the 3-phase currents
16
5.4. Effect of Sampling Frequency and Window Length
The relay’s ability to operate quickly and reliably can be impacted by the frequency
of data sampling and length of the data window. High sampling frequencies capture
more detailed information about the power system’s behavior, including fast transients
and high-frequency disturbances. Low sampling frequencies may miss critical events
and nuances in the system’s behavior, potentially leading to false alarms or delayed
responses. However, a high sampling rate generates larger volumes of data, which
can be challenging to store, process, and transmit in real time. Therefore, a trade-off
between data processing and accuracy is necessary. Multiple kilohertz (kHz) are used to
sample the 3-phase relay currents in order to assess the effects of sampling frequency.
The quantity of samples utilized to train the RF classifier influences the suggested
method. It is observed that the CLT-based scheme suffers when the sampling rate is
reduced below 5.76 kHz (Table 8).
The optimal data window size depends on the characteristics of transients and the
desired trade-offs between temporal resolution and computational efficiency. The im-
pact of the data window is examined using window sizes of half, one, and two cycles.
η̄s of 98.0%, 98.0%, and 97.2% are obtained correspondingly, showing the scheme’s
resilience to change in window size.
17
Figure 15: Spread of CLT features for different compensating devices
2400 switching transients are simulated. The proposed method correctly recognized the
transients and faults with a η̄ of 96%. Distribution of CLTs for cases with compensation
with series capacitor, no compensation, and compensation with TCSC are shown in Fig.
15. 180 fault cases for each of these 3 scenarios at location f6 are considered. The series
capacitor and TCSC boxplots are more skewed than the no compensator boxplots.
18
Figure 17: 2D tSNE plots of CLT features for different scenarios
variable resistors to connect two anti-parallel DC sources is used. The model for HIF
employed is illustrated in Fig. 16. The dynamic arc is modeled by the two variable re-
sistors, the diodes control the current direction, and the asymmetry in the fault currents
is modeled by the varying DC sources. V ph > V n in positive half cycle, V ph < V n
in negative half cycle, and when V n < V ph < V p current is zero. For the purpose of
simulating the 370 HIFs at fault locations f4, f5, f6, f7, and f8 with P and Q priority for
37 fault inception angles, the lg fault in phase-a with fault resistances between 50 ohms
and 300 ohms obtained arbitrarily every two milliseconds is taken into consideration.
The suggested method successfully distinguished HIFs from other switching transients,
achieving a η̄ of 96%. The t-SNE plot for the 3 CLT features plotted on a 2D plane
shows clusters of similar data points, with the distances between these points reflecting
their relationships in the original 3-dimensional space. The overlaps in 2D t-SNE plot
happens due to the inherent loss of information when reducing dimensions from 3 to 2
(Fig. 17a).
19
Q priority modes for fault resistances (0.01, 1, and 10 ohms), fault types (10), and
fault inception angles (21). Fig. 17b shows the CLT features for faults and switching
transients.
20
Figure 19: IEEE 39-bus with PV at bus-9
6. Comparative Evaluation
This section illustrates the conduct of the conventional distance relay connected to
a PV farm, while also highlighting the lack of comprehensiveness of results from recent
works to establish the effectiveness of the proposed algorithm in this article.
The impedance plane associated with traditional distance relay 3P V 9, near bus 3P V ,
demonstrates the fault behavior. It’s evident that the distance relay exhibits unreliable
responses (Fig. 20). For instance, during an ag fault occurring at location f5 (refer
Fig. 1) within zone 1, the relay remains inactive in case 1 (PV units = 300, Rf = 10Ω,
priority = P) but it triggers in case 2 (PV units = 500, Rf = 1Ω, priority = P). It
remains dormant for a high impedance ag fault at location f4 (zone 1) in case 3 (PV
units = 400, priority = Q). Also, it stays inactive for an ab fault at location f5 in case 4
21
Figure 20: Impedance trajectories for AG and AB components of the distance relay at
CTP V for faults in zone 1
(PV units = 300, Rf = 10Ω, priority = P). The procedure for obtaining the impedance
trajectory is described in [16].
Additionally, Table 9 compares current publications concentrating on criteria, with
particular emphasis on the impact of different transient conditions on the proposed
techniques. This evaluation aims to assess the comprehensiveness of these proposed
techniques. While the methodologies have produced favorable results under some par-
ticular scenarios, performance analysis in many significant situations appears to be
lacking. The aforementioned publications also touch on the proposed algorithms’ exe-
cution times. For double-ended protection, the communication paths between the two
terminals can introduce a time delay of up to 6.87ms, according to IEEE standard
[40]. The proposed technique being single-ended is free from this delay. The intelligent
protection system based on CLT depends on data preparation and inference time. The
event detector takes 0.001ms and the calculation of CLT takes 0.01ms. The fuzzy and
the ML systems work in parallel taking 1.8ms and 0.5ms respectively to test new in-
puts. Hence, net running time considering 1-cycle data = 16.67ms + 1.8ms + 0.01ms
+ 0.001ms = 18.5ms.
7. CONCLUSION
The attributes of dependability, security, selectivity, robustness, and speed should
be present in a protective system. However, the dependability for internal faults and
security for external faults and other transients may get challenged in the case of trans-
mission lines connected to bulk PV farms. The suggested combined linear trend-based
hybrid intelligent protection method is dependable for internal faults, cross-country
22
Table 9: Comparison of recently published articles.
Proposed
Reference [17] [20] [19] [22]
method
signed +tive seq. CLT,
Technique used impedance EMD,RF
correlation network Fuzzy,RF
Signals used i v&i v&i i i
Single or double end double single single single single
System freq.(Hz), Samp. freq.(kHz) 60,1.2 60,1 60,1.2 50,1 60,7.84
Model includes WF PV PV WF PV
FACTS used - - - TCSC TCSC
Time delay (ms) ˜16 16.67 - 8 18.5
Effect of HIF - - yes yes yes
Effect of Noise yes yes yes yes yes
Effect of Double ckt. lines - - - - yes
Effect of farm capacity yes - - - yes
Effect of cross-country faults - - - - yes
Effect of evolving faults - yes yes - yes
Effect of Sampling freq. - - - - yes
Effect of data window - - - yes yes
Effect of CT saturation - yes yes yes yes
Effect of load switching yes yes - yes yes
Effect of capacitor switching yes - - yes yes
Effect of near-end faults - yes - yes yes
faults, evolving faults, double circuit line faults; and responsive to low current levels in
high impedance faults. It provides security for capacitor and load-switching, and exter-
nal fault with CT saturation events. It locates the faults correctly and is selective. It
is resilient to changes in data window size, capacity of the PV, noise in measurements,
change in the test system, and the presence of TCSC. However, the performance of the
model is impacted by the sampling rate of the phase currents. The combined linear
trend-based intelligent approach has been successfully evaluated in the IEEE-9 bus sys-
tem across a wide range of fault parameters and operating scenario variations. It has
been found that the linear combined trend-based decision-making system provides a
thorough protection for transmission lines connected to bulk photovoltaic farms while
being dependable, secure, accurate, and quick. Furthermore, it solely utilizes locally
measured data, eliminating the requirement for a remote-end communication device.
The proposed method can be applied in real time using edge computing for power
system protection with careful planning, cybersecurity measures, and integration with
existing infrastructure, thereby enhancing the speed, reliability, and scalability of pro-
tection functions.
References
[1] R. Bansal, Power System Protection in Smart Grid Environment, CRC Press, New York,
USA, 2019. doi:10.1201/9780429401756.
23
[2] A. K. Singh, I. Hussain, B. Singh, Double-stage three-phase grid-integrated solar PV
system with fast zero attracting normalized least mean fourth based adaptive control,
IEEE Trans. Ind. Electron. 65 (5) (2018) 3921–3931. doi:10.1109/TIE.2017.2758750.
[3] S. Singh, P. K. Nayak, S. Sarangi, S. Biswas, Improved protection scheme for high voltage
transmission lines connecting large-scale solar PV plants, in: 22nd National Power Sys-
tems Conference (NPSC), 2022, pp. 118–123. doi:10.1109/NPSC57038.2022.10069457.
[5] Y. Liang, W. Li, W. Zha, Adaptive mho characteristic-based distance protection for lines
emanating from photovoltaic power plants under unbalanced faults, IEEE Syst. J. 15 (3)
(2021) 3506–3516. doi:10.1109/JSYST.2020.3015225.
[6] R. Chowdhury, N. Fischer, Transmission line protection for systems with inverter-based
resources – part i: Problems, IEEE Trans. Power Del. 36 (4) (2021) 2416–2425. doi:
10.1109/TPWRD.2020.3019990.
[9] Q. Huang, K. Li, Y. Li, R. Fan, A. Wang, G. Zhang, Y. Sun, Adaptability analysis
of traditional differential protection applied to lines connected to PV, in: IEEE 5th
Advanced Information Management, Communicates, Electronic and Automation Control
Conference (IMCEC), Vol. 5, 2022, pp. 1819–1822. doi:10.1109/IMCEC55388.2022.
10019865.
[10] Z. Yang, K. Jia, Y. Fang, Z. Zhu, B. Yang, T. Bi, High-frequency fault component-based
distance protection for large renewable power plants, IEEE Trans. Power Electron. 35 (10)
(2020) 10352–10362. doi:10.1109/TPEL.2020.2978266.
[12] D. S. Kumar, Impact analysis of distributed generators on the protection and stability
of power systems, Ph.D. thesis, National University of Singapore (2017).
24
[14] M. Kavi, Y. Mishra, M. Vilathgamuwa, Challenges in high impedance fault detection due
to increasing penetration of photovoltaics in radial distribution feeder, in: IEEE Power
& Energy Society General Meeting, 2017, pp. 1–5. doi:10.1109/PESGM.2017.8274658.
[15] Y. Fang, K. Jia, Z. Yang, Y. Li, T. Bi, Impact of inverter-interfaced renewable energy
generators on distance protection and an improved scheme, IEEE Trans. Ind. Electron.
66 (9) (2019) 7078–7088. doi:10.1109/TIE.2018.2873521.
[17] A. Saber, M. Shaaban, H. Zeineldin, A new differential protection algorithm for trans-
mission lines connected to large-scale wind farms, Int. J. Elect. Power Energy Syst. 141
(2022) 108220. doi:https://doi.org/10.1016/j.ijepes.2022.108220.
[18] K. Jia, Y. Li, Y. Fang, L. Zheng, T. Bi, Q. Yang, Transient current similarity based
protection for wind farm transmission lines, Applied Energy 225 (2018) 42–51. doi:
https://doi.org/10.1016/j.apenergy.2018.05.012.
[19] A. Ghorbani, H. Mehrjerdi, Distance protection with fault resistance compensation for
lines connected to PV plant, Int. J. Elect. Power Energy Syst. 148 (2023) 108976. doi:
https://doi.org/10.1016/j.ijepes.2023.108976.
[21] O. Noureldeen, I. Hamdan, Design of robust intelligent protection technique for large-
scale grid-connected wind farm, Protection and Control of Modern Power Systems 3 (17)
(2018) 1–13. doi:10.1186/s41601-018-0090-4.
[22] S. Biswas, P. K. Nayak, A new approach for protecting tcsc compensated transmission
lines connected to DFIG-based wind farm, IEEE Trans. Ind. Inform. 17 (8) (2021) 5282–
5291. doi:10.1109/TII.2020.3029201.
[25] A. Chowdhury, S. Paladhi, A. K. Pradhan, Local positive sequence component based pro-
tection of series compensated parallel lines connecting solar photovoltaic plants, Electric
Power Systems Research 225 (2023) 109811. doi:https://doi.org/10.1016/j.epsr.
2023.109811.
25
[26] P. Bera, S. Pani, C. Isik, R. Bansal, Transients in transmission lines connected to pho-
tovoltaic farms (dataset) (2023). doi:https://dx.doi.org/10.21227/scrs-bs27.
[27] E. Muljadi, M. Singh, V. Gevorgian, User guide for PV dynamic model simulation written
on pscad platform, Tech. rep., National Renewable Energy Lab.(NREL), Golden, CO
(United States) (2014).
[29] S. R. Pani, P. K. Bera, V. Kumar, Detection and classification of internal faults in power
transformers using tree based classifiers, in: 2020 IEEE International Conference on
Power Electronics, Drives and Energy Systems (PEDES), 2020, pp. 1–6. doi:10.1109/
PEDES49360.2020.9379641.
[30] P. K. Bera, C. Isik, V. Kumar, Discrimination of internal faults and other transients
in an interconnected system with power transformers and phase angle regulators, IEEE
Syst. J. 15 (3) (2021) 3450–3461. doi:10.1109/JSYST.2020.3009203.
[31] P. K. Bera, C. Isik, Identification of stable and unstable power swings using pattern
recognition, in: 2021 IEEE Green Technologies Conference (GreenTech), 2021, pp. 286–
291. doi:10.1109/GreenTech48523.2021.00053.
[33] N. V. Chawla, et al., SMOTE: Synthetic minority over-sampling technique, J. Artif. Int.
Res. 16 (1) (2002) 321–357.
[34] L. van der Maaten, G. Hinton, Visualizing data using t-SNE, Journal of Machine Learning
Research 9 (2008) 2579–2605.
[37] Q. Cui, K. El-Arroudi, Y. Weng, A feature selection method for high impedance fault
detection, IEEE Trans. Power Del. 34 (3) (2019) 1203–1215. doi:10.1109/TPWRD.2019.
2901634.
[38] Y. Zhang, J. Liang, Z. Yun, X. Dong, A new fault-location algorithm for series-
compensated double-circuit transmission lines based on the distributed parameter model,
IEEE Trans. Power Del. 32 (6) (2017) 2398–2407. doi:10.1109/TPWRD.2016.2626476.
26
[39] V. Ashok, A. Yadav, A. Y. Abdelaziz, Modwt-based fault detection and classification
scheme for cross-country and evolving faults, Electr. Power Syst. Res. 175 (2019) 105897.
doi:https://doi.org/10.1016/j.epsr.2019.105897.
[40] IEEE Guide for Application of Digital line Current Differential Relays using Digital Com-
munication, IEEE Std C37.243-2015 (2015) 1–72doi:10.1109/IEEESTD.2015.7181615.
27