27.03.2014 Views

SEKE 2012 Proceedings - Knowledge Systems Institute

SEKE 2012 Proceedings - Knowledge Systems Institute

SEKE 2012 Proceedings - Knowledge Systems Institute

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

The investigation described in this paper focuses on<br />

forecasting fault events in a distribution grid utilizing historical<br />

data on fault events that occurred in the grid and their<br />

associated electrical values, data on weather conditions at the<br />

time of t he fault, and the type of grid infrastructure, as shown<br />

in the shaded areas in Fig. 1. Although large storms can cause<br />

considerable damage to distribution grids, “relatively normal”<br />

weather conditions can also have a significant impact on faults<br />

in the distribution grid, as shown by the results of this project.<br />

Lu et al. [11] discuss the influence that changing climatic<br />

conditions and weather have on the wear of electric equipment.<br />

Their analyses indicate that during the hot summer months,<br />

when the load on a feeders increase to 60-70%, there is an<br />

over-charge in the lines and assets, which can result in<br />

increased number of faults and reduced voltage of the<br />

distributed power.<br />

III. DATA COLLECTION AND PRE-PROCESSING<br />

The results described in this investigation are associated<br />

with the study of a utility in the US whose identity and name<br />

cannot be revealed due to confidentiality agreements. Hence,<br />

the utility in this study will be referred to as Investor Owned<br />

Utility (IOU).<br />

Several types of historical datasets associated with the IOU<br />

were collected and utilized during this investigation. The<br />

historical dataset types utilized in this investigation include: (a)<br />

fault data and electrical values from the IOU; (b) weather data;<br />

and (c) infrastructure type of the IOU. Fig. 2 shows the dataset<br />

types and their associated data attributes. The fault data from<br />

the IOU was collected utilizing an automated system developed<br />

at ABB. This system consists of in telligent electronic devices<br />

(IED’s) with sensing and analytic capabilities located at the end<br />

of the feeders of distribution lines. These IED’s monitor<br />

electrical values from the distribution lines, and are able to<br />

detect a fault event in the grid soon after it occurs. The<br />

historical fault data utilized includes these electrical values,<br />

which were corroborated with data entries documented by IOU<br />

engineers after restoring service. The weather data was<br />

collected from the US National Weather Service (NWS) and<br />

from the WeatherBug (WBUG) weather services. The NWS<br />

data was collected by their weather station every five minutes<br />

in METAR format. The WeatherBug data were collected from<br />

small weather stations located in various locations close to the<br />

different substations of the IOU. Finally, the lightning data<br />

were also obtained from the WeatherBug weather services<br />

organization. Fig. 1 has three gray rectangles that show the<br />

factors utilized to develop the machine learning algorithms<br />

discussed in this paper. It is e xpected that future studies will<br />

include data associated with the remaining factors in Fig. 1, to<br />

increase the precision and accuracy of the forecasted results.<br />

An essential aspect of any data mining activity is preparing<br />

the data to be util ized for the analyses, or "data preprocessing".<br />

A software utility was developed in this project to<br />

automate the pre-processing of the raw datasets, to generate the<br />

data warehouse used to extract data for analyses, and to<br />

populate the forecasted results. The data pre-processing utility<br />

contains rules that address: (a) weather-fault time zone<br />

alignments; (b) weather-fault distances; (c) weather direction;<br />

and (d) information in free-format comments in the fault<br />

events, among others. Fig. 2 shows a d etailed list of the data<br />

attributes utilized to run the prediction models presented in<br />

Section V.<br />

Figure 2. Primary data attributes utilized by machine learning algorithms<br />

Data pre-processing involved cleaning of the fault, weather,<br />

and lightning data; aligning all of the datasets based on both<br />

time and geographic location; and fusing the various datasets<br />

together. A total of 1725 fault events were obtained over a twoyear<br />

period, across eight feeders in four substations (two<br />

feeders per substation). These feeders range from a few miles<br />

apart to 10 miles apart. The main tasks in cleaning and preprocessing<br />

fault data included generation of “mineable”<br />

parameters from the fault comment texts, which described the<br />

equipment involved in the fault and the type of problem (e.g.<br />

animal contact). The fault infrastructure coding in the fault<br />

events was supplemented with information on the power grid<br />

topology, e.g. which feeders were almost entirely underground<br />

(UG) or overhead (OH). Five-minute weather observation data<br />

were obtained from NWS for one airport within 9-25 miles of<br />

the four substations, with 93% completeness (about 100,000<br />

observation records per year). Hourly weather observation data<br />

were obtained from WeatherBug for four local “Earth<br />

Networks” weather stations within varying distances from the<br />

four substations at IOU (about 8000 observation records per<br />

year for a single weather station).<br />

Preprocessing the five-minute airport weather data required<br />

several transformation steps: first, to translate the coded<br />

“METAR” strings to their equivalent text and numeric<br />

parameters; then, to parse and group the weather conditions<br />

into mineable nominal parameters. The hourly WeatherBug<br />

data contained somewhat different parameters than the fiveminute<br />

METAR data (e.g., it included sunlight level readings<br />

459

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!