ST Nov-Dec 2023
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Dremio 16.qxd 01-<strong>Dec</strong>-23 10:58 AM Page 3<br />
MANAGEMENT: DATA ARCHITECTURE<br />
"<strong>Dec</strong>oupling compute and storage in public clouds is<br />
more straightforward to administer and relatively<br />
inexpensive. Besides, these compute and storage cloud<br />
services are virtually unlimited in scalability, eliminating<br />
legacy hardware procurement issues. They also offer<br />
supreme levels of availability and performance."<br />
procurement overheads. Mass adoption of<br />
separating compute and storage only became<br />
feasible with public cloud computing.<br />
<strong>Dec</strong>oupling compute and storage in public<br />
clouds is more straightforward to administer<br />
and relatively inexpensive. Besides, these<br />
compute and storage cloud services are<br />
virtually unlimited in scalability, eliminating<br />
legacy hardware procurement issues. They also<br />
offer supreme levels of availability and<br />
performance. Therefore, the separation of<br />
compute from data brings forth three<br />
immediate benefits:<br />
A significant reduction in complicated and<br />
expensive data copies and movements as<br />
the data warehouse as the sole source of<br />
truth gets replaced by accessing data in<br />
open formats in the data lake, eliminating<br />
data silos.<br />
Open data standards and formats provide<br />
universal data access from infinite services<br />
and applications, creating the freedom to<br />
pick the best solutions.<br />
An open architecture ensures that future<br />
cloud services can directly access the data,<br />
avoiding going through a data warehouse<br />
vendor's proprietary format or<br />
moving/copying data from the data<br />
warehouse.<br />
THE OPPORTUNITIES OF OPEN<br />
ARCHITECTURE<br />
Cloud data warehouse providers enticed firms<br />
with the allure of scalability and cost-efficiency<br />
that was unsustainable with on-premises<br />
solutions. However, after uploading their data<br />
into the warehouse, organisations were<br />
restricted entirely to the vendor's ecosystem or<br />
denied access to other promising technologies<br />
that could extract more value from their data.<br />
Open architecture is a significant advantage<br />
of cloud data lake/lakehouse over the data<br />
warehouse. As a result, organisations are<br />
reassessing their strategies to use an open<br />
architecture that promotes flexibility and reestablishes<br />
ownership of their data. This shift<br />
signifies three things:<br />
The flexibility to utilise various superior<br />
services and engines on the company's<br />
data. This allows the use of diverse<br />
technologies like superior SQL, Databricks<br />
or any other data-processing tool. Given<br />
that companies have numerous use cases<br />
and requirements, utilising the best-suited<br />
tool yields higher productivity - especially for<br />
data teams - and lower cloud costs. It's also<br />
important to remember that no single<br />
vendor can offer all the processing<br />
capabilities a company requires.<br />
Not being confined to one vendor. Platform<br />
changes become profoundly challenging<br />
when dealing with a data warehouse<br />
holding up to a million tables and hundreds<br />
of complex ingestion pipelines.<br />
Comparatively, if an organisation uses a<br />
superior SQL on its cloud data lake today<br />
and a new tool emerges tomorrow, it's<br />
possible to query the existing data with the<br />
new system without migrating it.<br />
The ability to benefit from future<br />
technological advancements. Avoiding<br />
becoming locked-in is crucial, as it keeps<br />
vendors from exploiting a company<br />
financially. But more significant is the<br />
capacity to adopt and benefit from<br />
emerging technology, even if the current<br />
vendor remains favourable. If a superior<br />
machine learning service or a better batch<br />
processing engine is invented, organisations<br />
can have peace of mind that they can use<br />
the tool freely.<br />
Application architectures have demonstrated<br />
that a service-oriented approach allows<br />
maximum scale, flexibility, and agility. While<br />
separating compute and storage marked an<br />
essential first step in reducing analytic costs, it<br />
doesn't offer the kind of benefits visible in<br />
modern application architectures. However, by<br />
disengaging compute from data, the benefits of<br />
application design can now be used for data<br />
analytics, especially given the critical<br />
importance of data for all businesses.<br />
As a result, open data architecture brings forth<br />
many benefits, from flexibility, independence,<br />
and future-proofing to creating new avenues<br />
for gaining valuable business insights. In the<br />
rapidly evolving digital era, embracing open<br />
data architectures is more than a strategic<br />
choice; it's a decisive move towards a more<br />
flexible, scalable, and insightful future.<br />
More info: www.dremio.com<br />
www.storagemagazine.co.uk<br />
@<strong>ST</strong>MagAndAwards <strong>Nov</strong>/<strong>Dec</strong> <strong>2023</strong><br />
<strong>ST</strong>ORAGE<br />
MAGAZINE<br />
17