You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@crail.apache.org by pe...@apache.org on 2018/12/04 14:48:48 UTC
[18/20] incubator-crail-website git commit: Publishing from 4358827d5a1090bd3324979e1e4af05b4512548f

http://git-wip-us.apache.org/repos/asf/incubator-crail-website/blob/0fc17baa/content/blog/2017/11/crail-metadata.html
----------------------------------------------------------------------
diff --git a/content/blog/2017/11/crail-metadata.html b/content/blog/2017/11/crail-metadata.html
new file mode 100644
index 0000000..5df113a
--- /dev/null
+++ b/content/blog/2017/11/crail-metadata.html
@@ -0,0 +1,570 @@
+<!DOCTYPE html>
+<html>
+    <head>
+        <meta charset="utf-8">
+        <title>The Apache Crail (Incubating) Project: Crail Storage Performance -- Part III: Metadata</title>
+        <meta name="viewport" content="width=device-width, initial-scale=1.0">
+        <link href="//crail.incubator.apache.org/css/bootstrap.min.css" rel="stylesheet">
+        <link href="//crail.incubator.apache.org/css/group.css" rel="stylesheet">
+        <link rel="alternate" type="application/atom+xml" title="Atom"
+            href="//crail.incubator.apache.org/blog/blog.xml">
+        
+        <meta property="og:image" content="//crail.incubator.apache.org/img/blog/preview/crail-metadata-summary.png" />
+        <meta property="og:image:secure_url" content="//crail.incubator.apache.org/img/blog/preview/crail-metadata-summary.png" />
+    </head>
+
+    <body>
+        <div class="container">
+          <div class="header">
+            <ul class="nav nav-pills pull-right">
+              
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/">
+                    Home
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/overview/">
+                    Overview
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/download/">
+                    Downloads
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/blog/">
+                    Blog
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/community/">
+                    Community
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/documentation/">
+                    Documentation
+                  </a>
+                </li>
+              
+            </ul>
+            <a href="//crail.incubator.apache.org/">
+                <img src="//crail.incubator.apache.org/img/crail_logo.png"
+                    srcset="//crail.incubator.apache.org/img/crail_logo.png"
+                    alt="Crail" id="logo">
+            </a>
+          </div>
+
+          
+          
+          <h2>Crail Storage Performance -- Part III: Metadata</h2>   
+          
+
+          <p class="meta">21 Nov 2017,  </p>
+
+<div class="post">
+<div style="text-align: justify">
+<p>
+This is part III of our series of posts discussing Crail's raw storage performance. This part is about Crail's metadata performance and scalability.
+</p>
+</div>
+
+<h3 id="hardware-configuration">Hardware Configuration</h3>
+
+<p>The specific cluster configuration used for the experiments in this blog:</p>
+
+<ul>
+  <li>Cluster
+    <ul>
+      <li>8 node x86_64 cluster</li>
+    </ul>
+  </li>
+  <li>Node configuration
+    <ul>
+      <li>CPU: 2 x Intel(R) Xeon(R) CPU E5-2690 0 @ 2.90GHz</li>
+      <li>DRAM: 96GB DDR3</li>
+      <li>Network: 1x100Gbit/s Mellanox ConnectX-5</li>
+    </ul>
+  </li>
+  <li>Software
+    <ul>
+      <li>Ubuntu 16.04.3 LTS (Xenial Xerus) with Linux kernel version 4.10.0-33-generic</li>
+      <li>Crail 1.0, internal version 2993</li>
+    </ul>
+  </li>
+</ul>
+
+<h3 id="crail-metadata-operation-overview">Crail Metadata Operation Overview</h3>
+
+<div style="text-align: justify"> 
+<p>
+As described in <a href="//crail.incubator.apache.org/blog/2017/08/crail-memory.html">part I</a>, Crail data operations are composed of actual data transfers and metadata operations. Examples of metadata operations are operations for creating or modifying the state of a file, or operations to lookup the storage server that stores a particular range (block) of a file. In Crail, all the metadata is managed by the namenode(s) (as opposed to the data which is managed by the storage nodes). Clients interact with Crail namenodes via Remote Procedure Calls (RPCs). Crail supports multiple RPC protocols for different types of networks and also offers a pluggable RPC interface so that new RPC bindings can be implemented easily. On RDMA networks, the default DaRPC (<a href="https://dl.acm.org/citation.cfm?id=2670994">DaRPC paper</a>, <a href="http://github.com/zrlio/darpc">DaRPC GitHub</a>) based RPC binding provides the best performance. The figure below gives an overview of the Crail metadat
 a processing in a DaRPC configuration. 
+</p>
+</div>
+
+<div style="text-align:center"><img src="//crail.incubator.apache.org/img/blog/crail-metadata/rpc.png" width="480" /></div>
+<p><br /></p>
+
+<div style="text-align: justify"> 
+<p>
+Crail supports partitioning of metadata across several namenods. Thereby, metadata operations issued by clients are hashed to a particular namenode depending on the name of object the operation attempts to create or retrieve. With the DaRPC binding, RPC messages are exchanged using RDMA send/recv operations. At the server, RPC processing is parallelized across different cores. To minimize locking and cache contention, each core handles a disjoint set of client connections. Connections assigned to the same core share the same RDMA completion queue which is processed exclusively by that given core. All the network queues, including send-, recv- and completion queues are mapped into user-space and accessed directly from within the JVM process. Since Crail offers a hierarchical storage namespace, metadata operations to create, delete or rename new storage resources effectively result in modifications to a tree-like data structure at the namenode. These structural operations require a so
 mewhat more expensive locking than the more lightweight operations used to lookup the file status or to extend a file with a new storage block. Consequently, Crail namenodes use two separate data structures to manage metadata: (a) a basic tree data structure that requires directory-based locking, and (b) a fast lock-free map to lookup of storage resources that are currently being read or written.
+</p>
+</div>
+
+<h3 id="experimental-setup">Experimental Setup</h3>
+
+<div style="text-align: justify"> 
+<p>
+In two of the previous blogs (<a href="//crail.incubator.apache.org/blog/2017/08/crail-memory.html">DRAM</a>, <a href="//crail.incubator.apache.org/blog/2017/08/crail-nvme-fabrics-v1.html">NVMf</a>) we have already shown that Crail metadata operations are very low latency. Essentially a single metadata operation issued by a remote client takes 5-6 microseconds, which is only slightly more than the raw network latency of the RDMA network fabric. In this blog, we want to explore the scalability of Crail's metadata management, that is, the number of clients Crail can support, or how Crail scales as the cluster size increases. The level of scalability of Crail is mainly determined by the number of metadata operations Crail can process concurrently, a metric that is often referred to as IOPS. The higher the number of IOPS the system can handle, the more clients can concurrently use Crail without performance loss. 
+</p>
+<p>
+An important metadata operation is ''getFile()'', which is used by clients to lookup the status of a file (whether the file exists, what size it has, etc.). The ''getFile()'' operation is served by Crail's fast lock-free map and in spirit is very similar to the ''getBlock()'' metadata operation (used by clients to query which storage nodes holds a particular block). In a typical Crail use case, ''getFile()'' and ''getBlock()'' are responsible for the peak metadata load at a namenode. In this experiment, we measure the achievable IOPS on the server side in an artificial configuration with many clients distributed across the cluster issuing ''getFile()'' in a tight loop. Note that the client side RPC interface in Crail is asynchronous, thus, clients can issue multiple metadata operations without blocking while asynchronously waiting for the result. In the experiments below, each client may have a maximum of 128 ''getFile()'' operations outstanding at any point in time. In a practical 
 scenario, Crail clients may also have multiple metadata operations in flight either because clients are shared by different cores, or because Crail interleaves metadata and data operations (see <a href="//crail.incubator.apache.org/blog/2017/08/crail-memory.html">DRAM</a>). What makes the benchmark artificial is that clients exclusively focus on generating load for the namenode and thereby are neither performing data operations nor are they doing any compute. The basic command of the benchmark as executed by each of the individual clients is given by the following command:
+</p>
+</div>
+<div class="highlighter-rouge"><div class="highlight"><pre class="highlight"><code>./bin/crail iobench -t getMultiFileAsync -f / -k 10000000 -b 128
+</code></pre></div></div>
+<div style="text-align: justify"> 
+<p>
+Where ''-t'' specifies the benchmark to run, ''-f'' specifies the path on the
+Crail file system to be used for the benchmark, ''-k'' specifies the number of
+iterations to be performed by the benchmark
+(how many times will the benchmark execute ''getFile()'') and
+''-b'' specifies the maximum number of requests in flight.
+</p>
+</div>
+
+<h3 id="single-namenode-scalability">Single Namenode Scalability</h3>
+
+<div style="text-align: justify"> 
+<p>
+In the first experiment, we measure the aggregated number of metadata operations a single Crail namenode can handle per second. The namenode runs on 8 physical cores with hyper-threading disabled. The result is shown in the first graph below, labeled ''Namenode IOPS''. The namenode only gets saturated with more than 16 clients. The graph shows that the namenode can handle close to 10 million ''getFile()'' operations per second. With significantly more clients, the overall number of IOPS drops slightly, as more resources are being allocated on the single RDMA card, which basically creates a contention on hardware resources.
+</p>
+<p> 
+As comparison, we measure the raw number of IOPS, which can be executed on the RDMA network. We measure the raw number using ib_send_bw. We configured ib_send_bw with the same parameters in terms of RDMA configuration as the namenode. This means, we instructed ib_send_bw not to do CQ moderation, and to use a receive queue and a send queue of length 32, which equals the length of the namenode queues. Note that the default configuration of ib_send_bw uses CQ moderation and does preposting of send operations, which can only be done, if the operation is known in advance. This is not the case in a real system, like crail's namenode. The basic ib_send_bw command is given below:
+</p>
+</div>
+<div class="highlighter-rouge"><div class="highlight"><pre class="highlight"><code>ib_send_bw -s 1 -Q 1 -r 32 -t 32 -n 10000000
+</code></pre></div></div>
+<div style="text-align: justify"> 
+<p>
+Where ''-s 1'' specifies to send packets with a payload of 1 (we don't want to
+measure the transmission time of data, just the number of I/O operations),
+''-Q 1'' specifies not to do CQ moderation, ''-r 32'' specifies the receive
+queue length to be 32, ''-t 32'' specifies the send queue length to be 32
+and ''-n'' specifies the number of
+iterations to be performed by ib_send_bw.
+</p>
+</div>
+<div style="text-align: justify"> 
+<p>
+The line of the raw number of IOPS, labeled ''ib send'' is shown in the same graph. With this measurement we show that Crail's namenode IOPS are similar to the raw ib_send_bw IOPS with the same configuration.
+</p>
+</div>
+<p><br /></p>
+<div style="text-align:center"><img src="//crail.incubator.apache.org/img/blog/crail-metadata/namenode_ibsend_iops64.svg" width="550" /></div>
+<p><br /></p>
+<div style="text-align: justify"> 
+<p>
+If one starts ib_send_bw without specifying the queue sizes or whether or not to use CQ moderation, the raw number of IOPS might be higher. This is due to the fact, that the default values of ib_send_bw use a receive queue of 512, a send queue of 128 and CQ moderation of 100, meaning that a new completion is generated only after 100 sends. As comparison, we did this
+measurement too and show the result, labeled 'ib_send CQ mod', in the same graph. Fine tuning of receive and send queue sizes, CQ moderation size, postlists etc might lead to a higher number of IOPS. 
+</p>
+</div>
+
+<h3 id="multiple-namenode-scalability">Multiple Namenode Scalability</h3>
+
+<div style="text-align: justify"> 
+<p>
+To increase the number of IOPS the overall system can handle, we allow starting multiple namenode instances. Hot metadata operations, such as ''getFile()'', are distributed over all running instances of the namenode. ''getFile()'' is implemented such that no synchronization among the namenodes is required. As such, we expect good scalability. The graph below compares the overall IOPS of a system with one namenode to a system with two namenodes and four namenodes.
+</p>
+</div>
+<p><br /></p>
+<div style="text-align:center"><img src="//crail.incubator.apache.org/img/blog/crail-metadata/namenode_multi64.svg" width="550" /></div>
+<p><br /></p>
+
+<div style="text-align: justify"> 
+<p>
+We show in this graph that the system can handle around 17Mio IOPS with two namenodes and 28Mio IOPS with four namenodes (with more than 64 clients we measured the number of IOPS to be slightly higher than 30Mio IOPS). Having multiple namenode instances matters especially with a higher number of clients. In the graph we see that the more clients we have the more we can benefit from a second namenode instance or even more instances.
+</p>
+</div>
+
+<div style="text-align: justify"> 
+<p>
+We only have 7 physical nodes available to run the client processes. This
+means, after 7 client processes, processes start sharing a physical machine.
+With 64 client processes, each machine runs 9 (10 in one case) client
+instances, which share the cores and the resources of the RDMA hardware.
+We believe this is the reason, why the graphs appear not to scale linearly.
+The number of total IOPS is client-bound, not namenode-bound.
+With more physical machines, we believe that scalability could be shown
+much better. Again, there is absolutely no communication among the
+namenodes happening, which should lead to linear scalability.
+</p>
+</div>
+
+<h3 id="cluster-sizes">Cluster sizes</h3>
+
+<div style="text-align: justify"> 
+<p>
+Let us look at a concrete application, which ideally runs on a large cluster:
+TeraSort. In a previous blog, <a href="//crail.incubator.apache.org/blog/2017/01/sorting.html">sorting</a>,
+we analyze performance characteristics of TeraSort on Crail on a big cluster
+of 128 nodes, where we run 384 executors in total. This already proves that
+Crail can at least handle 384 clients. Now we analyze the theoretical number
+of clients without performance loss at the namenode. Still this theoretical
+number is not a hard limit on the number of clients. Just adding more
+clients would start dropping the number of IOPS per client (not at the
+namenode).
+</p>
+</div>
+
+<div style="text-align: justify"> 
+<p>
+In contrast to the benchmarks above, a real-world application, like TeraSort,
+does not issue RPC requests in a tight loop. It rather does sorting
+(computation), file reading and writing and and of course a certain amount of
+RPCs to manage the files.
+</p>
+</div>
+
+<div style="text-align: justify"> 
+<p>
+We would like to know how many RPCs a run of TeraSort generates and therefore
+how big the load in terms of number of IOPS is at the namenode for a
+real-world application.
+We run TeraSort on a data set of 200GB and measured the
+number of IOPS at the namenode with 4 executors, 8 executors and 12 executors.
+Every executor runs 12 cores. For this experiment, we use a single namenode
+instance. We plot the distribution of the number of IOPS measured at the
+namenode over the elapsed runtime of the TeraSort application.
+</p>
+</div>
+
+<p><br /></p>
+<div style="text-align:center"><img src="//crail.incubator.apache.org/img/blog/crail-metadata/terasort_iops.svg" width="550" /></div>
+<p><br /></p>
+
+<div style="text-align: justify"> 
+<p>
+From the graph we pick the peak number of IOPS measured
+throughout the execution time for all three cases. The following table
+shows the three peak IOPS numbers:
+</p>
+</div>
+
+<p><br /></p>
+<center>
+<table>
+  <thead>
+    <tr>
+      <th>#Executor nodes</th>
+      <th>Measured IOPS</th>
+      <th>% of single namenode</th>
+    </tr>
+  </thead>
+  <tbody>
+    <tr>
+      <td align="right">4</td>
+      <td align="right">32k</td>
+      <td align="right">0.32%</td>
+    </tr>
+    <tr>
+      <td align="right">8</td>
+      <td align="right">67k</td>
+      <td align="right">0.67%</td>
+    </tr>
+    <tr>
+      <td align="right">12</td>
+      <td align="right">107k</td>
+      <td align="right">1.07%</td>
+    </tr>
+  </tbody>
+</table>
+</center>
+<p><br /></p>
+
+<div style="text-align: justify"> 
+<p>
+From this table we see that it scales linearly. Even more important,
+we notice that with 12 nodes we still use only around 1% of the
+number of IOPS a single namenode can handle.
+If we extrapolate this to a
+100%, we can handle a cluster size of almost 1200 nodes (1121 clients being just
+below 10Mio IOPS at the namenode). The
+extrapolated numbers would look like this:
+</p>
+</div>
+
+<p><br /></p>
+<center>
+<table>
+  <thead>
+    <tr>
+      <th>#Namenodes</th>
+      <th>Max IOPS by  namenodes</th>
+      <th>#Executor nodes</th>
+      <th>Extrapolated IOPS</th>
+      <th>% of all namenodes</th>
+    </tr>
+  </thead>
+  <tbody>
+    <tr>
+      <td align="right">1</td>
+      <td align="right">10000k</td>
+      <td align="right">1121</td>
+      <td align="right">9996k</td>
+      <td align="right">99.96%</td>
+    </tr>
+    <tr>
+      <td align="right">1</td>
+      <td align="right">10000k</td>
+      <td align="right">1200</td>
+      <td align="right">10730k</td>
+      <td align="right">107.3%</td>
+    </tr>
+    <tr>
+      <td align="right">2</td>
+      <td align="right">17000k</td>
+      <td align="right">1906</td>
+      <td align="right">16995k</td>
+      <td align="right">99.97%</td>
+    </tr>
+    <tr>
+      <td align="right">4</td>
+      <td align="right">30000k</td>
+      <td align="right">3364</td>
+      <td align="right">29995k</td>
+      <td align="right">99.98%</td>
+    </tr>
+</tbody>
+</table>
+</center>
+<p><br /></p>
+
+<div style="text-align: justify"> 
+<p>
+Of course we know that there is no system with perfect linear scalability.
+But even if we would loose 50% of the number of IOPS (compared to the
+theoretical maximum) on a big cluster, Crail could still handle a cluster size
+of 600 nodes and a single namenode without any performance loss at the
+namenode.
+Should we still want to run an application like TeraSort on a bigger cluster,
+we can add a second namenode or have even more instances of namenodes
+to ensure that clients do not suffer from contention in terms of IOPS at
+the namenode.
+</p>
+</div>
+
+<div style="text-align: justify">
+<p>
+We believe that the combination of benchmarks above, the scalability
+experiments and the real-world
+application of TeraSort shows clearly that Crail and Crail's namenode can handle
+a big cluster of at least several hundreds of nodes, theoretically up to
+1200 nodes with a single namenode and even more with multiple namenodes.
+</p>
+</div>
+
+<h3 id="system-comparison">System comparison</h3>
+<div style="text-align: justify">
+<p>
+In this section we compare the number of IOPS Crail can handle to
+two other systems:
+<a href="http://hadoop.apache.org/">Hadoop's HDFS namenode</a> and
+<a href="https://ramcloud.atlassian.net/wiki/spaces/RAM/overview">RAMCloud</a>.
+</p>
+</div>
+
+<div style="text-align: justify">
+<p>
+HDFS is a well known distributed file system. Like Crail, HDFS runs
+a namenode and several datanodes. The namenode implements similar functionality
+as Crail's namenode, while HDFS's datanodes provide additional functionality,
+like replication, for example. We are interested in the
+number of IOPS the namenode can handle. As such, the datanode's functionality
+is not relevant for this experiment. HDFS is implemented in Java like Crail.
+Due to this high similarity in terms of functionality and language used to
+implement the system, HDFS is a good candidate to compare Crail to.
+</p>
+</div>
+
+<div style="text-align: justify">
+<p>
+HDFS does not use RDMA to send RPCs. Instead, RPCs are sent over a regular
+IP network. In our case, it is the same 100Gbit/s ethernet-based RoCE network.
+</p>
+</div>
+
+<div style="text-align: justify">
+<p>
+To measure the number of IOPS HDFS's namenode can handle, we run the same
+experiment as for Crail. The clients issue a ''getFile()'' RPC to the
+namenode and we vary the number of clients from 1 to 64. The following
+plot shows the number of IOPS relative to the number of clients.
+</p>
+</div>
+
+<p><br /></p>
+<div style="text-align:center"><img src="//crail.incubator.apache.org/img/blog/crail-metadata/namenode_hdfs_iops.svg" width="550" /></div>
+<p><br /></p>
+
+<div style="text-align: justify">
+<p>
+The graph shows that the namenode can handle around 200000 IOPS. One reason
+for the difference to the number of IOPS of Crail is surely that HDFS does not
+use the capabilities offered by the RDMA network, while Crail does. However
+this cannot be the only reason, why the namenode cannot handle more than
+200000 IOPS. We would need to analyze more deeply where the bottleneck is
+to find an answer. We believe that the amount of code which
+gets executed at probably various layers of the software stack
+is too big to achieve high performance in terms of IOPS.
+</p>
+</div>
+
+<div style="text-align: justify">
+<p>
+RAMCloud is a fast key-value store, which makes use of the RDMA network
+to reach low latency and high throughput. It runs one master coordinator and
+and optionally several slave coordinators, which can take over, if the master
+coordinator fails. Coordinator persistence can be achieved
+by external persistent storage, like Zookeeper or LogCabin.
+RAMCloud runs several storage servers, which
+store key-value pairs in RAM. Optionally, replicas can be stored on secondary
+storage, which provides persistence. RAMCloud is implemented in C++. Therefore
+it is natively compiled code.
+</p>
+</div>
+
+<div style="text-align: justify">
+<p>
+We are interested in the number of IOPS RAMCloud can handle. We decided
+to run the readThroughput benchmark of RAMCloud's ClusterPerf program, which
+measures the number of object reads per second. This is probably the closest
+benchmark to the RPC benchmark of Crail and HDFS.
+</p>
+</div>
+
+<div style="text-align: justify">
+<p>
+For a fair comparison, we run RAMCloud without any persistence, so without
+Zookeeper and without replicas to secondary storage. We run one coordinator
+and one storage server, which is somewhat similar to running one namenode
+in the Crail and HDFS cases. Also, we wanted to vary the number of clients
+from 1 to 64. At the moment we can only get results for up to 16 clients.
+We asked the RAMCloud developers for possible reasons and got to know that the
+reason is a starvation bug in the benchmark (not in the RAMCloud system
+itself). The RAMCloud developers are looking into this issue. We will update
+the blog with the latest numbers as soon as the bug is fixed.
+</p>
+</div>
+
+<p><br /></p>
+<div style="text-align:center"><img src="//crail.incubator.apache.org/img/blog/crail-metadata/ramcloud_iops.svg" width="550" /></div>
+<p><br /></p>
+
+<div style="text-align: justify">
+<p>
+RAMCloud reaches a peak of 1.12Mio IOPS with 14 clients. The utilization of the
+dispatcher thread is at 100% already with 10 clients. Even with more clients,
+the number of IOPS won't get higher than 1.12Mio, because the
+dispatcher thread is the bottleneck, as can be seen in the graph.
+In addition, we got a confirmation from the developers that more than
+10 clients will not increase the number of IOPS.
+So we think that the measurements are not unfair, even if we do not have
+results for more than 16 clients. Again, we we will update the blog
+with a higher number of clients, as soon as the bug is fixed.
+</p>
+</div>
+
+<div style="text-align: justify">
+<p>
+Let us now summarize the number of IOPS of all three systems in one plot
+below. For a fair comparison, Crail runs only one namenode for this
+experiments and we compare the results to RAMCloud with one coordinator and
+one storage server (without replication as described above) and the one
+namenode instance of HDFS. We see that Crail's single namenode can handle
+a much bigger number of RPCs compared to the other two systems (remember
+that Crail can run multiple namenodes and we measured a number of IOPS
+of 30Mio/s with 4 namenodes).
+</p>
+</div>
+
+<p><br /></p>
+<div style="text-align:center"><img src="//crail.incubator.apache.org/img/blog/crail-metadata/max_iops_crail_hdfs_ramcloud.svg" width="550" /></div>
+<p><br /></p>
+
+<div style="text-align: justify">
+<p>
+HDFS is deployed on production clusters and handles real workloads
+with roughly 200000 IOPS. We believe that Crail, which can handle a much
+bigger number of IOPS, is able to run real workloads on very large
+clusters. A common assumption is that Java-based implementations suffer from
+performance loss. We show that a Java-based system can handle a high amount
+of operations even compared to a C++-based system like RAMCloud.
+</p>
+</div>
+
+<h3 id="summary">Summary</h3>
+
+<div style="text-align: justify"> 
+<p>
+In this blog we show three key points of Crail: First, Crail's namenode performs the same as ib_send_bw with realistic parameters in terms of IOPS. This shows that the actual processing of the RPC is implemented efficiently. Second, with only one namenode, Crail performs 10x to 50x better than RAMCloud and HDFS, two popular systems, where RAMCloud is RDMA-based and implemented natively. Third, Crail's metadata service can be scaled out to serve large number of clients. We have shown that Crail offers near linear scaling with up to 4 namenodes, offering a performance that is sufficient to serve several 1000s of clients. 
+</p>
+</div>
+
+
+</div>
+
+<!-- 
+
+<div id="disqus_thread"></div>
+<script>
+
+/**
+*  RECOMMENDED CONFIGURATION VARIABLES: EDIT AND UNCOMMENT THE SECTION BELOW TO INSERT DYNAMIC VALUES FROM YOUR PLATFORM OR CMS.
+*  LEARN WHY DEFINING THESE VARIABLES IS IMPORTANT: https://disqus.com/admin/universalcode/#configuration-variables*/
+/*
+var disqus_config = function () {
+this.page.url = PAGE_URL;  // Replace PAGE_URL with your page's canonical URL variable
+this.page.identifier = PAGE_IDENTIFIER; // Replace PAGE_IDENTIFIER with your page's unique identifier variable
+};
+*/
+(function() { // DON'T EDIT BELOW THIS LINE
+var d = document, s = d.createElement('script');
+s.src = '//crail-io.disqus.com/embed.js';
+s.setAttribute('data-timestamp', +new Date());
+(d.head || d.body).appendChild(s);
+})();
+</script>
+<noscript>Please enable JavaScript to view the <a href="https://disqus.com/?ref_noscript">comments powered by Disqus.</a></noscript>
+
+-->
+
+
+        <br>
+	<br> 
+          <div class="footer">
+            <p>Apache Crail is an effort undergoing <a href="https://incubator.apache.org/">incubation</a> at <a href="https://www.apache.org/">The Apache Software Foundation (ASF)</a>, sponsored by the Apache Incubator PMC. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF.
+            </p>
+          </div>
+
+        </div> <!-- /container -->
+
+        <!-- Support retina images. -->
+        <script type="text/javascript"
+            src="//crail.incubator.apache.org/js/srcset-polyfill.js"></script>
+    </body>
+</html>

http://git-wip-us.apache.org/repos/asf/incubator-crail-website/blob/0fc17baa/content/blog/2017/11/floss.html
----------------------------------------------------------------------
diff --git a/content/blog/2017/11/floss.html b/content/blog/2017/11/floss.html
new file mode 100644
index 0000000..7bdd6cc
--- /dev/null
+++ b/content/blog/2017/11/floss.html
@@ -0,0 +1,93 @@
+<!DOCTYPE html>
+<html>
+    <head>
+        <meta charset="utf-8">
+        <title>The Apache Crail (Incubating) Project: Floss</title>
+        <meta name="viewport" content="width=device-width, initial-scale=1.0">
+        <link href="//crail.incubator.apache.org/css/bootstrap.min.css" rel="stylesheet">
+        <link href="//crail.incubator.apache.org/css/group.css" rel="stylesheet">
+        <link rel="alternate" type="application/atom+xml" title="Atom"
+            href="//crail.incubator.apache.org/blog/blog.xml">
+        
+        <meta property="og:image" content="//crail.incubator.apache.org/img/blog/preview/floss-summary.png" />
+        <meta property="og:image:secure_url" content="//crail.incubator.apache.org/img/blog/preview/floss-summary.png" />
+    </head>
+
+    <body>
+        <div class="container">
+          <div class="header">
+            <ul class="nav nav-pills pull-right">
+              
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/">
+                    Home
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/overview/">
+                    Overview
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/download/">
+                    Downloads
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/blog/">
+                    Blog
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/community/">
+                    Community
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/documentation/">
+                    Documentation
+                  </a>
+                </li>
+              
+            </ul>
+            <a href="//crail.incubator.apache.org/">
+                <img src="//crail.incubator.apache.org/img/crail_logo.png"
+                    srcset="//crail.incubator.apache.org/img/crail_logo.png"
+                    alt="Crail" id="logo">
+            </a>
+          </div>
+
+          
+          
+          <h2>Floss</h2>   
+          
+
+          <p>Crail features in the <a href="https://twit.tv/shows/floss-weekly/episodes/458?autostart=false">FLOSS weekly podcast</a></p>
+
+
+        <br>
+	<br> 
+          <div class="footer">
+            <p>Apache Crail is an effort undergoing <a href="https://incubator.apache.org/">incubation</a> at <a href="https://www.apache.org/">The Apache Software Foundation (ASF)</a>, sponsored by the Apache Incubator PMC. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF.
+            </p>
+          </div>
+
+        </div> <!-- /container -->
+
+        <!-- Support retina images. -->
+        <script type="text/javascript"
+            src="//crail.incubator.apache.org/js/srcset-polyfill.js"></script>
+    </body>
+</html>

http://git-wip-us.apache.org/repos/asf/incubator-crail-website/blob/0fc17baa/content/blog/2017/11/iops.html
----------------------------------------------------------------------
diff --git a/content/blog/2017/11/iops.html b/content/blog/2017/11/iops.html
new file mode 100644
index 0000000..5b91957
--- /dev/null
+++ b/content/blog/2017/11/iops.html
@@ -0,0 +1,93 @@
+<!DOCTYPE html>
+<html>
+    <head>
+        <meta charset="utf-8">
+        <title>The Apache Crail (Incubating) Project: Iops</title>
+        <meta name="viewport" content="width=device-width, initial-scale=1.0">
+        <link href="//crail.incubator.apache.org/css/bootstrap.min.css" rel="stylesheet">
+        <link href="//crail.incubator.apache.org/css/group.css" rel="stylesheet">
+        <link rel="alternate" type="application/atom+xml" title="Atom"
+            href="//crail.incubator.apache.org/blog/blog.xml">
+        
+        <meta property="og:image" content="//crail.incubator.apache.org/img/blog/preview/iops-summary.png" />
+        <meta property="og:image:secure_url" content="//crail.incubator.apache.org/img/blog/preview/iops-summary.png" />
+    </head>
+
+    <body>
+        <div class="container">
+          <div class="header">
+            <ul class="nav nav-pills pull-right">
+              
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/">
+                    Home
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/overview/">
+                    Overview
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/download/">
+                    Downloads
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/blog/">
+                    Blog
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/community/">
+                    Community
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/documentation/">
+                    Documentation
+                  </a>
+                </li>
+              
+            </ul>
+            <a href="//crail.incubator.apache.org/">
+                <img src="//crail.incubator.apache.org/img/crail_logo.png"
+                    srcset="//crail.incubator.apache.org/img/crail_logo.png"
+                    alt="Crail" id="logo">
+            </a>
+          </div>
+
+          
+          
+          <h2>Iops</h2>   
+          
+
+          <p>New blog <a href="//crail.incubator.apache.org/blog/2017/11/crail-metadata.html">post</a> about Crail’s metadata performance and scalability</p>
+
+
+        <br>
+	<br> 
+          <div class="footer">
+            <p>Apache Crail is an effort undergoing <a href="https://incubator.apache.org/">incubation</a> at <a href="https://www.apache.org/">The Apache Software Foundation (ASF)</a>, sponsored by the Apache Incubator PMC. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF.
+            </p>
+          </div>
+
+        </div> <!-- /container -->
+
+        <!-- Support retina images. -->
+        <script type="text/javascript"
+            src="//crail.incubator.apache.org/js/srcset-polyfill.js"></script>
+    </body>
+</html>

http://git-wip-us.apache.org/repos/asf/incubator-crail-website/blob/0fc17baa/content/blog/2017/11/rdmashuffle.html
----------------------------------------------------------------------
diff --git a/content/blog/2017/11/rdmashuffle.html b/content/blog/2017/11/rdmashuffle.html
new file mode 100644
index 0000000..816139f
--- /dev/null
+++ b/content/blog/2017/11/rdmashuffle.html
@@ -0,0 +1,181 @@
+<!DOCTYPE html>
+<html>
+    <head>
+        <meta charset="utf-8">
+        <title>The Apache Crail (Incubating) Project: Spark Shuffle: SparkRDMA vs Crail</title>
+        <meta name="viewport" content="width=device-width, initial-scale=1.0">
+        <link href="//crail.incubator.apache.org/css/bootstrap.min.css" rel="stylesheet">
+        <link href="//crail.incubator.apache.org/css/group.css" rel="stylesheet">
+        <link rel="alternate" type="application/atom+xml" title="Atom"
+            href="//crail.incubator.apache.org/blog/blog.xml">
+        
+        <meta property="og:image" content="//crail.incubator.apache.org/img/blog/preview/rdmashuffle-summary.png" />
+        <meta property="og:image:secure_url" content="//crail.incubator.apache.org/img/blog/preview/rdmashuffle-summary.png" />
+    </head>
+
+    <body>
+        <div class="container">
+          <div class="header">
+            <ul class="nav nav-pills pull-right">
+              
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/">
+                    Home
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/overview/">
+                    Overview
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/download/">
+                    Downloads
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/blog/">
+                    Blog
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/community/">
+                    Community
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/documentation/">
+                    Documentation
+                  </a>
+                </li>
+              
+            </ul>
+            <a href="//crail.incubator.apache.org/">
+                <img src="//crail.incubator.apache.org/img/crail_logo.png"
+                    srcset="//crail.incubator.apache.org/img/crail_logo.png"
+                    alt="Crail" id="logo">
+            </a>
+          </div>
+
+          
+          
+          <h2>Spark Shuffle: SparkRDMA vs Crail</h2>   
+          
+
+          <p class="meta">17 Nov 2017,  <mark>this is a blog post from a user of the Crail project.</mark>  </p>
+
+<div class="post">
+<div style="text-align: justify">
+<p>
+This blog is comparing the shuffle performance of Crail with SparkRDMA, an alternative RDMA-based shuffle plugin for Spark.
+</p>
+</div>
+
+<h3 id="hardware-configuration">Hardware Configuration</h3>
+
+<p>The specific cluster configuration used for the experiments in this blog:</p>
+
+<ul>
+  <li>Cluster
+    <ul>
+      <li>8 compute + 1 management node x86_64 cluster</li>
+    </ul>
+  </li>
+  <li>Node configuration
+    <ul>
+      <li>CPU: 2 x Intel(R) Xeon(R) CPU E5-2690 0 @ 2.90GHz</li>
+      <li>DRAM: 96GB DDR3</li>
+      <li>Network: 1x100Gbit/s Mellanox ConnectX-5</li>
+    </ul>
+  </li>
+  <li>Software
+    <ul>
+      <li>Ubuntu 16.04.3 LTS (Xenial Xerus) with Linux kernel version 4.10.0-33-generic</li>
+      <li><a href="https://github.com/zrlio/crail">Crail 1.0</a>, commit a45c8382050f471e9342e1c6cf25f9f2001af6b5</li>
+      <li><a href="">Crail Shuffle plugin</a>, commit 2273b5dd53405cab3389f5c1fc2ee4cd30f02ae6</li>
+      <li><a href="https://github.com/Mellanox/SparkRDMA">SparkRDMA</a>, commit d95ce3e370a8e3b5146f4e0ab5e67a19c6f405a5 (latest master on 8th of November 2017)</li>
+    </ul>
+  </li>
+</ul>
+
+<h3 id="overview">Overview</h3>
+<div style="text-align: justify">
+<p>
+Lately there has been an increasing interest in the community to include RDMA networking into data processing frameworks like Spark and Hadoop. One natural spot to integrate RDMA is in the shuffle operation that involves all-to-all network communication pattern. Naturally, due to its performance requirements the shuffle operation is of interest to us as well, and we have developed a Spark plugin for shuffle. In our previous blog posts, we have already shown that the Crail Shuffler achieves great workload-level speedups compared to vanilla Spark. In this blog post, we take a look at another recently proposed design called <a href="https://github.com/Mellanox/SparkRDMA">SparkRDMA</a> (<a href="https://issues.apache.org/jira/browse/SPARK-22229">SPARK-22229 JIRA</a>). SparkRDMA proposes to improve the shuffle performance of Spark by performing data transfers over RDMA. For this, the code manages its own off-heap memory which needs to be registered with the NIC for RDMA use. It supports 
 two ways to store shuffle data between the stages: (1) shuffle data is stored in regular files (just like vanilla Spark) but the data transfer is implemented via RDMA, (2) data is stored in memory (allocated and registered for RDMA transfer) and the data transfer is implemented via RDMA. We call it the "last-mile" approach where just the networking operations are replaced by the RDMA operations.
+</p>
+<p>
+In contrast, the Crail shuffler plugin takes a more holistic approach and leverages the high performance of Crail distributed data store to deliver gains. It uses Crail store to efficiently manage I/O resources, storage and networking devices, memory registrations, client sessions, data distribution, etc. Consequently, the shuffle operation becomes as simple as writing and reading files. And recall that Crail store is designed as a fast data bus for the intermediate data. The shuffle operation is just one of many operations that can be accelerated using Crail store. Beyond these operations, the modular architecture of Crail store enables us to seamlessly leverage different storage types (DRAM, NVMe, and more), perform tiering, support disaggregation, share inter-job data, jointly optimize I/O resources for various workloads, etc. These capabilities and performance gains give us confidence in the design choices we made for the Crail project.
+</p>
+</div>
+
+<h3 id="performance-comparison">Performance comparison</h3>
+<div style="text-align: justify">
+<p>Lets start by quantitatively assessing performance gains from the Crail shuffle plugin and SparkRDMA. As described above, SparkRDMA can be operated in two different modes. Users decide which mode to use by selecting a particular type of shuffle writer (spark.shuffle.rdma.shuffleWriterMethod). The Wrapper shuffle writer writes shuffle data to files between the stages, the Chunked shuffle writer stores shuffle data in memory. We evaluate both writer methods for terasort and SQL equijoin.
+</p>
+</div>
+<div style="text-align:center"><img src="//crail.incubator.apache.org/img/blog/rdma-shuffle/terasort.svg" width="550" /></div>
+<p><br /></p>
+<div style="text-align: justify">
+<p>
+First we run <a href="https://github.com/zrlio/crail-spark-terasort">terasort</a> on our 8+1 machine cluster (see above). We sort 200GB, thus, each node gets 25GB of data (equal distribution). We further did a basic search of the parameter space for each of the systems to find the best possible configuration. In all the experiments we use 8 executors with 12 cores each. Note that in a typical Spark run more CPU cores than assigned are engaged because of garbabge collection, etc. In our test runs assigning 12 cores lead to the best performance.
+</p>
+<p>
+The plot above shows runtimes of the various configuration we run with terasort. SparkRDMA with the Wrapper shuffle writer performance slightly better (3-4%) than vanilla Spark whereas the Chunked shuffle writer shows a 30% overhead. On a quick inspection we found that this overhead stems from memory allocation and registration for the shuffle data that is kept in memory between the stages. Compared to vanilla Spark, Crail's shuffle plugin shows performance improvement of around 235%.
+</p>
+</div>
+<div style="text-align:center"><img src="//crail.incubator.apache.org/img/blog/rdma-shuffle/sql.svg" width="550" /></div>
+<p><br /></p>
+
+<div style="text-align: justify">
+<p>
+For our second workload we choose the <a href="https://github.com/zrlio/sql-benchmarks">SQL equijoin</a> with a <a href="https://github.com/zrlio/spark-nullio-fileformat">special fileformat</a> that allows data to be generated on the fly. By generating data on the fly we eliminate any costs for reading data from storage and focus entirely on the shuffle performance. The shuffle data size is around 148GB. Here the Wrapper shuffle writer is slightly slower than vanilla Spark but instead the Chunked shuffle writer is roughly the same amount faster. The Crail shuffle plugin again delivers a great performance increase over vanilla Spark.
+</p>
+</div>
+
+<div style="text-align: justify">
+<p>Please let us know if your have recommendations about how these experiments can be improved.</p>
+</div>
+
+<h3 id="summary">Summary</h3>
+
+<div style="text-align: justify">
+<p>
+These benchmarks validate our belief that a "last-mile" integration cannot deliver the same performance gains as a holistic approach, i.e. one has to look at the whole picture in how to integrate RDMA into Spark applications (and for that matter any framework or application). Only replacing the data transfer alone does not lead to the anticipated performance increase. We learned this the hard way when we intially started working on Crail.
+</p>
+
+</div>
+
+
+</div>
+
+<!-- 
+
+-->
+
+
+        <br>
+	<br> 
+          <div class="footer">
+            <p>Apache Crail is an effort undergoing <a href="https://incubator.apache.org/">incubation</a> at <a href="https://www.apache.org/">The Apache Software Foundation (ASF)</a>, sponsored by the Apache Incubator PMC. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF.
+            </p>
+          </div>
+
+        </div> <!-- /container -->
+
+        <!-- Support retina images. -->
+        <script type="text/javascript"
+            src="//crail.incubator.apache.org/js/srcset-polyfill.js"></script>
+    </body>
+</html>

http://git-wip-us.apache.org/repos/asf/incubator-crail-website/blob/0fc17baa/content/blog/2018/01/apache.html
----------------------------------------------------------------------
diff --git a/content/blog/2018/01/apache.html b/content/blog/2018/01/apache.html
new file mode 100644
index 0000000..e3abaf2
--- /dev/null
+++ b/content/blog/2018/01/apache.html
@@ -0,0 +1,93 @@
+<!DOCTYPE html>
+<html>
+    <head>
+        <meta charset="utf-8">
+        <title>The Apache Crail (Incubating) Project: Apache</title>
+        <meta name="viewport" content="width=device-width, initial-scale=1.0">
+        <link href="//crail.incubator.apache.org/css/bootstrap.min.css" rel="stylesheet">
+        <link href="//crail.incubator.apache.org/css/group.css" rel="stylesheet">
+        <link rel="alternate" type="application/atom+xml" title="Atom"
+            href="//crail.incubator.apache.org/blog/blog.xml">
+        
+        <meta property="og:image" content="//crail.incubator.apache.org/img/blog/preview/apache-summary.png" />
+        <meta property="og:image:secure_url" content="//crail.incubator.apache.org/img/blog/preview/apache-summary.png" />
+    </head>
+
+    <body>
+        <div class="container">
+          <div class="header">
+            <ul class="nav nav-pills pull-right">
+              
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/">
+                    Home
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/overview/">
+                    Overview
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/download/">
+                    Downloads
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/blog/">
+                    Blog
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/community/">
+                    Community
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/documentation/">
+                    Documentation
+                  </a>
+                </li>
+              
+            </ul>
+            <a href="//crail.incubator.apache.org/">
+                <img src="//crail.incubator.apache.org/img/crail_logo.png"
+                    srcset="//crail.incubator.apache.org/img/crail_logo.png"
+                    alt="Crail" id="logo">
+            </a>
+          </div>
+
+          
+          
+          <h2>Apache</h2>   
+          
+
+          <p>Crail is now an Apache Incubator project!</p>
+
+
+        <br>
+	<br> 
+          <div class="footer">
+            <p>Apache Crail is an effort undergoing <a href="https://incubator.apache.org/">incubation</a> at <a href="https://www.apache.org/">The Apache Software Foundation (ASF)</a>, sponsored by the Apache Incubator PMC. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF.
+            </p>
+          </div>
+
+        </div> <!-- /container -->
+
+        <!-- Support retina images. -->
+        <script type="text/javascript"
+            src="//crail.incubator.apache.org/js/srcset-polyfill.js"></script>
+    </body>
+</html>

http://git-wip-us.apache.org/repos/asf/incubator-crail-website/blob/0fc17baa/content/blog/2018/06/apache-release.html
----------------------------------------------------------------------
diff --git a/content/blog/2018/06/apache-release.html b/content/blog/2018/06/apache-release.html
new file mode 100644
index 0000000..23f40a2
--- /dev/null
+++ b/content/blog/2018/06/apache-release.html
@@ -0,0 +1,93 @@
+<!DOCTYPE html>
+<html>
+    <head>
+        <meta charset="utf-8">
+        <title>The Apache Crail (Incubating) Project: Apache Release</title>
+        <meta name="viewport" content="width=device-width, initial-scale=1.0">
+        <link href="//crail.incubator.apache.org/css/bootstrap.min.css" rel="stylesheet">
+        <link href="//crail.incubator.apache.org/css/group.css" rel="stylesheet">
+        <link rel="alternate" type="application/atom+xml" title="Atom"
+            href="//crail.incubator.apache.org/blog/blog.xml">
+        
+        <meta property="og:image" content="//crail.incubator.apache.org/img/blog/preview/apache-release-summary.png" />
+        <meta property="og:image:secure_url" content="//crail.incubator.apache.org/img/blog/preview/apache-release-summary.png" />
+    </head>
+
+    <body>
+        <div class="container">
+          <div class="header">
+            <ul class="nav nav-pills pull-right">
+              
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/">
+                    Home
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/overview/">
+                    Overview
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/download/">
+                    Downloads
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/blog/">
+                    Blog
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/community/">
+                    Community
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/documentation/">
+                    Documentation
+                  </a>
+                </li>
+              
+            </ul>
+            <a href="//crail.incubator.apache.org/">
+                <img src="//crail.incubator.apache.org/img/crail_logo.png"
+                    srcset="//crail.incubator.apache.org/img/crail_logo.png"
+                    alt="Crail" id="logo">
+            </a>
+          </div>
+
+          
+          
+          <h2>Apache Release</h2>   
+          
+
+          <p>Apache Crail 1.0 incubator <a href="//crail.incubator.apache.org/download">release</a></p>
+
+
+        <br>
+	<br> 
+          <div class="footer">
+            <p>Apache Crail is an effort undergoing <a href="https://incubator.apache.org/">incubation</a> at <a href="https://www.apache.org/">The Apache Software Foundation (ASF)</a>, sponsored by the Apache Incubator PMC. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF.
+            </p>
+          </div>
+
+        </div> <!-- /container -->
+
+        <!-- Support retina images. -->
+        <script type="text/javascript"
+            src="//crail.incubator.apache.org/js/srcset-polyfill.js"></script>
+    </body>
+</html>

http://git-wip-us.apache.org/repos/asf/incubator-crail-website/blob/0fc17baa/content/blog/2018/06/dataworks.html
----------------------------------------------------------------------
diff --git a/content/blog/2018/06/dataworks.html b/content/blog/2018/06/dataworks.html
new file mode 100644
index 0000000..c278119
--- /dev/null
+++ b/content/blog/2018/06/dataworks.html
@@ -0,0 +1,93 @@
+<!DOCTYPE html>
+<html>
+    <head>
+        <meta charset="utf-8">
+        <title>The Apache Crail (Incubating) Project: Dataworks</title>
+        <meta name="viewport" content="width=device-width, initial-scale=1.0">
+        <link href="//crail.incubator.apache.org/css/bootstrap.min.css" rel="stylesheet">
+        <link href="//crail.incubator.apache.org/css/group.css" rel="stylesheet">
+        <link rel="alternate" type="application/atom+xml" title="Atom"
+            href="//crail.incubator.apache.org/blog/blog.xml">
+        
+        <meta property="og:image" content="//crail.incubator.apache.org/img/blog/preview/dataworks-summary.png" />
+        <meta property="og:image:secure_url" content="//crail.incubator.apache.org/img/blog/preview/dataworks-summary.png" />
+    </head>
+
+    <body>
+        <div class="container">
+          <div class="header">
+            <ul class="nav nav-pills pull-right">
+              
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/">
+                    Home
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/overview/">
+                    Overview
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/download/">
+                    Downloads
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/blog/">
+                    Blog
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/community/">
+                    Community
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/documentation/">
+                    Documentation
+                  </a>
+                </li>
+              
+            </ul>
+            <a href="//crail.incubator.apache.org/">
+                <img src="//crail.incubator.apache.org/img/crail_logo.png"
+                    srcset="//crail.incubator.apache.org/img/crail_logo.png"
+                    alt="Crail" id="logo">
+            </a>
+          </div>
+
+          
+          
+          <h2>Dataworks</h2>   
+          
+
+          <p>Apache Crail (incubating) to feature in the <a href="https://dataworkssummit.com/san-jose-2018/session/data-processing-at-the-speed-of-100-gbpsapache-crail-incubating/">DataWorks Summit</a> on June 21st</p>
+
+
+        <br>
+	<br> 
+          <div class="footer">
+            <p>Apache Crail is an effort undergoing <a href="https://incubator.apache.org/">incubation</a> at <a href="https://www.apache.org/">The Apache Software Foundation (ASF)</a>, sponsored by the Apache Incubator PMC. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF.
+            </p>
+          </div>
+
+        </div> <!-- /container -->
+
+        <!-- Support retina images. -->
+        <script type="text/javascript"
+            src="//crail.incubator.apache.org/js/srcset-polyfill.js"></script>
+    </body>
+</html>

http://git-wip-us.apache.org/repos/asf/incubator-crail-website/blob/0fc17baa/content/blog/2018/06/sparksummit.html
----------------------------------------------------------------------
diff --git a/content/blog/2018/06/sparksummit.html b/content/blog/2018/06/sparksummit.html
new file mode 100644
index 0000000..d6afa64
--- /dev/null
+++ b/content/blog/2018/06/sparksummit.html
@@ -0,0 +1,93 @@
+<!DOCTYPE html>
+<html>
+    <head>
+        <meta charset="utf-8">
+        <title>The Apache Crail (Incubating) Project: Sparksummit</title>
+        <meta name="viewport" content="width=device-width, initial-scale=1.0">
+        <link href="//crail.incubator.apache.org/css/bootstrap.min.css" rel="stylesheet">
+        <link href="//crail.incubator.apache.org/css/group.css" rel="stylesheet">
+        <link rel="alternate" type="application/atom+xml" title="Atom"
+            href="//crail.incubator.apache.org/blog/blog.xml">
+        
+        <meta property="og:image" content="//crail.incubator.apache.org/img/blog/preview/sparksummit-summary.png" />
+        <meta property="og:image:secure_url" content="//crail.incubator.apache.org/img/blog/preview/sparksummit-summary.png" />
+    </head>
+
+    <body>
+        <div class="container">
+          <div class="header">
+            <ul class="nav nav-pills pull-right">
+              
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/">
+                    Home
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/overview/">
+                    Overview
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/download/">
+                    Downloads
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/blog/">
+                    Blog
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/community/">
+                    Community
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/documentation/">
+                    Documentation
+                  </a>
+                </li>
+              
+            </ul>
+            <a href="//crail.incubator.apache.org/">
+                <img src="//crail.incubator.apache.org/img/crail_logo.png"
+                    srcset="//crail.incubator.apache.org/img/crail_logo.png"
+                    alt="Crail" id="logo">
+            </a>
+          </div>
+
+          
+          
+          <h2>Sparksummit</h2>   
+          
+
+          <p>A Spark serverless architecture powered by Crail will be presented today at the <a href="https://databricks.com/session/serverless-machine-learning-on-modern-hardware-using-apache-spark">Spark Summit</a></p>
+
+
+        <br>
+	<br> 
+          <div class="footer">
+            <p>Apache Crail is an effort undergoing <a href="https://incubator.apache.org/">incubation</a> at <a href="https://www.apache.org/">The Apache Software Foundation (ASF)</a>, sponsored by the Apache Incubator PMC. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF.
+            </p>
+          </div>
+
+        </div> <!-- /container -->
+
+        <!-- Support retina images. -->
+        <script type="text/javascript"
+            src="//crail.incubator.apache.org/js/srcset-polyfill.js"></script>
+    </body>
+</html>

http://git-wip-us.apache.org/repos/asf/incubator-crail-website/blob/0fc17baa/content/blog/2018/08/sql-p1-news.html
----------------------------------------------------------------------
diff --git a/content/blog/2018/08/sql-p1-news.html b/content/blog/2018/08/sql-p1-news.html
new file mode 100644
index 0000000..a3b81cc
--- /dev/null
+++ b/content/blog/2018/08/sql-p1-news.html
@@ -0,0 +1,93 @@
+<!DOCTYPE html>
+<html>
+    <head>
+        <meta charset="utf-8">
+        <title>The Apache Crail (Incubating) Project: Sql P1 News</title>
+        <meta name="viewport" content="width=device-width, initial-scale=1.0">
+        <link href="//crail.incubator.apache.org/css/bootstrap.min.css" rel="stylesheet">
+        <link href="//crail.incubator.apache.org/css/group.css" rel="stylesheet">
+        <link rel="alternate" type="application/atom+xml" title="Atom"
+            href="//crail.incubator.apache.org/blog/blog.xml">
+        
+        <meta property="og:image" content="//crail.incubator.apache.org/img/blog/preview/sql-p1-news-summary.png" />
+        <meta property="og:image:secure_url" content="//crail.incubator.apache.org/img/blog/preview/sql-p1-news-summary.png" />
+    </head>
+
+    <body>
+        <div class="container">
+          <div class="header">
+            <ul class="nav nav-pills pull-right">
+              
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/">
+                    Home
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/overview/">
+                    Overview
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/download/">
+                    Downloads
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/blog/">
+                    Blog
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/community/">
+                    Community
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/documentation/">
+                    Documentation
+                  </a>
+                </li>
+              
+            </ul>
+            <a href="//crail.incubator.apache.org/">
+                <img src="//crail.incubator.apache.org/img/crail_logo.png"
+                    srcset="//crail.incubator.apache.org/img/crail_logo.png"
+                    alt="Crail" id="logo">
+            </a>
+          </div>
+
+          
+          
+          <h2>Sql P1 News</h2>   
+          
+
+          <p>A new blog <a href="//crail.incubator.apache.org/blog/2018/08/sql-p1.html">post</a> discussing file formats performance is now online</p>
+
+
+        <br>
+	<br> 
+          <div class="footer">
+            <p>Apache Crail is an effort undergoing <a href="https://incubator.apache.org/">incubation</a> at <a href="https://www.apache.org/">The Apache Software Foundation (ASF)</a>, sponsored by the Apache Incubator PMC. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF.
+            </p>
+          </div>
+
+        </div> <!-- /container -->
+
+        <!-- Support retina images. -->
+        <script type="text/javascript"
+            src="//crail.incubator.apache.org/js/srcset-polyfill.js"></script>
+    </body>
+</html>

http://git-wip-us.apache.org/repos/asf/incubator-crail-website/blob/0fc17baa/content/blog/2018/08/sql-p1.html
----------------------------------------------------------------------
diff --git a/content/blog/2018/08/sql-p1.html b/content/blog/2018/08/sql-p1.html
new file mode 100644
index 0000000..40cea88
--- /dev/null
+++ b/content/blog/2018/08/sql-p1.html
@@ -0,0 +1,232 @@
+<!DOCTYPE html>
+<html>
+    <head>
+        <meta charset="utf-8">
+        <title>The Apache Crail (Incubating) Project: SQL Performance: Part 1 - Input File Formats</title>
+        <meta name="viewport" content="width=device-width, initial-scale=1.0">
+        <link href="//crail.incubator.apache.org/css/bootstrap.min.css" rel="stylesheet">
+        <link href="//crail.incubator.apache.org/css/group.css" rel="stylesheet">
+        <link rel="alternate" type="application/atom+xml" title="Atom"
+            href="//crail.incubator.apache.org/blog/blog.xml">
+        
+        <meta property="og:image" content="//crail.incubator.apache.org/img/blog/preview/sql-p1-summary.png" />
+        <meta property="og:image:secure_url" content="//crail.incubator.apache.org/img/blog/preview/sql-p1-summary.png" />
+    </head>
+
+    <body>
+        <div class="container">
+          <div class="header">
+            <ul class="nav nav-pills pull-right">
+              
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/">
+                    Home
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/overview/">
+                    Overview
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/download/">
+                    Downloads
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/blog/">
+                    Blog
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/community/">
+                    Community
+                  </a>
+                </li>
+              
+                
+                <li >
+                  <a href="//crail.incubator.apache.org/documentation/">
+                    Documentation
+                  </a>
+                </li>
+              
+            </ul>
+            <a href="//crail.incubator.apache.org/">
+                <img src="//crail.incubator.apache.org/img/crail_logo.png"
+                    srcset="//crail.incubator.apache.org/img/crail_logo.png"
+                    alt="Crail" id="logo">
+            </a>
+          </div>
+
+          
+          
+          <h2>SQL Performance: Part 1 - Input File Formats</h2>   
+          
+
+          <p class="meta">08 Aug 2018,  <mark>this is a blog post from a user of the Crail project.</mark>  </p>
+
+<div class="post">
+<div style="text-align: justify">
+<p>
+This is the first user blog post in a multi-part series where we will focus on relational data processing performance (e.g., SQL) in presence of high-performance network and storage devices - the kind of devices that Crail targets. Relational data processing is one of the most popular and versatile workloads people run in the  cloud. The general idea is that data is stored in tables with a schema, and is processed using a domain specific language like SQL. Examples of some popular systems that support such relational data analytics in the cloud are <a href="https://spark.apache.org/sql/">Apache Spark/SQL</a>, <a href="https://hive.apache.org/">Apache Hive</a>, <a href="https://impala.apache.org/">Apache Impala</a>, etc. In this post, we discuss the important first step in relational data processing, which is the reading of input data tables.
+</p>
+</div>
+
+<h3 id="hardware-and-software-configuration">Hardware and Software Configuration</h3>
+
+<p>The specific cluster configuration used for the experiments in this blog:</p>
+
+<ul>
+  <li>Cluster
+    <ul>
+      <li>4 compute + 1 management node x86_64 cluster</li>
+    </ul>
+  </li>
+  <li>Node configuration
+    <ul>
+      <li>CPU: 2 x Intel(R) Xeon(R) CPU E5-2690 0 @ 2.90GHz</li>
+      <li>DRAM: 256 GB DDR3</li>
+      <li>Network: 1x100Gbit/s Mellanox ConnectX-5</li>
+    </ul>
+  </li>
+  <li>Software
+    <ul>
+      <li>Ubuntu 16.04.3 LTS (Xenial Xerus) with Linux kernel version 4.10.0-33-generic</li>
+      <li>Apache HDFS (2.7.3)</li>
+      <li>Apache Paruqet (1.8), Apache ORC (1.4), Apache Arrow (0.8), Apache Avro (1.4)</li>
+      <li><a href="https://github.com/apache/incubator-crail/">Apache Crail (incubating) with NVMeF support</a>, commit 64e635e5ce9411041bf47fac5d7fadcb83a84355 (since then Crail has a stable source release v1.0 with a newer NVMeF code-base)</li>
+    </ul>
+  </li>
+</ul>
+
+<h3 id="overview">Overview</h3>
+
+<p>In a typical cloud-based relational data processing setup, the input data is stored on an external data storage solution like HDFS or AWS S3. Data tables and their associated schema are converted into a storage-friendly format for optimal performance. Examples of some popular and familiar file formats are <a href="https://parquet.apache.org/">Apache Parquet</a>, <a href="https://orc.apache.org/">Apache ORC</a>, <a href="https://avro.apache.org/">Apache Avro</a>, <a href="https://en.wikipedia.org/wiki/JSON">JSON</a>, etc. More recently, <a href="https://arrow.apache.org/">Apache Arrow</a> has been introduced to standardize the in-memory columnar data representation between multiple frameworks. To be precise, Arrow is not a storage format but it defines an <a href="https://github.com/apache/arrow/blob/master/format/IPC.md">interprocess communication (IPC) format</a> that can be used to store data in a stroage system (our binding for reading Arrow IPC messages from HDFS is available
  <a href="https://github.com/zrlio/fileformat-benchmarks/blob/master/src/main/java/com/github/animeshtrivedi/FileBench/HdfsSeekableByteChannel.java">here</a>). There is no one size fits all as all these formats have their own strengths, weaknesses, and features. In this blog, we are specifically interested in the performance of these formats on modern high-performance networking and storage devices.</p>
+
+<figure><div style="text-align:center"><img src="//crail.incubator.apache.org/img/blog/sql-p1/outline.svg" width="550" /><figcaption>Figure 1: The benchmarking setup with HDFS and file formats on a 100 Gbps network with NVMe flash devices. All formats contains routines for compression, encoding, and value materialization with associated I/O buffer management and data copies routines.<p></p></figcaption></div></figure>
+
+<p>To benchmark the performance of file formats, we wrote a set of micro-benchmarks which are available at <a href="https://github.com/zrlio/fileformat-benchmarks">https://github.com/zrlio/fileformat-benchmarks</a>. We cannot use typical SQL micro-benchmarks because every SQL engine has its own favorite file format, on which it performs the best. Hence, in order to ensure parity, we decoupled the performance of reading the input file format from the SQL query processing by writing simple table reading micro-benchmarks. Our benchmark reads in the store_sales table from the TPC-DS dataset (scale factor 100), and calculates a sum of values present in the table. The table contains 23 columns of integers, doubles, and longs.</p>
+
+<figure><div style="text-align:center"><img src="//crail.incubator.apache.org/img/blog/sql-p1/performance-all.svg" width="550" /><figcaption>Figure 2: Performance of JSON, Avro, Parquet, ORC, and Arrow on NVMe devices over a 100 Gbps network.<p></p></figcaption></div></figure>
+
+<p>We evaluate the performance of the benchmark on a 3 node HDFS cluster connected using 100 Gbps RoCE. One datanode in HDFS contains 4 NVMe devices with a collective aggregate bandwidth of 12.5 GB/sec (equals to 100 Gbps, hence, we have a balanced network and storage performance). Figure 2 shows our results where none of the file formats is able to deliver the full hardware performance for reading input files. One third of the performance is already lost in HDFS (maximum throughput 74.9 Gbps out of possible 100 Gbps). The rest of the performance is lost inside the file format implementation, which needs to deal with encoding, buffer and I/O management, compression, etc. The best performer is Apache Arrow which is designed for in-memory columnar datasets. The performance of these file formats are bounded by the performance of the CPU, which is 100% loaded during the experiment. For a detailed analysis of the file formats, please refer to our paper - <a href="https://www.usenix.org/c
 onference/atc18/presentation/trivedi">Albis: High-Performance File Format for Big Data Systems (USENIX, ATC’18)</a>. As a side-note on the Arrow performance - we have evaluated the performance of <em>implementation of Arrow’s Java library</em>. As this library has been focused on interactions with off-heap memory, there is a head room for optimizing the HDFS/on-heap reading path of Arrow’s Java library.</p>
+
+<h3 id="albis-high-performance-file-format-for-big-data-systems">Albis: High-Performance File Format for Big Data Systems</h3>
+
+<p>Based on these findings, we have developed a new file format called Albis. Albis is built on similar design choices as Crail. The top-level idea is to leverage the performance of modern networking and storage devices without being bottleneck by the CPU. While designing Albis we revisited many outdated assumptions about the nature of I/O in a distributed setting, and came up with the following ideas:</p>
+
+<ul>
+  <li>No compression or encoding: Modern network and storage devices are fast. Hence, there is no need to trade CPU cycles for performance. A 4 byte integer should be stored as a 4 byte value.</li>
+  <li>Keep the data/metadata management simple: Albis splits a table into row and column groups, which are stored in hierarchical files and directories on the underlying file system (e.g., HDFS or Crail).</li>
+  <li>Careful object materialization using a binary API: To optimize the runtime representation in managed runtimes like the JVM, only objects which are necessary for SQL processing are materialized. Otherwise, a 4 byte integer can be passed around as a byte array (using the binary API of Albis).</li>
+</ul>
+
+<figure><div style="text-align:center"><img src="//crail.incubator.apache.org/img/blog/sql-p1/core-scalability.svg" width="550" /><figcaption>Figure 3: Core scalability of JSON, Avro, Parquet, ORC, Arrow, and Albis on HDFS/NVMe.<p></p></figcaption></div></figure>
+
+<p>Using the Albis format, we revise our previous experiment where we read the input store_sales table from HDFS. In the figure above, we show the performance of Albis and other file formats with number of CPU cores involved. At the right hand of the x-axis, we have performance with all 16 cores engaged, hence, representing the peak possible performance. As evident, Albis delivers 59.9 Gbps out of 74.9 Gbps possible bandwidth with HDFS over NVMe. Albis performance is 1.9 - 21.4x better than other file formats. To give an impression where the performance is coming from, in the table below we show some micro-architectural features for Parquet, ORC, Arrow, and Albis. Our previously discussed design ideas in Albis result in a shorter code path (shown as less instructions required for each row), better cache performance (shows as lower cache misses per row), and clearly better performance (shown as nanoseconds required per row for processing). For a detailed evaluation of Albis please re
 fer to our paper.</p>
+
+<table style="width:100%">
+  <caption> Table 1: Micro-architectural analysis for Parquet, ORC, Arrow, and Albis on a 16-core Xeon machine.<p></p></caption>
+  <tr>
+    <th></th>
+    <th>Parquet</th>
+    <th>ORC</th> 
+    <th>Arrow</th>
+    <th>Albis</th>
+  </tr>
+  <tr>
+    <th>Instructions/row</th>
+    <td>6.6K</td> 
+    <td>4.9K</td> 
+    <td>1.9K</td> 
+    <td>1.6K</td> 
+  </tr>
+  <tr>
+    <th>Cache misses/row</th>
+    <td>9.2</td> 
+    <td>4.6</td> 
+    <td>5.1</td> 
+    <td>3.0</td> 
+  </tr>
+  <tr>
+    <th>Nanoseconds/row</th>
+    <td>105.3</td> 
+    <td>63.0</td> 
+    <td>31.2</td> 
+    <td>20.8</td> 
+  </tr>
+</table>
+<p></p>
+
+<h3 id="apache-crail-incubating-with-albis">Apache Crail (Incubating) with Albis</h3>
+
+<p>For our final experiment, we try to answer the question what it would take to deliver the full 100 Gbps bandwidth for Albis. Certainly, the first bottleneck is to improve the base storage layer performance. Here we use Apache Crail (Incubating) with its <a href="https://en.wikipedia.org/wiki/NVM_Express#NVMeOF">NVMeF</a> storage tier. This tier uses <a href="https://github.com/zrlio/jNVMf">jNVMf library</a> to implement NVMeF stack in Java. As we have shown in a previous blog <a href="//crail.incubator.apache.org/blog/2017/08/crail-nvme-fabrics-v1.html">post</a> that Crail’s NVMeF tier can deliver performance (97.8 Gbps) very close to the hardware limits. Hence, Albis with Crail is a perfect setup to evaluate on high-performance NVMe and RDMA devices. Before we get there, let’s get some calculations right. The store_sales table in the TPC-DS dataset has a data density of 93.9% (out of 100 bytes, only 93.9 is data, others are null values). As we measure the goodput, the expect
 ed performance of Albis on Crail is 93.9% of 97.8 Gbps, which calculates to 91.8 Gbps. In our experiments, Albis on Crail delivers 85.5 Gbps. Figure 4 shows more detailed results.</p>
+
+<figure><div style="text-align:center"><img src="//crail.incubator.apache.org/img/blog/sql-p1/albis-crail.svg" width="550" /><figcaption>Figure 4: Performance of Albis on Crail.<p></p></figcaption></div></figure>
+
+<p>The left half of the figure shows the performance scalability of Albis on Crail in a setup with 1 core (8.9 Gbps) to 16 cores (85.5 Gbps). In comparison, the right half of the figure shows the performance of Crail on HDFS/NVMe at 59.9 Gbps, and on Crail/NVMe at 85.5 Gbps. The last bar shows the performance of Albis if the benchmark does not materialize Java object values. In this configuration, Albis on Crail delivers 91.3 Gbps, which is very close to the expected peak of 91.8 Gbps.</p>
+
+<h3 id="summary">Summary</h3>
+<div style="text-align: justify">
+<p>
+In this first blog of a multipart series, we have looked at the data ingestion performance of file formats on high-performance networking and storage devices. We found that popular file formats are in need for a performance revision. Based on our analysis, we designed and implemented Albis - a new file format for storing relational data. Albis and Crail share many design choices. Their combined performance of 85+ Gbps on a 100 Gbps network, gives us confidence in our approach and underlying software philosophy for both, Crail and Albis. 
+</p>
+
+<p>
+Stay tuned for the next part where we look at workload-level performance in Spark/SQL on modern high-performance networking and storage devices. Meanwhile let us know if you have any feedback or comments. 
+</p>
+</div>
+
+</div>
+
+<!-- 
+
+<div id="disqus_thread"></div>
+<script>
+
+/**
+*  RECOMMENDED CONFIGURATION VARIABLES: EDIT AND UNCOMMENT THE SECTION BELOW TO INSERT DYNAMIC VALUES FROM YOUR PLATFORM OR CMS.
+*  LEARN WHY DEFINING THESE VARIABLES IS IMPORTANT: https://disqus.com/admin/universalcode/#configuration-variables*/
+/*
+var disqus_config = function () {
+this.page.url = PAGE_URL;  // Replace PAGE_URL with your page's canonical URL variable
+this.page.identifier = PAGE_IDENTIFIER; // Replace PAGE_IDENTIFIER with your page's unique identifier variable
+};
+*/
+(function() { // DON'T EDIT BELOW THIS LINE
+var d = document, s = d.createElement('script');
+s.src = '//crail-io.disqus.com/embed.js';
+s.setAttribute('data-timestamp', +new Date());
+(d.head || d.body).appendChild(s);
+})();
+</script>
+<noscript>Please enable JavaScript to view the <a href="https://disqus.com/?ref_noscript">comments powered by Disqus.</a></noscript>
+
+-->
+
+
+        <br>
+	<br> 
+          <div class="footer">
+            <p>Apache Crail is an effort undergoing <a href="https://incubator.apache.org/">incubation</a> at <a href="https://www.apache.org/">The Apache Software Foundation (ASF)</a>, sponsored by the Apache Incubator PMC. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF.
+            </p>
+          </div>
+
+        </div> <!-- /container -->
+
+        <!-- Support retina images. -->
+        <script type="text/javascript"
+            src="//crail.incubator.apache.org/js/srcset-polyfill.js"></script>
+    </body>
+</html>