You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@ozone.apache.org by GitBox <gi...@apache.org> on 2021/01/07 21:34:39 UTC

[GitHub] [ozone] bharatviswa504 opened a new pull request #1766: HDDS-4563. Support TDE for MPU Keys on Encrypted Buckets

bharatviswa504 opened a new pull request #1766:
URL: https://github.com/apache/ozone/pull/1766


   ## What changes were proposed in this pull request?
   
   Support MPU keys in encrypted buckets.
   
   ## What is the link to the Apache JIRA
   
   https://issues.apache.org/jira/browse/HDDS-4653
   
   ## How was this patch tested?
   
   
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org

[GitHub] [ozone] hanishakoneru commented on pull request #1766: HDDS-4563. Support TDE for MPU Keys on Encrypted Buckets

Posted by GitBox <gi...@apache.org>.

hanishakoneru commented on pull request #1766:
URL: https://github.com/apache/ozone/pull/1766#issuecomment-769414699


   Thanks @bharatviswa504 for working on this. I have pushed an update to fix MPU in Encrypted bucket. 


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org

[GitHub] [ozone] xiaoyuyao commented on a change in pull request #1766: HDDS-4563. Support TDE for MPU Keys on Encrypted Buckets

Posted by GitBox <gi...@apache.org>.

xiaoyuyao commented on a change in pull request #1766:
URL: https://github.com/apache/ozone/pull/1766#discussion_r569036497



##########
File path: hadoop-ozone/client/src/main/java/org/apache/hadoop/ozone/client/io/MultipartCryptoKeyInputStream.java
##########
@@ -0,0 +1,373 @@
+/**
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ * <p>
+ * http://www.apache.org/licenses/LICENSE-2.0
+ * <p>
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.hadoop.ozone.client.io;
+
+import com.google.common.base.Preconditions;
+import org.apache.hadoop.crypto.CryptoInputStream;
+import org.apache.hadoop.fs.CanUnbuffer;
+import org.apache.hadoop.fs.FSExceptionMessages;
+import org.apache.hadoop.fs.Seekable;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
+import java.io.EOFException;
+import java.io.IOException;
+import java.util.Arrays;
+import java.util.List;
+
+public class MultipartCryptoKeyInputStream extends OzoneInputStream
+    implements Seekable, CanUnbuffer {
+
+  private static final Logger LOG =
+      LoggerFactory.getLogger(MultipartCryptoKeyInputStream.class);
+
+  private static final int EOF = -1;
+
+  private String key;
+  private long length = 0L;
+  private boolean closed = false;
+
+  // List of OzoneCryptoInputStream, one for each part of the key
+  private List<OzoneCryptoInputStream> partStreams;
+
+  // partOffsets[i] stores the index of the first data byte in
+  // partStream w.r.t the whole key data.
+  // For example, let’s say the part size is 200 bytes and part[0] stores
+  // data from indices 0 - 199, part[1] from indices 200 - 399 and so on.
+  // Then, partOffsets[0] = 0 (the offset of the first byte of data in
+  // part[0]), partOffsets[1] = 200 and so on.
+  private long[] partOffsets;
+
+  // Index of the partStream corresponding to the current position of the
+  // MultipartCryptoKeyInputStream i.e. offset of the data to be read next

Review comment:
       NIT: "offset of the data to be read next" is confusing and can be removed here. 

##########
File path: hadoop-ozone/client/src/main/java/org/apache/hadoop/ozone/client/io/MultipartCryptoKeyInputStream.java
##########
@@ -0,0 +1,373 @@
+/**
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ * <p>
+ * http://www.apache.org/licenses/LICENSE-2.0
+ * <p>
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.hadoop.ozone.client.io;
+
+import com.google.common.base.Preconditions;
+import org.apache.hadoop.crypto.CryptoInputStream;
+import org.apache.hadoop.fs.CanUnbuffer;
+import org.apache.hadoop.fs.FSExceptionMessages;
+import org.apache.hadoop.fs.Seekable;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
+import java.io.EOFException;
+import java.io.IOException;
+import java.util.Arrays;
+import java.util.List;
+
+public class MultipartCryptoKeyInputStream extends OzoneInputStream
+    implements Seekable, CanUnbuffer {
+
+  private static final Logger LOG =
+      LoggerFactory.getLogger(MultipartCryptoKeyInputStream.class);
+
+  private static final int EOF = -1;
+
+  private String key;
+  private long length = 0L;
+  private boolean closed = false;
+
+  // List of OzoneCryptoInputStream, one for each part of the key
+  private List<OzoneCryptoInputStream> partStreams;
+
+  // partOffsets[i] stores the index of the first data byte in
+  // partStream w.r.t the whole key data.
+  // For example, let’s say the part size is 200 bytes and part[0] stores
+  // data from indices 0 - 199, part[1] from indices 200 - 399 and so on.
+  // Then, partOffsets[0] = 0 (the offset of the first byte of data in
+  // part[0]), partOffsets[1] = 200 and so on.
+  private long[] partOffsets;
+
+  // Index of the partStream corresponding to the current position of the
+  // MultipartCryptoKeyInputStream i.e. offset of the data to be read next
+  private int partIndex = 0;
+
+  // Tracks the partIndex corresponding to the last seeked position so that it
+  // can be reset if a new position is seeked.
+  private int partIndexOfPrevPos = 0;

Review comment:
       NIT: can re rename partIndexOfPrevPos=> prevPartIndex




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org

[GitHub] [ozone] xiaoyuyao commented on a change in pull request #1766: HDDS-4563. Support TDE for MPU Keys on Encrypted Buckets

Posted by GitBox <gi...@apache.org>.

xiaoyuyao commented on a change in pull request #1766:
URL: https://github.com/apache/ozone/pull/1766#discussion_r569797661



##########
File path: hadoop-ozone/client/src/main/java/org/apache/hadoop/ozone/client/io/MultipartCryptoKeyInputStream.java
##########
@@ -0,0 +1,373 @@
+/**
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ * <p>
+ * http://www.apache.org/licenses/LICENSE-2.0
+ * <p>
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.hadoop.ozone.client.io;
+
+import com.google.common.base.Preconditions;
+import org.apache.hadoop.crypto.CryptoInputStream;
+import org.apache.hadoop.fs.CanUnbuffer;
+import org.apache.hadoop.fs.FSExceptionMessages;
+import org.apache.hadoop.fs.Seekable;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
+import java.io.EOFException;
+import java.io.IOException;
+import java.util.Arrays;
+import java.util.List;
+
+public class MultipartCryptoKeyInputStream extends OzoneInputStream
+    implements Seekable, CanUnbuffer {
+
+  private static final Logger LOG =
+      LoggerFactory.getLogger(MultipartCryptoKeyInputStream.class);
+
+  private static final int EOF = -1;
+
+  private String key;
+  private long length = 0L;
+  private boolean closed = false;
+
+  // List of OzoneCryptoInputStream, one for each part of the key
+  private List<OzoneCryptoInputStream> partStreams;
+
+  // partOffsets[i] stores the index of the first data byte in
+  // partStream w.r.t the whole key data.
+  // For example, let’s say the part size is 200 bytes and part[0] stores
+  // data from indices 0 - 199, part[1] from indices 200 - 399 and so on.
+  // Then, partOffsets[0] = 0 (the offset of the first byte of data in
+  // part[0]), partOffsets[1] = 200 and so on.
+  private long[] partOffsets;
+
+  // Index of the partStream corresponding to the current position of the
+  // MultipartCryptoKeyInputStream i.e. offset of the data to be read next
+  private int partIndex = 0;
+
+  // Tracks the partIndex corresponding to the last seeked position so that it
+  // can be reset if a new position is seeked.
+  private int partIndexOfPrevPos = 0;
+
+  // If a read's start/ length position doesn't coincide with a Crypto buffer
+  // boundary, it will be adjusted as reads should happen only at the buffer
+  // boundaries for decryption to happen correctly. In this case, after the
+  // data has been read and decrypted, only the requested data should be
+  // returned to the client. readPositionAdjustedBy and readLengthAdjustedBy
+  // store these adjustment information. Before returning to client, the first
+  // readPositionAdjustedBy number of bytes and the last readLengthAdjustedBy
+  // number of bytes must be discarded.
+  private int readPositionAdjustedBy = 0;
+  private int readLengthAdjustedBy = 0;
+
+  public MultipartCryptoKeyInputStream(String keyName,
+      List<OzoneCryptoInputStream> inputStreams) {
+
+    Preconditions.checkNotNull(inputStreams);
+
+    this.key = keyName;
+    this.partStreams = inputStreams;
+
+    // Calculate and update the partOffsets
+    this.partOffsets = new long[inputStreams.size()];
+    int i = 0;
+    for (OzoneCryptoInputStream ozoneCryptoInputStream : inputStreams) {
+      this.partOffsets[i++] = length;
+      length += ozoneCryptoInputStream.getLength();
+    }
+  }
+
+  /**
+   * {@inheritDoc}
+   */
+  @Override
+  public int read() throws IOException {
+    byte[] buf = new byte[1];
+    if (read(buf, 0, 1) == EOF) {
+      return EOF;
+    }
+    return Byte.toUnsignedInt(buf[0]);
+  }
+
+  /**
+   * {@inheritDoc}
+   */
+  @Override
+  public int read(byte[] b, int off, int len) throws IOException {
+    checkOpen();
+    if (b == null) {
+      throw new NullPointerException();
+    }
+    if (off < 0 || len < 0 || len > b.length - off) {
+      throw new IndexOutOfBoundsException();
+    }
+    if (len == 0) {
+      return 0;
+    }
+    int totalReadLen = 0;
+    while (len > 0) {
+      if (partStreams.size() == 0 ||
+          (partStreams.size() - 1 <= partIndex &&
+              partStreams.get(partIndex).getRemaining() == 0)) {
+        return totalReadLen == 0 ? EOF : totalReadLen;
+      }
+
+      // Get the current partStream and read data from it
+      OzoneCryptoInputStream current = partStreams.get(partIndex);
+      // CryptoInputStream reads hadoop.security.crypto.buffer.size number of
+      // bytes (default 8KB) at a time. This needs to be taken into account
+      // in calculating the numBytesToRead.
+      int numBytesToRead = getNumBytesToRead(len, (int)current.getRemaining(),
+          current.getBufferSize());
+      int numBytesRead;
+
+      if (readPositionAdjustedBy != 0 || readLengthAdjustedBy != 0) {
+        // There was some adjustment made in position and/ or length of data
+        // to be read to account for Crypto buffer boundary. Hence, read the
+        // data into a temp buffer and then copy only the requested data into
+        // clients buffer.
+        byte[] tempBuffer = new byte[numBytesToRead];
+        int actualNumBytesRead = current.read(tempBuffer, 0,
+            numBytesToRead);
+        numBytesRead = actualNumBytesRead - readPositionAdjustedBy -
+            readLengthAdjustedBy;
+
+        if (actualNumBytesRead != numBytesToRead) {
+          throw new IOException(String.format("Inconsistent read for key=%s " +
+                  "part=%d length=%d numBytesToRead(accounting for Crypto " +
+                  "boundaries)=%d numBytesRead(actual)=%d " +
+                  "numBytesToBeRead(into client buffer discarding crypto " +
+                  "boundary adjustments)=%d",
+              key, partIndex, current.getLength(), numBytesToRead,
+              actualNumBytesRead, numBytesRead));
+        }
+
+        // TODO: Byte array copies are not optimal. If there is a better and
+        //  more optimal solution to copy only a part of read data into
+        //  client buffer, this should be updated.
+        System.arraycopy(tempBuffer, readPositionAdjustedBy, b, off,
+            numBytesRead);
+
+        LOG.debug("OzoneCryptoInputStream for key: {} part: {} read {} bytes " +
+                "instead of {} bytes to account for Crypto buffer boundary. " +
+                "Client buffer will be copied with read data from position {}" +
+                "upto position {}, discarding the extra bytes read to " +
+                "maintain Crypto buffer boundary limits", key, partIndex,
+            actualNumBytesRead, numBytesRead, readPositionAdjustedBy,
+            actualNumBytesRead - readPositionAdjustedBy);
+
+        // Reset readPositionAdjustedBy and readLengthAdjustedBy
+        readPositionAdjustedBy = 0;
+        readLengthAdjustedBy = 0;
+      } else {
+        numBytesRead = current.read(b, off, numBytesToRead);
+        if (numBytesRead != numBytesToRead) {
+          throw new IOException(String.format("Inconsistent read for key=%s " +
+                  "part=%d length=%d numBytesToRead=%d numBytesRead=%d",
+              key, partIndex, current.getLength(), numBytesToRead,
+              numBytesRead));
+        }
+      }
+
+      totalReadLen += numBytesRead;
+      off += numBytesRead;
+      len -= numBytesRead;
+
+      if (current.getRemaining() <= 0 &&
+          ((partIndex + 1) < partStreams.size())) {
+        partIndex += 1;
+      }
+
+    }
+    return totalReadLen;
+  }
+
+  /**
+   * Get number of bytes to read from the current stream based on the length
+   * to be read, number of bytes remaining in the stream and the Crypto buffer
+   * size.
+   * Reads should be performed at the CryptoInputStream Buffer boundaries only.
+   * Otherwise, the decryption will be incorrect.
+   */
+  private int getNumBytesToRead(int lenToRead, int remaining,
+      int cryptoBufferSize) throws IOException {
+
+    Preconditions.checkArgument(readPositionAdjustedBy == 0);
+    Preconditions.checkArgument(readLengthAdjustedBy == 0);
+
+    // Check and adjust position if required
+    adjustReadPosition(cryptoBufferSize);
+    remaining += readPositionAdjustedBy;
+    lenToRead += readPositionAdjustedBy;
+
+    return adjustNumBytesToRead(lenToRead, remaining, cryptoBufferSize);
+  }
+
+  /**
+   * Reads should be performed at the CryptoInputStream Buffer boundary size.
+   * Otherwise, the decryption will be incorrect. Hence, if the position is
+   * not at the boundary limit, we have to adjust the position and might need
+   * to read more data than requested. The extra data will be filtered out
+   * before returning to the client.
+   */
+  private void adjustReadPosition(long cryptoBufferSize) throws IOException {
+    // Position of the buffer in current stream
+    long currentPosOfStream = partStreams.get(partIndex).getPos();
+    if (currentPosOfStream % cryptoBufferSize != 0) {
+      // Adjustment required.
+      // Update readPositionAdjustedBy and seek to the adjusted position
+      readPositionAdjustedBy = (int) (currentPosOfStream % cryptoBufferSize);

Review comment:
       NIT: readPositionAdjustedBy = (int) (currentPosOfStream % cryptoBufferSize);
   can be moved before if to save a % operation.




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org

[GitHub] [ozone] xiaoyuyao commented on pull request #1766: HDDS-4653. Support TDE for MPU Keys on Encrypted Buckets

Posted by GitBox <gi...@apache.org>.

xiaoyuyao commented on pull request #1766:
URL: https://github.com/apache/ozone/pull/1766#issuecomment-784382436


   +1, thanks @hanishakoneru and @bharatviswa504  for the contribution. I will merge the PR shortly. 


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org

[GitHub] [ozone] hanishakoneru commented on a change in pull request #1766: HDDS-4563. Support TDE for MPU Keys on Encrypted Buckets

Posted by GitBox <gi...@apache.org>.

hanishakoneru commented on a change in pull request #1766:
URL: https://github.com/apache/ozone/pull/1766#discussion_r569801032



##########
File path: hadoop-ozone/client/src/main/java/org/apache/hadoop/ozone/client/io/MultipartCryptoKeyInputStream.java
##########
@@ -0,0 +1,373 @@
+/**
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ * <p>
+ * http://www.apache.org/licenses/LICENSE-2.0
+ * <p>
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.hadoop.ozone.client.io;
+
+import com.google.common.base.Preconditions;
+import org.apache.hadoop.crypto.CryptoInputStream;
+import org.apache.hadoop.fs.CanUnbuffer;
+import org.apache.hadoop.fs.FSExceptionMessages;
+import org.apache.hadoop.fs.Seekable;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
+import java.io.EOFException;
+import java.io.IOException;
+import java.util.Arrays;
+import java.util.List;
+
+public class MultipartCryptoKeyInputStream extends OzoneInputStream
+    implements Seekable, CanUnbuffer {
+
+  private static final Logger LOG =
+      LoggerFactory.getLogger(MultipartCryptoKeyInputStream.class);
+
+  private static final int EOF = -1;
+
+  private String key;
+  private long length = 0L;
+  private boolean closed = false;
+
+  // List of OzoneCryptoInputStream, one for each part of the key
+  private List<OzoneCryptoInputStream> partStreams;
+
+  // partOffsets[i] stores the index of the first data byte in
+  // partStream w.r.t the whole key data.
+  // For example, let’s say the part size is 200 bytes and part[0] stores
+  // data from indices 0 - 199, part[1] from indices 200 - 399 and so on.
+  // Then, partOffsets[0] = 0 (the offset of the first byte of data in
+  // part[0]), partOffsets[1] = 200 and so on.
+  private long[] partOffsets;
+
+  // Index of the partStream corresponding to the current position of the
+  // MultipartCryptoKeyInputStream i.e. offset of the data to be read next
+  private int partIndex = 0;
+
+  // Tracks the partIndex corresponding to the last seeked position so that it
+  // can be reset if a new position is seeked.
+  private int partIndexOfPrevPos = 0;
+
+  // If a read's start/ length position doesn't coincide with a Crypto buffer
+  // boundary, it will be adjusted as reads should happen only at the buffer
+  // boundaries for decryption to happen correctly. In this case, after the
+  // data has been read and decrypted, only the requested data should be
+  // returned to the client. readPositionAdjustedBy and readLengthAdjustedBy
+  // store these adjustment information. Before returning to client, the first
+  // readPositionAdjustedBy number of bytes and the last readLengthAdjustedBy
+  // number of bytes must be discarded.
+  private int readPositionAdjustedBy = 0;
+  private int readLengthAdjustedBy = 0;
+
+  public MultipartCryptoKeyInputStream(String keyName,
+      List<OzoneCryptoInputStream> inputStreams) {
+
+    Preconditions.checkNotNull(inputStreams);
+
+    this.key = keyName;
+    this.partStreams = inputStreams;
+
+    // Calculate and update the partOffsets
+    this.partOffsets = new long[inputStreams.size()];
+    int i = 0;
+    for (OzoneCryptoInputStream ozoneCryptoInputStream : inputStreams) {
+      this.partOffsets[i++] = length;
+      length += ozoneCryptoInputStream.getLength();
+    }
+  }
+
+  /**
+   * {@inheritDoc}
+   */
+  @Override
+  public int read() throws IOException {
+    byte[] buf = new byte[1];
+    if (read(buf, 0, 1) == EOF) {
+      return EOF;
+    }
+    return Byte.toUnsignedInt(buf[0]);
+  }
+
+  /**
+   * {@inheritDoc}
+   */
+  @Override
+  public int read(byte[] b, int off, int len) throws IOException {
+    checkOpen();
+    if (b == null) {
+      throw new NullPointerException();
+    }
+    if (off < 0 || len < 0 || len > b.length - off) {
+      throw new IndexOutOfBoundsException();
+    }
+    if (len == 0) {
+      return 0;
+    }
+    int totalReadLen = 0;
+    while (len > 0) {
+      if (partStreams.size() == 0 ||
+          (partStreams.size() - 1 <= partIndex &&
+              partStreams.get(partIndex).getRemaining() == 0)) {
+        return totalReadLen == 0 ? EOF : totalReadLen;
+      }
+
+      // Get the current partStream and read data from it
+      OzoneCryptoInputStream current = partStreams.get(partIndex);
+      // CryptoInputStream reads hadoop.security.crypto.buffer.size number of
+      // bytes (default 8KB) at a time. This needs to be taken into account
+      // in calculating the numBytesToRead.
+      int numBytesToRead = getNumBytesToRead(len, (int)current.getRemaining(),
+          current.getBufferSize());
+      int numBytesRead;
+
+      if (readPositionAdjustedBy != 0 || readLengthAdjustedBy != 0) {
+        // There was some adjustment made in position and/ or length of data
+        // to be read to account for Crypto buffer boundary. Hence, read the
+        // data into a temp buffer and then copy only the requested data into
+        // clients buffer.
+        byte[] tempBuffer = new byte[numBytesToRead];
+        int actualNumBytesRead = current.read(tempBuffer, 0,
+            numBytesToRead);
+        numBytesRead = actualNumBytesRead - readPositionAdjustedBy -
+            readLengthAdjustedBy;
+
+        if (actualNumBytesRead != numBytesToRead) {
+          throw new IOException(String.format("Inconsistent read for key=%s " +
+                  "part=%d length=%d numBytesToRead(accounting for Crypto " +
+                  "boundaries)=%d numBytesRead(actual)=%d " +
+                  "numBytesToBeRead(into client buffer discarding crypto " +
+                  "boundary adjustments)=%d",
+              key, partIndex, current.getLength(), numBytesToRead,
+              actualNumBytesRead, numBytesRead));
+        }
+
+        // TODO: Byte array copies are not optimal. If there is a better and
+        //  more optimal solution to copy only a part of read data into
+        //  client buffer, this should be updated.
+        System.arraycopy(tempBuffer, readPositionAdjustedBy, b, off,
+            numBytesRead);
+
+        LOG.debug("OzoneCryptoInputStream for key: {} part: {} read {} bytes " +
+                "instead of {} bytes to account for Crypto buffer boundary. " +
+                "Client buffer will be copied with read data from position {}" +
+                "upto position {}, discarding the extra bytes read to " +
+                "maintain Crypto buffer boundary limits", key, partIndex,
+            actualNumBytesRead, numBytesRead, readPositionAdjustedBy,
+            actualNumBytesRead - readPositionAdjustedBy);
+
+        // Reset readPositionAdjustedBy and readLengthAdjustedBy
+        readPositionAdjustedBy = 0;
+        readLengthAdjustedBy = 0;
+      } else {
+        numBytesRead = current.read(b, off, numBytesToRead);
+        if (numBytesRead != numBytesToRead) {
+          throw new IOException(String.format("Inconsistent read for key=%s " +
+                  "part=%d length=%d numBytesToRead=%d numBytesRead=%d",
+              key, partIndex, current.getLength(), numBytesToRead,
+              numBytesRead));
+        }
+      }
+
+      totalReadLen += numBytesRead;
+      off += numBytesRead;
+      len -= numBytesRead;
+
+      if (current.getRemaining() <= 0 &&
+          ((partIndex + 1) < partStreams.size())) {
+        partIndex += 1;
+      }
+
+    }
+    return totalReadLen;
+  }
+
+  /**
+   * Get number of bytes to read from the current stream based on the length
+   * to be read, number of bytes remaining in the stream and the Crypto buffer
+   * size.
+   * Reads should be performed at the CryptoInputStream Buffer boundaries only.
+   * Otherwise, the decryption will be incorrect.
+   */
+  private int getNumBytesToRead(int lenToRead, int remaining,
+      int cryptoBufferSize) throws IOException {
+
+    Preconditions.checkArgument(readPositionAdjustedBy == 0);
+    Preconditions.checkArgument(readLengthAdjustedBy == 0);
+
+    // Check and adjust position if required
+    adjustReadPosition(cryptoBufferSize);
+    remaining += readPositionAdjustedBy;
+    lenToRead += readPositionAdjustedBy;
+
+    return adjustNumBytesToRead(lenToRead, remaining, cryptoBufferSize);
+  }
+
+  /**
+   * Reads should be performed at the CryptoInputStream Buffer boundary size.
+   * Otherwise, the decryption will be incorrect. Hence, if the position is
+   * not at the boundary limit, we have to adjust the position and might need
+   * to read more data than requested. The extra data will be filtered out
+   * before returning to the client.
+   */
+  private void adjustReadPosition(long cryptoBufferSize) throws IOException {
+    // Position of the buffer in current stream
+    long currentPosOfStream = partStreams.get(partIndex).getPos();
+    if (currentPosOfStream % cryptoBufferSize != 0) {
+      // Adjustment required.
+      // Update readPositionAdjustedBy and seek to the adjusted position
+      readPositionAdjustedBy = (int) (currentPosOfStream % cryptoBufferSize);
+      // Seek current partStream to adjusted position. We do not need to
+      // reset the seeked positions of other streams.
+      partStreams.get(partIndex)
+          .seek(currentPosOfStream - readPositionAdjustedBy);
+      LOG.debug("OzoneCryptoInputStream for key: {} part: {} adjusted " +
+              "position {} by -{} to account for Crypto buffer boundary",
+          key, partIndex, currentPosOfStream, readPositionAdjustedBy);
+    }
+  }
+
+  /**
+   * If the length of data requested does not end at a Crypto Buffer
+   * boundary, the number of bytes to be read must be adjusted accordingly.
+   * The extra data will be filtered out before returning to the client.
+   */
+  private int adjustNumBytesToRead(int lenToRead, int remaining,
+      int cryptoBufferSize) {
+    int numBytesToRead = Math.min(cryptoBufferSize, remaining);
+    if (lenToRead < numBytesToRead) {
+      // Adjustment required; Update readLengthAdjustedBy.
+      readLengthAdjustedBy = numBytesToRead - lenToRead;
+      LOG.debug("OzoneCryptoInputStream for key: {} part: {} adjusted length " +
+              "by +{} to account for Crypto buffer boundary",
+          key, partIndex, readLengthAdjustedBy);
+    }
+    return numBytesToRead;
+  }
+
+  /**
+   * Seeks the InputStream to the specified position. This involves 2 steps:
+   *    1. Updating the partIndex to the partStream corresponding to the
+   *    seeked position.
+   *    2. Seeking the corresponding partStream to the adjusted position.
+   *
+   * For example, let’s say the part sizes are 200 bytes and part[0] stores
+   * data from indices 0 - 199, part[1] from indices 200 - 399 and so on.
+   * Let’s say we seek to position 240. In the first step, the partIndex
+   * would be updated to 1 as indices 200 - 399 reside in partStream[1]. In
+   * the second step, the partStream[1] would be seeked to position 40 (=
+   * 240 - blockOffset[1] (= 200)).
+   */
+  @Override
+  public void seek(long pos) throws IOException {
+    if (pos == 0 && length == 0) {
+      // It is possible for length and pos to be zero in which case
+      // seek should return instead of throwing exception
+      return;
+    }
+    if (pos < 0 || pos > length) {
+      throw new EOFException("EOF encountered at pos: " + pos);
+    }
+
+    // 1. Update the partIndex
+    if (partIndex >= partStreams.size()) {
+      partIndex = Arrays.binarySearch(partOffsets, pos);
+    } else if (pos < partOffsets[partIndex]) {
+      partIndex =
+          Arrays.binarySearch(partOffsets, 0, partIndex, pos);
+    } else if (pos >= partOffsets[partIndex] + partStreams
+        .get(partIndex).getLength()) {
+      partIndex = Arrays.binarySearch(partOffsets, partIndex + 1,
+          partStreams.size(), pos);
+    }
+    if (partIndex < 0) {
+      // Binary search returns -insertionPoint - 1  if element is not present
+      // in the array. insertionPoint is the point at which element would be
+      // inserted in the sorted array. We need to adjust the blockIndex
+      // accordingly so that partIndex = insertionPoint - 1
+      partIndex = -partIndex - 2;
+    }
+
+    // Reset the previous partStream's position
+    partStreams.get(partIndexOfPrevPos).seek(0);
+
+    // Reset all the partStreams above the partIndex. We do this to reset
+    // any previous reads which might have updated the higher part
+    // streams position.
+    for (int index = partIndex + 1; index < partStreams.size(); index++) {
+      partStreams.get(index).seek(0);
+    }
+    // 2. Seek the partStream to the adjusted position
+    partStreams.get(partIndex).seek(pos - partOffsets[partIndex]);
+    partIndexOfPrevPos = partIndex;
+  }
+
+  @Override
+  public synchronized long getPos() throws IOException {
+    return length == 0 ? 0 : partOffsets[partIndex] +
+        partStreams.get(partIndex).getPos();
+  }
+
+  @Override
+  public boolean seekToNewSource(long targetPos) throws IOException {

Review comment:
       The underlying InputStreams also have this as false. 
   If we do want to support this, then it would need to be changed in underlying streams also. 




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org

[GitHub] [ozone] xiaoyuyao commented on a change in pull request #1766: HDDS-4653. Support TDE for MPU Keys on Encrypted Buckets

Posted by GitBox <gi...@apache.org>.

xiaoyuyao commented on a change in pull request #1766:
URL: https://github.com/apache/ozone/pull/1766#discussion_r573252307



##########
File path: hadoop-ozone/ozone-manager/src/main/java/org/apache/hadoop/ozone/om/request/key/OMKeyRequest.java
##########
@@ -546,6 +546,39 @@ protected void generateRequiredEncryptionInfo(KeyArgs keyArgs,
     }
   }
 
+  protected void getFileEncryptionInfoForMpuKey(KeyArgs keyArgs,
+      KeyArgs.Builder newKeyArgs, OzoneManager ozoneManager)
+      throws IOException {
+
+    String volumeName = keyArgs.getVolumeName();
+    String bucketName = keyArgs.getBucketName();
+
+    boolean acquireLock = false;
+    OMMetadataManager omMetadataManager = ozoneManager.getMetadataManager();
+
+    if (ozoneManager.getKmsProvider() != null) {
+      try {
+        acquireLock = omMetadataManager.getLock().acquireReadLock(

Review comment:
       Can we move the acquireReadLock outside try block?
   




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org

[GitHub] [ozone] bharatviswa504 commented on pull request #1766: HDDS-4653. Support TDE for MPU Keys on Encrypted Buckets

Posted by GitBox <gi...@apache.org>.

bharatviswa504 commented on pull request #1766:
URL: https://github.com/apache/ozone/pull/1766#issuecomment-777195550


   Thank You @hanishakoneru for continuing the work on this PR and making it work.
   Fixed find bug issues in the latest commit, test failure looks unrelated, will wait for another CI run.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org

[GitHub] [ozone] xiaoyuyao merged pull request #1766: HDDS-4653. Support TDE for MPU Keys on Encrypted Buckets

Posted by GitBox <gi...@apache.org>.

xiaoyuyao merged pull request #1766:
URL: https://github.com/apache/ozone/pull/1766


   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org

[GitHub] [ozone] bharatviswa504 commented on a change in pull request #1766: HDDS-4653. Support TDE for MPU Keys on Encrypted Buckets

Posted by GitBox <gi...@apache.org>.

bharatviswa504 commented on a change in pull request #1766:
URL: https://github.com/apache/ozone/pull/1766#discussion_r574242483



##########
File path: hadoop-ozone/ozone-manager/src/main/java/org/apache/hadoop/ozone/om/request/key/OMKeyRequest.java
##########
@@ -546,6 +546,39 @@ protected void generateRequiredEncryptionInfo(KeyArgs keyArgs,
     }
   }
 
+  protected void getFileEncryptionInfoForMpuKey(KeyArgs keyArgs,
+      KeyArgs.Builder newKeyArgs, OzoneManager ozoneManager)
+      throws IOException {
+
+    String volumeName = keyArgs.getVolumeName();
+    String bucketName = keyArgs.getBucketName();
+
+    boolean acquireLock = false;
+    OMMetadataManager omMetadataManager = ozoneManager.getMetadataManager();
+
+    if (ozoneManager.getKmsProvider() != null) {
+      try {
+        acquireLock = omMetadataManager.getLock().acquireReadLock(

Review comment:
       Lock release is handled in the finally block.
   Any reason to move this outside try block, just trying to understand the reason.




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org

[GitHub] [ozone] hanishakoneru commented on a change in pull request #1766: HDDS-4563. Support TDE for MPU Keys on Encrypted Buckets

Posted by GitBox <gi...@apache.org>.

hanishakoneru commented on a change in pull request #1766:
URL: https://github.com/apache/ozone/pull/1766#discussion_r569802349



##########
File path: hadoop-ozone/client/src/main/java/org/apache/hadoop/ozone/client/io/MultipartCryptoKeyInputStream.java
##########
@@ -0,0 +1,373 @@
+/**
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ * <p>
+ * http://www.apache.org/licenses/LICENSE-2.0
+ * <p>
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.hadoop.ozone.client.io;
+
+import com.google.common.base.Preconditions;
+import org.apache.hadoop.crypto.CryptoInputStream;
+import org.apache.hadoop.fs.CanUnbuffer;
+import org.apache.hadoop.fs.FSExceptionMessages;
+import org.apache.hadoop.fs.Seekable;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
+import java.io.EOFException;
+import java.io.IOException;
+import java.util.Arrays;
+import java.util.List;
+
+public class MultipartCryptoKeyInputStream extends OzoneInputStream
+    implements Seekable, CanUnbuffer {
+
+  private static final Logger LOG =
+      LoggerFactory.getLogger(MultipartCryptoKeyInputStream.class);
+
+  private static final int EOF = -1;
+
+  private String key;
+  private long length = 0L;
+  private boolean closed = false;
+
+  // List of OzoneCryptoInputStream, one for each part of the key
+  private List<OzoneCryptoInputStream> partStreams;
+
+  // partOffsets[i] stores the index of the first data byte in
+  // partStream w.r.t the whole key data.
+  // For example, let’s say the part size is 200 bytes and part[0] stores
+  // data from indices 0 - 199, part[1] from indices 200 - 399 and so on.
+  // Then, partOffsets[0] = 0 (the offset of the first byte of data in
+  // part[0]), partOffsets[1] = 200 and so on.
+  private long[] partOffsets;
+
+  // Index of the partStream corresponding to the current position of the
+  // MultipartCryptoKeyInputStream i.e. offset of the data to be read next
+  private int partIndex = 0;
+
+  // Tracks the partIndex corresponding to the last seeked position so that it
+  // can be reset if a new position is seeked.
+  private int partIndexOfPrevPos = 0;
+
+  // If a read's start/ length position doesn't coincide with a Crypto buffer
+  // boundary, it will be adjusted as reads should happen only at the buffer
+  // boundaries for decryption to happen correctly. In this case, after the
+  // data has been read and decrypted, only the requested data should be
+  // returned to the client. readPositionAdjustedBy and readLengthAdjustedBy
+  // store these adjustment information. Before returning to client, the first
+  // readPositionAdjustedBy number of bytes and the last readLengthAdjustedBy
+  // number of bytes must be discarded.
+  private int readPositionAdjustedBy = 0;
+  private int readLengthAdjustedBy = 0;
+
+  public MultipartCryptoKeyInputStream(String keyName,
+      List<OzoneCryptoInputStream> inputStreams) {
+
+    Preconditions.checkNotNull(inputStreams);
+
+    this.key = keyName;
+    this.partStreams = inputStreams;
+
+    // Calculate and update the partOffsets
+    this.partOffsets = new long[inputStreams.size()];
+    int i = 0;
+    for (OzoneCryptoInputStream ozoneCryptoInputStream : inputStreams) {
+      this.partOffsets[i++] = length;
+      length += ozoneCryptoInputStream.getLength();
+    }
+  }
+
+  /**
+   * {@inheritDoc}
+   */
+  @Override
+  public int read() throws IOException {
+    byte[] buf = new byte[1];
+    if (read(buf, 0, 1) == EOF) {
+      return EOF;
+    }
+    return Byte.toUnsignedInt(buf[0]);
+  }
+
+  /**
+   * {@inheritDoc}
+   */
+  @Override
+  public int read(byte[] b, int off, int len) throws IOException {
+    checkOpen();
+    if (b == null) {
+      throw new NullPointerException();
+    }
+    if (off < 0 || len < 0 || len > b.length - off) {
+      throw new IndexOutOfBoundsException();
+    }
+    if (len == 0) {
+      return 0;
+    }
+    int totalReadLen = 0;
+    while (len > 0) {
+      if (partStreams.size() == 0 ||
+          (partStreams.size() - 1 <= partIndex &&
+              partStreams.get(partIndex).getRemaining() == 0)) {
+        return totalReadLen == 0 ? EOF : totalReadLen;
+      }
+
+      // Get the current partStream and read data from it
+      OzoneCryptoInputStream current = partStreams.get(partIndex);
+      // CryptoInputStream reads hadoop.security.crypto.buffer.size number of
+      // bytes (default 8KB) at a time. This needs to be taken into account
+      // in calculating the numBytesToRead.
+      int numBytesToRead = getNumBytesToRead(len, (int)current.getRemaining(),
+          current.getBufferSize());
+      int numBytesRead;
+
+      if (readPositionAdjustedBy != 0 || readLengthAdjustedBy != 0) {
+        // There was some adjustment made in position and/ or length of data
+        // to be read to account for Crypto buffer boundary. Hence, read the
+        // data into a temp buffer and then copy only the requested data into
+        // clients buffer.
+        byte[] tempBuffer = new byte[numBytesToRead];
+        int actualNumBytesRead = current.read(tempBuffer, 0,
+            numBytesToRead);
+        numBytesRead = actualNumBytesRead - readPositionAdjustedBy -
+            readLengthAdjustedBy;
+
+        if (actualNumBytesRead != numBytesToRead) {
+          throw new IOException(String.format("Inconsistent read for key=%s " +
+                  "part=%d length=%d numBytesToRead(accounting for Crypto " +
+                  "boundaries)=%d numBytesRead(actual)=%d " +
+                  "numBytesToBeRead(into client buffer discarding crypto " +
+                  "boundary adjustments)=%d",
+              key, partIndex, current.getLength(), numBytesToRead,
+              actualNumBytesRead, numBytesRead));
+        }
+
+        // TODO: Byte array copies are not optimal. If there is a better and
+        //  more optimal solution to copy only a part of read data into
+        //  client buffer, this should be updated.
+        System.arraycopy(tempBuffer, readPositionAdjustedBy, b, off,
+            numBytesRead);
+
+        LOG.debug("OzoneCryptoInputStream for key: {} part: {} read {} bytes " +
+                "instead of {} bytes to account for Crypto buffer boundary. " +
+                "Client buffer will be copied with read data from position {}" +
+                "upto position {}, discarding the extra bytes read to " +
+                "maintain Crypto buffer boundary limits", key, partIndex,
+            actualNumBytesRead, numBytesRead, readPositionAdjustedBy,
+            actualNumBytesRead - readPositionAdjustedBy);
+
+        // Reset readPositionAdjustedBy and readLengthAdjustedBy
+        readPositionAdjustedBy = 0;
+        readLengthAdjustedBy = 0;
+      } else {
+        numBytesRead = current.read(b, off, numBytesToRead);
+        if (numBytesRead != numBytesToRead) {
+          throw new IOException(String.format("Inconsistent read for key=%s " +
+                  "part=%d length=%d numBytesToRead=%d numBytesRead=%d",
+              key, partIndex, current.getLength(), numBytesToRead,
+              numBytesRead));
+        }
+      }
+
+      totalReadLen += numBytesRead;
+      off += numBytesRead;
+      len -= numBytesRead;
+
+      if (current.getRemaining() <= 0 &&
+          ((partIndex + 1) < partStreams.size())) {
+        partIndex += 1;
+      }
+
+    }
+    return totalReadLen;
+  }
+
+  /**
+   * Get number of bytes to read from the current stream based on the length
+   * to be read, number of bytes remaining in the stream and the Crypto buffer
+   * size.
+   * Reads should be performed at the CryptoInputStream Buffer boundaries only.
+   * Otherwise, the decryption will be incorrect.
+   */
+  private int getNumBytesToRead(int lenToRead, int remaining,
+      int cryptoBufferSize) throws IOException {
+
+    Preconditions.checkArgument(readPositionAdjustedBy == 0);
+    Preconditions.checkArgument(readLengthAdjustedBy == 0);
+
+    // Check and adjust position if required
+    adjustReadPosition(cryptoBufferSize);
+    remaining += readPositionAdjustedBy;
+    lenToRead += readPositionAdjustedBy;
+
+    return adjustNumBytesToRead(lenToRead, remaining, cryptoBufferSize);
+  }
+
+  /**
+   * Reads should be performed at the CryptoInputStream Buffer boundary size.
+   * Otherwise, the decryption will be incorrect. Hence, if the position is
+   * not at the boundary limit, we have to adjust the position and might need
+   * to read more data than requested. The extra data will be filtered out
+   * before returning to the client.
+   */
+  private void adjustReadPosition(long cryptoBufferSize) throws IOException {
+    // Position of the buffer in current stream
+    long currentPosOfStream = partStreams.get(partIndex).getPos();
+    if (currentPosOfStream % cryptoBufferSize != 0) {
+      // Adjustment required.
+      // Update readPositionAdjustedBy and seek to the adjusted position
+      readPositionAdjustedBy = (int) (currentPosOfStream % cryptoBufferSize);
+      // Seek current partStream to adjusted position. We do not need to
+      // reset the seeked positions of other streams.
+      partStreams.get(partIndex)
+          .seek(currentPosOfStream - readPositionAdjustedBy);
+      LOG.debug("OzoneCryptoInputStream for key: {} part: {} adjusted " +
+              "position {} by -{} to account for Crypto buffer boundary",
+          key, partIndex, currentPosOfStream, readPositionAdjustedBy);
+    }
+  }
+
+  /**
+   * If the length of data requested does not end at a Crypto Buffer
+   * boundary, the number of bytes to be read must be adjusted accordingly.
+   * The extra data will be filtered out before returning to the client.
+   */
+  private int adjustNumBytesToRead(int lenToRead, int remaining,
+      int cryptoBufferSize) {
+    int numBytesToRead = Math.min(cryptoBufferSize, remaining);
+    if (lenToRead < numBytesToRead) {
+      // Adjustment required; Update readLengthAdjustedBy.
+      readLengthAdjustedBy = numBytesToRead - lenToRead;
+      LOG.debug("OzoneCryptoInputStream for key: {} part: {} adjusted length " +
+              "by +{} to account for Crypto buffer boundary",
+          key, partIndex, readLengthAdjustedBy);
+    }
+    return numBytesToRead;
+  }
+
+  /**
+   * Seeks the InputStream to the specified position. This involves 2 steps:
+   *    1. Updating the partIndex to the partStream corresponding to the
+   *    seeked position.
+   *    2. Seeking the corresponding partStream to the adjusted position.
+   *
+   * For example, let’s say the part sizes are 200 bytes and part[0] stores
+   * data from indices 0 - 199, part[1] from indices 200 - 399 and so on.
+   * Let’s say we seek to position 240. In the first step, the partIndex
+   * would be updated to 1 as indices 200 - 399 reside in partStream[1]. In
+   * the second step, the partStream[1] would be seeked to position 40 (=
+   * 240 - blockOffset[1] (= 200)).
+   */
+  @Override
+  public void seek(long pos) throws IOException {
+    if (pos == 0 && length == 0) {
+      // It is possible for length and pos to be zero in which case
+      // seek should return instead of throwing exception
+      return;
+    }
+    if (pos < 0 || pos > length) {
+      throw new EOFException("EOF encountered at pos: " + pos);
+    }
+
+    // 1. Update the partIndex
+    if (partIndex >= partStreams.size()) {
+      partIndex = Arrays.binarySearch(partOffsets, pos);
+    } else if (pos < partOffsets[partIndex]) {
+      partIndex =
+          Arrays.binarySearch(partOffsets, 0, partIndex, pos);
+    } else if (pos >= partOffsets[partIndex] + partStreams
+        .get(partIndex).getLength()) {
+      partIndex = Arrays.binarySearch(partOffsets, partIndex + 1,
+          partStreams.size(), pos);
+    }
+    if (partIndex < 0) {
+      // Binary search returns -insertionPoint - 1  if element is not present
+      // in the array. insertionPoint is the point at which element would be
+      // inserted in the sorted array. We need to adjust the blockIndex
+      // accordingly so that partIndex = insertionPoint - 1
+      partIndex = -partIndex - 2;
+    }
+
+    // Reset the previous partStream's position
+    partStreams.get(partIndexOfPrevPos).seek(0);
+
+    // Reset all the partStreams above the partIndex. We do this to reset
+    // any previous reads which might have updated the higher part
+    // streams position.
+    for (int index = partIndex + 1; index < partStreams.size(); index++) {
+      partStreams.get(index).seek(0);
+    }
+    // 2. Seek the partStream to the adjusted position
+    partStreams.get(partIndex).seek(pos - partOffsets[partIndex]);
+    partIndexOfPrevPos = partIndex;
+  }
+
+  @Override
+  public synchronized long getPos() throws IOException {
+    return length == 0 ? 0 : partOffsets[partIndex] +

Review comment:
       The underlying stream would perform the check. But I will add it here as well.




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org

[GitHub] [ozone] hanishakoneru commented on a change in pull request #1766: HDDS-4563. Support TDE for MPU Keys on Encrypted Buckets

Posted by GitBox <gi...@apache.org>.

hanishakoneru commented on a change in pull request #1766:
URL: https://github.com/apache/ozone/pull/1766#discussion_r569798937



##########
File path: hadoop-ozone/client/src/main/java/org/apache/hadoop/ozone/client/io/MultipartCryptoKeyInputStream.java
##########
@@ -0,0 +1,373 @@
+/**
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ * <p>
+ * http://www.apache.org/licenses/LICENSE-2.0
+ * <p>
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.hadoop.ozone.client.io;
+
+import com.google.common.base.Preconditions;
+import org.apache.hadoop.crypto.CryptoInputStream;
+import org.apache.hadoop.fs.CanUnbuffer;
+import org.apache.hadoop.fs.FSExceptionMessages;
+import org.apache.hadoop.fs.Seekable;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
+import java.io.EOFException;
+import java.io.IOException;
+import java.util.Arrays;
+import java.util.List;
+
+public class MultipartCryptoKeyInputStream extends OzoneInputStream
+    implements Seekable, CanUnbuffer {
+
+  private static final Logger LOG =
+      LoggerFactory.getLogger(MultipartCryptoKeyInputStream.class);
+
+  private static final int EOF = -1;
+
+  private String key;
+  private long length = 0L;
+  private boolean closed = false;
+
+  // List of OzoneCryptoInputStream, one for each part of the key
+  private List<OzoneCryptoInputStream> partStreams;
+
+  // partOffsets[i] stores the index of the first data byte in
+  // partStream w.r.t the whole key data.
+  // For example, let’s say the part size is 200 bytes and part[0] stores
+  // data from indices 0 - 199, part[1] from indices 200 - 399 and so on.
+  // Then, partOffsets[0] = 0 (the offset of the first byte of data in
+  // part[0]), partOffsets[1] = 200 and so on.
+  private long[] partOffsets;
+
+  // Index of the partStream corresponding to the current position of the
+  // MultipartCryptoKeyInputStream i.e. offset of the data to be read next
+  private int partIndex = 0;
+
+  // Tracks the partIndex corresponding to the last seeked position so that it
+  // can be reset if a new position is seeked.
+  private int partIndexOfPrevPos = 0;

Review comment:
       Done.




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org

[GitHub] [ozone] xiaoyuyao commented on a change in pull request #1766: HDDS-4563. Support TDE for MPU Keys on Encrypted Buckets

Posted by GitBox <gi...@apache.org>.

xiaoyuyao commented on a change in pull request #1766:
URL: https://github.com/apache/ozone/pull/1766#discussion_r569041857



##########
File path: hadoop-ozone/client/src/main/java/org/apache/hadoop/ozone/client/io/MultipartCryptoKeyInputStream.java
##########
@@ -0,0 +1,373 @@
+/**
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ * <p>
+ * http://www.apache.org/licenses/LICENSE-2.0
+ * <p>
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.hadoop.ozone.client.io;
+
+import com.google.common.base.Preconditions;
+import org.apache.hadoop.crypto.CryptoInputStream;
+import org.apache.hadoop.fs.CanUnbuffer;
+import org.apache.hadoop.fs.FSExceptionMessages;
+import org.apache.hadoop.fs.Seekable;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
+import java.io.EOFException;
+import java.io.IOException;
+import java.util.Arrays;
+import java.util.List;
+
+public class MultipartCryptoKeyInputStream extends OzoneInputStream
+    implements Seekable, CanUnbuffer {
+
+  private static final Logger LOG =
+      LoggerFactory.getLogger(MultipartCryptoKeyInputStream.class);
+
+  private static final int EOF = -1;
+
+  private String key;
+  private long length = 0L;
+  private boolean closed = false;
+
+  // List of OzoneCryptoInputStream, one for each part of the key
+  private List<OzoneCryptoInputStream> partStreams;
+
+  // partOffsets[i] stores the index of the first data byte in
+  // partStream w.r.t the whole key data.
+  // For example, let’s say the part size is 200 bytes and part[0] stores
+  // data from indices 0 - 199, part[1] from indices 200 - 399 and so on.
+  // Then, partOffsets[0] = 0 (the offset of the first byte of data in
+  // part[0]), partOffsets[1] = 200 and so on.
+  private long[] partOffsets;
+
+  // Index of the partStream corresponding to the current position of the
+  // MultipartCryptoKeyInputStream i.e. offset of the data to be read next
+  private int partIndex = 0;
+
+  // Tracks the partIndex corresponding to the last seeked position so that it
+  // can be reset if a new position is seeked.
+  private int partIndexOfPrevPos = 0;
+
+  // If a read's start/ length position doesn't coincide with a Crypto buffer
+  // boundary, it will be adjusted as reads should happen only at the buffer
+  // boundaries for decryption to happen correctly. In this case, after the
+  // data has been read and decrypted, only the requested data should be
+  // returned to the client. readPositionAdjustedBy and readLengthAdjustedBy
+  // store these adjustment information. Before returning to client, the first
+  // readPositionAdjustedBy number of bytes and the last readLengthAdjustedBy
+  // number of bytes must be discarded.
+  private int readPositionAdjustedBy = 0;
+  private int readLengthAdjustedBy = 0;
+
+  public MultipartCryptoKeyInputStream(String keyName,
+      List<OzoneCryptoInputStream> inputStreams) {
+
+    Preconditions.checkNotNull(inputStreams);
+
+    this.key = keyName;
+    this.partStreams = inputStreams;
+
+    // Calculate and update the partOffsets
+    this.partOffsets = new long[inputStreams.size()];
+    int i = 0;
+    for (OzoneCryptoInputStream ozoneCryptoInputStream : inputStreams) {
+      this.partOffsets[i++] = length;
+      length += ozoneCryptoInputStream.getLength();
+    }
+  }
+
+  /**
+   * {@inheritDoc}
+   */
+  @Override
+  public int read() throws IOException {
+    byte[] buf = new byte[1];
+    if (read(buf, 0, 1) == EOF) {
+      return EOF;
+    }
+    return Byte.toUnsignedInt(buf[0]);
+  }
+
+  /**
+   * {@inheritDoc}
+   */
+  @Override
+  public int read(byte[] b, int off, int len) throws IOException {
+    checkOpen();
+    if (b == null) {
+      throw new NullPointerException();
+    }
+    if (off < 0 || len < 0 || len > b.length - off) {
+      throw new IndexOutOfBoundsException();
+    }
+    if (len == 0) {
+      return 0;
+    }
+    int totalReadLen = 0;
+    while (len > 0) {
+      if (partStreams.size() == 0 ||
+          (partStreams.size() - 1 <= partIndex &&
+              partStreams.get(partIndex).getRemaining() == 0)) {
+        return totalReadLen == 0 ? EOF : totalReadLen;
+      }
+
+      // Get the current partStream and read data from it
+      OzoneCryptoInputStream current = partStreams.get(partIndex);
+      // CryptoInputStream reads hadoop.security.crypto.buffer.size number of
+      // bytes (default 8KB) at a time. This needs to be taken into account
+      // in calculating the numBytesToRead.
+      int numBytesToRead = getNumBytesToRead(len, (int)current.getRemaining(),
+          current.getBufferSize());
+      int numBytesRead;
+
+      if (readPositionAdjustedBy != 0 || readLengthAdjustedBy != 0) {
+        // There was some adjustment made in position and/ or length of data
+        // to be read to account for Crypto buffer boundary. Hence, read the
+        // data into a temp buffer and then copy only the requested data into
+        // clients buffer.
+        byte[] tempBuffer = new byte[numBytesToRead];
+        int actualNumBytesRead = current.read(tempBuffer, 0,
+            numBytesToRead);
+        numBytesRead = actualNumBytesRead - readPositionAdjustedBy -
+            readLengthAdjustedBy;
+
+        if (actualNumBytesRead != numBytesToRead) {
+          throw new IOException(String.format("Inconsistent read for key=%s " +
+                  "part=%d length=%d numBytesToRead(accounting for Crypto " +
+                  "boundaries)=%d numBytesRead(actual)=%d " +
+                  "numBytesToBeRead(into client buffer discarding crypto " +
+                  "boundary adjustments)=%d",
+              key, partIndex, current.getLength(), numBytesToRead,
+              actualNumBytesRead, numBytesRead));
+        }
+
+        // TODO: Byte array copies are not optimal. If there is a better and
+        //  more optimal solution to copy only a part of read data into
+        //  client buffer, this should be updated.
+        System.arraycopy(tempBuffer, readPositionAdjustedBy, b, off,
+            numBytesRead);
+
+        LOG.debug("OzoneCryptoInputStream for key: {} part: {} read {} bytes " +
+                "instead of {} bytes to account for Crypto buffer boundary. " +
+                "Client buffer will be copied with read data from position {}" +
+                "upto position {}, discarding the extra bytes read to " +
+                "maintain Crypto buffer boundary limits", key, partIndex,
+            actualNumBytesRead, numBytesRead, readPositionAdjustedBy,
+            actualNumBytesRead - readPositionAdjustedBy);
+
+        // Reset readPositionAdjustedBy and readLengthAdjustedBy
+        readPositionAdjustedBy = 0;
+        readLengthAdjustedBy = 0;
+      } else {
+        numBytesRead = current.read(b, off, numBytesToRead);
+        if (numBytesRead != numBytesToRead) {
+          throw new IOException(String.format("Inconsistent read for key=%s " +
+                  "part=%d length=%d numBytesToRead=%d numBytesRead=%d",
+              key, partIndex, current.getLength(), numBytesToRead,
+              numBytesRead));
+        }
+      }
+
+      totalReadLen += numBytesRead;
+      off += numBytesRead;
+      len -= numBytesRead;
+
+      if (current.getRemaining() <= 0 &&
+          ((partIndex + 1) < partStreams.size())) {
+        partIndex += 1;
+      }
+
+    }
+    return totalReadLen;
+  }
+
+  /**
+   * Get number of bytes to read from the current stream based on the length
+   * to be read, number of bytes remaining in the stream and the Crypto buffer
+   * size.
+   * Reads should be performed at the CryptoInputStream Buffer boundaries only.
+   * Otherwise, the decryption will be incorrect.
+   */
+  private int getNumBytesToRead(int lenToRead, int remaining,
+      int cryptoBufferSize) throws IOException {
+
+    Preconditions.checkArgument(readPositionAdjustedBy == 0);
+    Preconditions.checkArgument(readLengthAdjustedBy == 0);
+
+    // Check and adjust position if required
+    adjustReadPosition(cryptoBufferSize);
+    remaining += readPositionAdjustedBy;
+    lenToRead += readPositionAdjustedBy;
+
+    return adjustNumBytesToRead(lenToRead, remaining, cryptoBufferSize);
+  }
+
+  /**
+   * Reads should be performed at the CryptoInputStream Buffer boundary size.
+   * Otherwise, the decryption will be incorrect. Hence, if the position is
+   * not at the boundary limit, we have to adjust the position and might need
+   * to read more data than requested. The extra data will be filtered out
+   * before returning to the client.
+   */
+  private void adjustReadPosition(long cryptoBufferSize) throws IOException {
+    // Position of the buffer in current stream
+    long currentPosOfStream = partStreams.get(partIndex).getPos();
+    if (currentPosOfStream % cryptoBufferSize != 0) {
+      // Adjustment required.
+      // Update readPositionAdjustedBy and seek to the adjusted position
+      readPositionAdjustedBy = (int) (currentPosOfStream % cryptoBufferSize);
+      // Seek current partStream to adjusted position. We do not need to
+      // reset the seeked positions of other streams.
+      partStreams.get(partIndex)
+          .seek(currentPosOfStream - readPositionAdjustedBy);
+      LOG.debug("OzoneCryptoInputStream for key: {} part: {} adjusted " +
+              "position {} by -{} to account for Crypto buffer boundary",
+          key, partIndex, currentPosOfStream, readPositionAdjustedBy);
+    }
+  }
+
+  /**
+   * If the length of data requested does not end at a Crypto Buffer
+   * boundary, the number of bytes to be read must be adjusted accordingly.
+   * The extra data will be filtered out before returning to the client.
+   */
+  private int adjustNumBytesToRead(int lenToRead, int remaining,
+      int cryptoBufferSize) {
+    int numBytesToRead = Math.min(cryptoBufferSize, remaining);
+    if (lenToRead < numBytesToRead) {
+      // Adjustment required; Update readLengthAdjustedBy.
+      readLengthAdjustedBy = numBytesToRead - lenToRead;
+      LOG.debug("OzoneCryptoInputStream for key: {} part: {} adjusted length " +
+              "by +{} to account for Crypto buffer boundary",
+          key, partIndex, readLengthAdjustedBy);
+    }
+    return numBytesToRead;
+  }
+
+  /**
+   * Seeks the InputStream to the specified position. This involves 2 steps:
+   *    1. Updating the partIndex to the partStream corresponding to the
+   *    seeked position.
+   *    2. Seeking the corresponding partStream to the adjusted position.
+   *
+   * For example, let’s say the part sizes are 200 bytes and part[0] stores
+   * data from indices 0 - 199, part[1] from indices 200 - 399 and so on.
+   * Let’s say we seek to position 240. In the first step, the partIndex
+   * would be updated to 1 as indices 200 - 399 reside in partStream[1]. In
+   * the second step, the partStream[1] would be seeked to position 40 (=
+   * 240 - blockOffset[1] (= 200)).
+   */
+  @Override
+  public void seek(long pos) throws IOException {
+    if (pos == 0 && length == 0) {
+      // It is possible for length and pos to be zero in which case
+      // seek should return instead of throwing exception
+      return;
+    }
+    if (pos < 0 || pos > length) {
+      throw new EOFException("EOF encountered at pos: " + pos);
+    }
+
+    // 1. Update the partIndex
+    if (partIndex >= partStreams.size()) {
+      partIndex = Arrays.binarySearch(partOffsets, pos);
+    } else if (pos < partOffsets[partIndex]) {
+      partIndex =
+          Arrays.binarySearch(partOffsets, 0, partIndex, pos);
+    } else if (pos >= partOffsets[partIndex] + partStreams
+        .get(partIndex).getLength()) {
+      partIndex = Arrays.binarySearch(partOffsets, partIndex + 1,
+          partStreams.size(), pos);
+    }
+    if (partIndex < 0) {
+      // Binary search returns -insertionPoint - 1  if element is not present
+      // in the array. insertionPoint is the point at which element would be
+      // inserted in the sorted array. We need to adjust the blockIndex
+      // accordingly so that partIndex = insertionPoint - 1
+      partIndex = -partIndex - 2;
+    }
+
+    // Reset the previous partStream's position
+    partStreams.get(partIndexOfPrevPos).seek(0);
+
+    // Reset all the partStreams above the partIndex. We do this to reset
+    // any previous reads which might have updated the higher part
+    // streams position.
+    for (int index = partIndex + 1; index < partStreams.size(); index++) {
+      partStreams.get(index).seek(0);
+    }
+    // 2. Seek the partStream to the adjusted position
+    partStreams.get(partIndex).seek(pos - partOffsets[partIndex]);
+    partIndexOfPrevPos = partIndex;
+  }
+
+  @Override
+  public synchronized long getPos() throws IOException {
+    return length == 0 ? 0 : partOffsets[partIndex] +
+        partStreams.get(partIndex).getPos();
+  }
+
+  @Override
+  public boolean seekToNewSource(long targetPos) throws IOException {

Review comment:
       So seekToNewSource is not supported? Do we have use case requires this?

##########
File path: hadoop-ozone/client/src/main/java/org/apache/hadoop/ozone/client/io/MultipartCryptoKeyInputStream.java
##########
@@ -0,0 +1,373 @@
+/**
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ * <p>
+ * http://www.apache.org/licenses/LICENSE-2.0
+ * <p>
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.hadoop.ozone.client.io;
+
+import com.google.common.base.Preconditions;
+import org.apache.hadoop.crypto.CryptoInputStream;
+import org.apache.hadoop.fs.CanUnbuffer;
+import org.apache.hadoop.fs.FSExceptionMessages;
+import org.apache.hadoop.fs.Seekable;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
+import java.io.EOFException;
+import java.io.IOException;
+import java.util.Arrays;
+import java.util.List;
+
+public class MultipartCryptoKeyInputStream extends OzoneInputStream
+    implements Seekable, CanUnbuffer {
+
+  private static final Logger LOG =
+      LoggerFactory.getLogger(MultipartCryptoKeyInputStream.class);
+
+  private static final int EOF = -1;
+
+  private String key;
+  private long length = 0L;
+  private boolean closed = false;
+
+  // List of OzoneCryptoInputStream, one for each part of the key
+  private List<OzoneCryptoInputStream> partStreams;
+
+  // partOffsets[i] stores the index of the first data byte in
+  // partStream w.r.t the whole key data.
+  // For example, let’s say the part size is 200 bytes and part[0] stores
+  // data from indices 0 - 199, part[1] from indices 200 - 399 and so on.
+  // Then, partOffsets[0] = 0 (the offset of the first byte of data in
+  // part[0]), partOffsets[1] = 200 and so on.
+  private long[] partOffsets;
+
+  // Index of the partStream corresponding to the current position of the
+  // MultipartCryptoKeyInputStream i.e. offset of the data to be read next
+  private int partIndex = 0;
+
+  // Tracks the partIndex corresponding to the last seeked position so that it
+  // can be reset if a new position is seeked.
+  private int partIndexOfPrevPos = 0;
+
+  // If a read's start/ length position doesn't coincide with a Crypto buffer
+  // boundary, it will be adjusted as reads should happen only at the buffer
+  // boundaries for decryption to happen correctly. In this case, after the
+  // data has been read and decrypted, only the requested data should be
+  // returned to the client. readPositionAdjustedBy and readLengthAdjustedBy
+  // store these adjustment information. Before returning to client, the first
+  // readPositionAdjustedBy number of bytes and the last readLengthAdjustedBy
+  // number of bytes must be discarded.
+  private int readPositionAdjustedBy = 0;
+  private int readLengthAdjustedBy = 0;
+
+  public MultipartCryptoKeyInputStream(String keyName,
+      List<OzoneCryptoInputStream> inputStreams) {
+
+    Preconditions.checkNotNull(inputStreams);
+
+    this.key = keyName;
+    this.partStreams = inputStreams;
+
+    // Calculate and update the partOffsets
+    this.partOffsets = new long[inputStreams.size()];
+    int i = 0;
+    for (OzoneCryptoInputStream ozoneCryptoInputStream : inputStreams) {
+      this.partOffsets[i++] = length;
+      length += ozoneCryptoInputStream.getLength();
+    }
+  }
+
+  /**
+   * {@inheritDoc}
+   */
+  @Override
+  public int read() throws IOException {
+    byte[] buf = new byte[1];
+    if (read(buf, 0, 1) == EOF) {
+      return EOF;
+    }
+    return Byte.toUnsignedInt(buf[0]);
+  }
+
+  /**
+   * {@inheritDoc}
+   */
+  @Override
+  public int read(byte[] b, int off, int len) throws IOException {
+    checkOpen();
+    if (b == null) {
+      throw new NullPointerException();
+    }
+    if (off < 0 || len < 0 || len > b.length - off) {
+      throw new IndexOutOfBoundsException();
+    }
+    if (len == 0) {
+      return 0;
+    }
+    int totalReadLen = 0;
+    while (len > 0) {
+      if (partStreams.size() == 0 ||
+          (partStreams.size() - 1 <= partIndex &&
+              partStreams.get(partIndex).getRemaining() == 0)) {
+        return totalReadLen == 0 ? EOF : totalReadLen;
+      }
+
+      // Get the current partStream and read data from it
+      OzoneCryptoInputStream current = partStreams.get(partIndex);
+      // CryptoInputStream reads hadoop.security.crypto.buffer.size number of
+      // bytes (default 8KB) at a time. This needs to be taken into account
+      // in calculating the numBytesToRead.
+      int numBytesToRead = getNumBytesToRead(len, (int)current.getRemaining(),
+          current.getBufferSize());
+      int numBytesRead;
+
+      if (readPositionAdjustedBy != 0 || readLengthAdjustedBy != 0) {
+        // There was some adjustment made in position and/ or length of data
+        // to be read to account for Crypto buffer boundary. Hence, read the
+        // data into a temp buffer and then copy only the requested data into
+        // clients buffer.
+        byte[] tempBuffer = new byte[numBytesToRead];
+        int actualNumBytesRead = current.read(tempBuffer, 0,
+            numBytesToRead);
+        numBytesRead = actualNumBytesRead - readPositionAdjustedBy -
+            readLengthAdjustedBy;
+
+        if (actualNumBytesRead != numBytesToRead) {
+          throw new IOException(String.format("Inconsistent read for key=%s " +
+                  "part=%d length=%d numBytesToRead(accounting for Crypto " +
+                  "boundaries)=%d numBytesRead(actual)=%d " +
+                  "numBytesToBeRead(into client buffer discarding crypto " +
+                  "boundary adjustments)=%d",
+              key, partIndex, current.getLength(), numBytesToRead,
+              actualNumBytesRead, numBytesRead));
+        }
+
+        // TODO: Byte array copies are not optimal. If there is a better and
+        //  more optimal solution to copy only a part of read data into
+        //  client buffer, this should be updated.
+        System.arraycopy(tempBuffer, readPositionAdjustedBy, b, off,
+            numBytesRead);
+
+        LOG.debug("OzoneCryptoInputStream for key: {} part: {} read {} bytes " +
+                "instead of {} bytes to account for Crypto buffer boundary. " +
+                "Client buffer will be copied with read data from position {}" +
+                "upto position {}, discarding the extra bytes read to " +
+                "maintain Crypto buffer boundary limits", key, partIndex,
+            actualNumBytesRead, numBytesRead, readPositionAdjustedBy,
+            actualNumBytesRead - readPositionAdjustedBy);
+
+        // Reset readPositionAdjustedBy and readLengthAdjustedBy
+        readPositionAdjustedBy = 0;
+        readLengthAdjustedBy = 0;
+      } else {
+        numBytesRead = current.read(b, off, numBytesToRead);
+        if (numBytesRead != numBytesToRead) {
+          throw new IOException(String.format("Inconsistent read for key=%s " +
+                  "part=%d length=%d numBytesToRead=%d numBytesRead=%d",
+              key, partIndex, current.getLength(), numBytesToRead,
+              numBytesRead));
+        }
+      }
+
+      totalReadLen += numBytesRead;
+      off += numBytesRead;
+      len -= numBytesRead;
+
+      if (current.getRemaining() <= 0 &&
+          ((partIndex + 1) < partStreams.size())) {
+        partIndex += 1;
+      }
+
+    }
+    return totalReadLen;
+  }
+
+  /**
+   * Get number of bytes to read from the current stream based on the length
+   * to be read, number of bytes remaining in the stream and the Crypto buffer
+   * size.
+   * Reads should be performed at the CryptoInputStream Buffer boundaries only.
+   * Otherwise, the decryption will be incorrect.
+   */
+  private int getNumBytesToRead(int lenToRead, int remaining,
+      int cryptoBufferSize) throws IOException {
+
+    Preconditions.checkArgument(readPositionAdjustedBy == 0);
+    Preconditions.checkArgument(readLengthAdjustedBy == 0);
+
+    // Check and adjust position if required
+    adjustReadPosition(cryptoBufferSize);
+    remaining += readPositionAdjustedBy;
+    lenToRead += readPositionAdjustedBy;
+
+    return adjustNumBytesToRead(lenToRead, remaining, cryptoBufferSize);
+  }
+
+  /**
+   * Reads should be performed at the CryptoInputStream Buffer boundary size.
+   * Otherwise, the decryption will be incorrect. Hence, if the position is
+   * not at the boundary limit, we have to adjust the position and might need
+   * to read more data than requested. The extra data will be filtered out
+   * before returning to the client.
+   */
+  private void adjustReadPosition(long cryptoBufferSize) throws IOException {
+    // Position of the buffer in current stream
+    long currentPosOfStream = partStreams.get(partIndex).getPos();
+    if (currentPosOfStream % cryptoBufferSize != 0) {
+      // Adjustment required.
+      // Update readPositionAdjustedBy and seek to the adjusted position
+      readPositionAdjustedBy = (int) (currentPosOfStream % cryptoBufferSize);
+      // Seek current partStream to adjusted position. We do not need to
+      // reset the seeked positions of other streams.
+      partStreams.get(partIndex)
+          .seek(currentPosOfStream - readPositionAdjustedBy);
+      LOG.debug("OzoneCryptoInputStream for key: {} part: {} adjusted " +
+              "position {} by -{} to account for Crypto buffer boundary",
+          key, partIndex, currentPosOfStream, readPositionAdjustedBy);
+    }
+  }
+
+  /**
+   * If the length of data requested does not end at a Crypto Buffer
+   * boundary, the number of bytes to be read must be adjusted accordingly.
+   * The extra data will be filtered out before returning to the client.
+   */
+  private int adjustNumBytesToRead(int lenToRead, int remaining,
+      int cryptoBufferSize) {
+    int numBytesToRead = Math.min(cryptoBufferSize, remaining);
+    if (lenToRead < numBytesToRead) {
+      // Adjustment required; Update readLengthAdjustedBy.
+      readLengthAdjustedBy = numBytesToRead - lenToRead;
+      LOG.debug("OzoneCryptoInputStream for key: {} part: {} adjusted length " +
+              "by +{} to account for Crypto buffer boundary",
+          key, partIndex, readLengthAdjustedBy);
+    }
+    return numBytesToRead;
+  }
+
+  /**
+   * Seeks the InputStream to the specified position. This involves 2 steps:
+   *    1. Updating the partIndex to the partStream corresponding to the
+   *    seeked position.
+   *    2. Seeking the corresponding partStream to the adjusted position.
+   *
+   * For example, let’s say the part sizes are 200 bytes and part[0] stores
+   * data from indices 0 - 199, part[1] from indices 200 - 399 and so on.
+   * Let’s say we seek to position 240. In the first step, the partIndex
+   * would be updated to 1 as indices 200 - 399 reside in partStream[1]. In
+   * the second step, the partStream[1] would be seeked to position 40 (=
+   * 240 - blockOffset[1] (= 200)).
+   */
+  @Override
+  public void seek(long pos) throws IOException {
+    if (pos == 0 && length == 0) {
+      // It is possible for length and pos to be zero in which case
+      // seek should return instead of throwing exception
+      return;
+    }
+    if (pos < 0 || pos > length) {
+      throw new EOFException("EOF encountered at pos: " + pos);
+    }
+
+    // 1. Update the partIndex
+    if (partIndex >= partStreams.size()) {
+      partIndex = Arrays.binarySearch(partOffsets, pos);
+    } else if (pos < partOffsets[partIndex]) {
+      partIndex =
+          Arrays.binarySearch(partOffsets, 0, partIndex, pos);
+    } else if (pos >= partOffsets[partIndex] + partStreams
+        .get(partIndex).getLength()) {
+      partIndex = Arrays.binarySearch(partOffsets, partIndex + 1,
+          partStreams.size(), pos);
+    }
+    if (partIndex < 0) {
+      // Binary search returns -insertionPoint - 1  if element is not present
+      // in the array. insertionPoint is the point at which element would be
+      // inserted in the sorted array. We need to adjust the blockIndex
+      // accordingly so that partIndex = insertionPoint - 1
+      partIndex = -partIndex - 2;
+    }
+
+    // Reset the previous partStream's position
+    partStreams.get(partIndexOfPrevPos).seek(0);
+
+    // Reset all the partStreams above the partIndex. We do this to reset
+    // any previous reads which might have updated the higher part
+    // streams position.
+    for (int index = partIndex + 1; index < partStreams.size(); index++) {
+      partStreams.get(index).seek(0);
+    }
+    // 2. Seek the partStream to the adjusted position
+    partStreams.get(partIndex).seek(pos - partOffsets[partIndex]);
+    partIndexOfPrevPos = partIndex;
+  }
+
+  @Override
+  public synchronized long getPos() throws IOException {
+    return length == 0 ? 0 : partOffsets[partIndex] +

Review comment:
       Do we need to checkOpen() at the begining? 




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org

[GitHub] [ozone] xiaoyuyao commented on a change in pull request #1766: HDDS-4653. Support TDE for MPU Keys on Encrypted Buckets

Posted by GitBox <gi...@apache.org>.

xiaoyuyao commented on a change in pull request #1766:
URL: https://github.com/apache/ozone/pull/1766#discussion_r577875334



##########
File path: hadoop-ozone/ozone-manager/src/main/java/org/apache/hadoop/ozone/om/request/key/OMKeyRequest.java
##########
@@ -546,6 +546,39 @@ protected void generateRequiredEncryptionInfo(KeyArgs keyArgs,
     }
   }
 
+  protected void getFileEncryptionInfoForMpuKey(KeyArgs keyArgs,
+      KeyArgs.Builder newKeyArgs, OzoneManager ozoneManager)
+      throws IOException {
+
+    String volumeName = keyArgs.getVolumeName();
+    String bucketName = keyArgs.getBucketName();
+
+    boolean acquireLock = false;
+    OMMetadataManager omMetadataManager = ozoneManager.getMetadataManager();
+
+    if (ozoneManager.getKmsProvider() != null) {
+      try {
+        acquireLock = omMetadataManager.getLock().acquireReadLock(

Review comment:
       The suggestion is given based on the JDK lock document examples: https://docs.oracle.com/en/java/javase/11/docs/api/java.base/java/util/concurrent/locks/ReentrantReadWriteLock.html
   
   




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org