You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@mxnet.apache.org by GitBox <gi...@apache.org> on 2022/03/02 10:23:18 UTC
[GitHub] [incubator-mxnet] bgawrych opened a new pull request #20925: [v1.x] Reduce after quantization memory usage
bgawrych opened a new pull request #20925:
URL: https://github.com/apache/incubator-mxnet/pull/20925
## Description ##
Port of https://github.com/apache/incubator-mxnet/pull/20894
Script:
```
import mxnet as mx
from mxnet.gluon.model_zoo import vision
import psutil
import os
def get_process_memory():
process = psutil.Process(os.getpid())
mem_info = process.memory_info()
return mem_info.rss * 1e-6
batch_shape = (1, 3, 224, 224)
data = mx.nd.random.normal(shape=batch_shape)
print("memory before loading model: ", get_process_memory())
net = vision.resnet50_v1(pretrained=True)
print("memory after loading model: ", get_process_memory())
out = net(data)
out.wait_to_read()
print("memory after fp32 forward pass", get_process_memory())
indata = {'data':data}
label = {'label':mx.nd.zeros(shape=(1,))}
dataiter = mx.io.NDArrayIter(indata, label, 3, True, last_batch_handle='discard')
net_quantized = mx.contrib.quant.quantize_net(net, quantized_dtype='auto',
quantize_mode="smart",
calib_mode='naive',
calib_data=dataiter,
num_calib_examples=1,
ctx=mx.current_context())
print("memory after quantization: ", get_process_memory())
outputs = net_quantized(data)
outputs.wait_to_read()
print("memory after int8 forward pass: ", get_process_memory())
```
Output before:
```
memory before loading model: 201.936896
memory after loading model: 433.41004799999996
memory after fp32 forward pass 523.698176
memory after quantization: 1308.803072
memory after int8 forward pass: 1313.349632
```
Output after:
```
memory before loading model: 202.502144
memory after loading model: 434.184192
memory after fp32 forward pass 520.986624
memory after quantization: 1136.570368
memory after int8 forward pass: 1141.485568
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: commits-unsubscribe@mxnet.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [incubator-mxnet] bgawrych commented on pull request #20925: [v1.x] Reduce after quantization memory usage
Posted by GitBox <gi...@apache.org>.
bgawrych commented on pull request #20925:
URL: https://github.com/apache/incubator-mxnet/pull/20925#issuecomment-1058237954
@mxnet-bot run ci [windows-gpu]
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: commits-unsubscribe@mxnet.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [incubator-mxnet] bgawrych merged pull request #20925: [v1.x] Reduce after quantization memory usage
Posted by GitBox <gi...@apache.org>.
bgawrych merged pull request #20925:
URL: https://github.com/apache/incubator-mxnet/pull/20925
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: commits-unsubscribe@mxnet.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [incubator-mxnet] mxnet-bot commented on pull request #20925: [v1.x] Reduce after quantization memory usage
Posted by GitBox <gi...@apache.org>.
mxnet-bot commented on pull request #20925:
URL: https://github.com/apache/incubator-mxnet/pull/20925#issuecomment-1058238016
Jenkins CI successfully triggered : [windows-gpu]
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: commits-unsubscribe@mxnet.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [incubator-mxnet] mxnet-bot commented on pull request #20925: [v1.x] Reduce after quantization memory usage
Posted by GitBox <gi...@apache.org>.
mxnet-bot commented on pull request #20925:
URL: https://github.com/apache/incubator-mxnet/pull/20925#issuecomment-1056759024
Hey @bgawrych , Thanks for submitting the PR
All tests are already queued to run once. If tests fail, you can trigger one or more tests again with the following commands:
- To trigger all jobs: @mxnet-bot run ci [all]
- To trigger specific jobs: @mxnet-bot run ci [job1, job2]
***
**CI supported jobs**: [sanity, website, windows-gpu, clang, centos-cpu, windows-cpu, unix-gpu, unix-cpu, edge, miscellaneous, centos-gpu]
***
_Note_:
Only following 3 categories can trigger CI :PR Author, MXNet Committer, Jenkins Admin.
All CI tests must pass before the PR can be merged.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: commits-unsubscribe@mxnet.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org