You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hugegraph.apache.org by "LiJie20190102 (via GitHub)" <gi...@apache.org> on 2023/03/31 08:33:31 UTC

[GitHub] [incubator-hugegraph] LiJie20190102 opened a new issue, #2187: [Question] The number of edges I queried is inconsistent with the number of edges I imported

LiJie20190102 opened a new issue, #2187:
URL: https://github.com/apache/incubator-hugegraph/issues/2187

   ### Problem Type (问题类型)
   
   others (please edit later)
   
   ### Before submit
   
   - [X] 我已经确认现有的 [Issues](https://github.com/apache/hugegraph/issues) 与 [FAQ](https://hugegraph.apache.org/docs/guides/faq/) 中没有相同 / 重复问题 (I have confirmed and searched that there are no similar problems in the historical issue and documents)
   
   ### Environment (环境信息)
   
   - Server Version: 1.0.0 (Apache Release Version)
   - Backend: RocksDB x nodes, HDD or SSD 
   - OS: xx CPUs, xx G RAM, Ubuntu 2x.x / CentOS 7.x 
   - Data Size:  65608366 vertices, 1806067135 edges <!-- (like 1000W 点, 9000W 边) -->
   
   
   ### Your Question (问题描述)
   
   I imported 65608366 vertices and 1806067135 edges. When I used hugegraph-computer or gremlin to query, the number of query edges was correct. However, when I used "hugeClient.traverser().iteratorEdges(shard, 500)" to query the number of edges for each shard, and finally accumulated it, I found that there was an additional number of edges (1806312225 at this time). I don't know why the numbers were inconsistent. Can't we use "hugeClient. traverer(). iteratorEdges" to obtain the data size of all edges?
   
    hugegraph-computer log:
   ![image](https://user-images.githubusercontent.com/53458004/229066205-5b39d497-c1d4-4aba-9c2d-1bc430c895e5.png)
   
   gremlin  result:
   ![image](https://user-images.githubusercontent.com/53458004/229067204-87c0718d-5154-446c-a80d-c7c3e3a7f77f.png)
   
   
   "hugeClient.traverser().iteratorEdges(shard, 500)" detail:
   Step 1:Query all shards information   (http://x.x.x.x:8065/graphs/hugegraph/traversers/edges/shards?split_size=1048576)
   Step 2:Use "hugeClient. traverser(). iteratorEdges" to obtain the number of edges for each shard and then sum them。
   result: Number of edges is 1806312225 , not 1806067135 .
   
   
   
   
   ### Vertex/Edge example (问题点 / 边数据举例)
   
   _No response_
   
   ### Schema [VertexLabel, EdgeLabel, IndexLabel] (元数据结构)
   
   ```javascript
   {
       "vertexlabels": [
           {
               "id": 1,
               "name": "person",
               "id_strategy": "CUSTOMIZE_NUMBER",
               "primary_keys": [],
               "nullable_keys": [],
               "index_labels": [
                   "personByAge"
               ],
               "properties": [
                   "id"
               ],
               "status": "CREATED",
               "ttl": 0,
               "enable_label_index": true,
               "user_data": {
                   "~create_time": "2023-03-13 09:52:29.084"
               }
           }
       ]
   }
   
   {
       "edgelabels": [
           {
               "id": 1,
               "name": "friend",
               "source_label": "person",
               "target_label": "person",
               "frequency": "SINGLE",
               "sort_keys": [],
               "nullable_keys": [],
               "index_labels": [],
               "properties": [],
               "status": "CREATED",
               "ttl": 0,
               "enable_label_index": true,
               "user_data": {
                   "~create_time": "2023-03-13 09:52:30.760"
               }
           }
       ]
   }
   ```
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org
For additional commands, e-mail: issues-help@hugegraph.apache.org


[GitHub] [incubator-hugegraph] LiJie20190102 commented on issue #2187: [Question] The number of edges I queried is inconsistent with the number of edges I imported

Posted by "LiJie20190102 (via GitHub)" <gi...@apache.org>.
LiJie20190102 commented on issue #2187:
URL: https://github.com/apache/incubator-hugegraph/issues/2187#issuecomment-1554247701

   @javeme @imbajin @coderzc 
   We are planning to use hugegraph in the production environment, but we are currently experiencing this issue. Please help us solve it as soon as possible. Thank you all


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org
For additional commands, e-mail: issues-help@hugegraph.apache.org


[GitHub] [incubator-hugegraph] LiJie20190102 commented on issue #2187: [Question] The number of edges I queried is inconsistent with the number of edges I imported

Posted by "LiJie20190102 (via GitHub)" <gi...@apache.org>.
LiJie20190102 commented on issue #2187:
URL: https://github.com/apache/incubator-hugegraph/issues/2187#issuecomment-1541240162

   @javeme @imbajin @coderzc Hello, do you have any relevant conclusions?
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org
For additional commands, e-mail: issues-help@hugegraph.apache.org


[GitHub] [incubator-hugegraph] LiJie20190102 commented on issue #2187: [Question] The number of edges I queried is inconsistent with the number of edges I imported

Posted by "LiJie20190102 (via GitHub)" <gi...@apache.org>.
LiJie20190102 commented on issue #2187:
URL: https://github.com/apache/incubator-hugegraph/issues/2187#issuecomment-1495508925

   When I used 'count (-1)', there were some exceptions
   
   ![企业微信截图_16805945976451](https://user-images.githubusercontent.com/53458004/229724687-6c5cf8c2-c43a-41c8-8557-cdca300e53ba.png)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org
For additional commands, e-mail: issues-help@hugegraph.apache.org


[GitHub] [incubator-hugegraph] imbajin commented on issue #2187: [Question] The number of edges I queried is inconsistent with the number of edges I imported

Posted by "imbajin (via GitHub)" <gi...@apache.org>.
imbajin commented on issue #2187:
URL: https://github.com/apache/incubator-hugegraph/issues/2187#issuecomment-1495599884

   > When I used 'count (-1)', there were some exceptions
   > 
   > ![企业微信截图_16805945976451](https://user-images.githubusercontent.com/53458004/229724687-6c5cf8c2-c43a-41c8-8557-cdca300e53ba.png)
   
   use async way to execute gremlin instead, refer [async-gremlin](https://hugegraph.apache.org/docs/clients/restful-api/gremlin/#813-%E5%90%91hugegraphserver%E5%8F%91%E9%80%81gremlin%E8%AF%AD%E5%8F%A5post%E5%BC%82%E6%AD%A5%E6%89%A7%E8%A1%8C)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org
For additional commands, e-mail: issues-help@hugegraph.apache.org


[GitHub] [incubator-hugegraph] javeme commented on issue #2187: [Question] The number of edges I queried is inconsistent with the number of edges I imported

Posted by "javeme (via GitHub)" <gi...@apache.org>.
javeme commented on issue #2187:
URL: https://github.com/apache/incubator-hugegraph/issues/2187#issuecomment-1511248248

   please note the 'count (-1)' may  mean `.limit(-1).count()`


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org
For additional commands, e-mail: issues-help@hugegraph.apache.org


[GitHub] [incubator-hugegraph] javeme commented on issue #2187: [Question] The number of edges I queried is inconsistent with the number of edges I imported

Posted by "javeme (via GitHub)" <gi...@apache.org>.
javeme commented on issue #2187:
URL: https://github.com/apache/incubator-hugegraph/issues/2187#issuecomment-1542224547

   > https://blog.csdn.net/penriver/article/details/115124350. We conducted the test based on this article, and the number of edges and vertices is consistent with the article. Please help with this, thank you . @coderzc @imbajin
   
   @LiJie20190102 do you mean the counts of iteratorEdges() and g.E().count() with the backend rocksdb: `count(iteratorEdges())` != `g.E().count()`


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org
For additional commands, e-mail: issues-help@hugegraph.apache.org


[GitHub] [incubator-hugegraph] LiJie20190102 commented on issue #2187: [Question] The number of edges I queried is inconsistent with the number of edges I imported

Posted by "LiJie20190102 (via GitHub)" <gi...@apache.org>.
LiJie20190102 commented on issue #2187:
URL: https://github.com/apache/incubator-hugegraph/issues/2187#issuecomment-1492869286

   > Thanks a lot for the details, could u tell us how to reproduce it with the minimum data?
   
   Sorry, I don't know yet. When the number of vertices is 65608366, it is still found to be the correct number


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org
For additional commands, e-mail: issues-help@hugegraph.apache.org


[GitHub] [incubator-hugegraph] LiJie20190102 commented on issue #2187: [Question] The number of edges I queried is inconsistent with the number of edges I imported

Posted by "LiJie20190102 (via GitHub)" <gi...@apache.org>.
LiJie20190102 commented on issue #2187:
URL: https://github.com/apache/incubator-hugegraph/issues/2187#issuecomment-1519413809

   > please note the 'count (-1)' may mean `.limit(-1).count()`
   The result is
   ![image](https://user-images.githubusercontent.com/53458004/233908675-c35be007-f97c-45fd-a76d-e5d404a06499.png)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org
For additional commands, e-mail: issues-help@hugegraph.apache.org


[GitHub] [incubator-hugegraph] imbajin commented on issue #2187: [Question] The number of edges I queried is inconsistent with the number of edges I imported

Posted by "imbajin (via GitHub)" <gi...@apache.org>.
imbajin commented on issue #2187:
URL: https://github.com/apache/incubator-hugegraph/issues/2187#issuecomment-1493590225

   > @coderzc @imbajin Hello, are you dealing with this issue? I think this issue is more important. Thank you for helping me with it
   
   we need to know how to reproduce it first,thanks


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org
For additional commands, e-mail: issues-help@hugegraph.apache.org


[GitHub] [incubator-hugegraph] LiJie20190102 commented on issue #2187: [Question] The number of edges I queried is inconsistent with the number of edges I imported

Posted by "LiJie20190102 (via GitHub)" <gi...@apache.org>.
LiJie20190102 commented on issue #2187:
URL: https://github.com/apache/incubator-hugegraph/issues/2187#issuecomment-1493593329

   > > @coderzc @imbajin Hello, are you dealing with this issue? I think this issue is more important. Thank you for helping me with it
   > 
   > we need to know how to reproduce it first,thanks
   
   The problem scenario is as follows:
   
   1. First, import 65608366 vertices and 1806067135 edges;
   
   ![image](https://user-images.githubusercontent.com/53458004/229405012-45c701ba-9ce5-407d-9862-6581f0a57199.png)
   
   
   3. When I used "hugeClient. traverser(). iteratorEdges (shard, 500)" to query and sum the number of edges for each shard, I found that it was 1806312225, not 1806067135.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org
For additional commands, e-mail: issues-help@hugegraph.apache.org


[GitHub] [incubator-hugegraph] imbajin commented on issue #2187: [Question] The number of edges I queried is inconsistent with the number of edges I imported

Posted by "imbajin (via GitHub)" <gi...@apache.org>.
imbajin commented on issue #2187:
URL: https://github.com/apache/incubator-hugegraph/issues/2187#issuecomment-1495442096

   > blog.csdn.net/penriver/article/details/115124350
   
   OK. get it, thanks for the feedback, you could also try `count(-1)` in gremlin query


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org
For additional commands, e-mail: issues-help@hugegraph.apache.org


[GitHub] [incubator-hugegraph] LiJie20190102 commented on issue #2187: [Question] The number of edges I queried is inconsistent with the number of edges I imported

Posted by "LiJie20190102 (via GitHub)" <gi...@apache.org>.
LiJie20190102 commented on issue #2187:
URL: https://github.com/apache/incubator-hugegraph/issues/2187#issuecomment-1542247878

   > > https://blog.csdn.net/penriver/article/details/115124350. We conducted the test based on this article, and the number of edges and vertices is consistent with the article. Please help with this, thank you . @coderzc @imbajin
   > 
   > @LiJie20190102 do you mean the counts of iteratorEdges() and g.E().count() with the backend rocksdb: `count(iteratorEdges())` != `g.E().count()`
   
   yeah


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org
For additional commands, e-mail: issues-help@hugegraph.apache.org


[GitHub] [incubator-hugegraph] imbajin commented on issue #2187: [Question] The number of edges I queried is inconsistent with the number of edges I imported

Posted by "imbajin (via GitHub)" <gi...@apache.org>.
imbajin commented on issue #2187:
URL: https://github.com/apache/incubator-hugegraph/issues/2187#issuecomment-1554385745

   > @javeme @imbajin @coderzc We are planning to use hugegraph in the production environment, but we are currently experiencing this issue. Please help us solve it as soon as possible. Thank you all
   
   We welcome you to use HugeGraph. The imprecision of shard may be caused by some **empty hole**, but we need a way to reproduce it for confirmation & lack the time/priority for now..
   
   In addition, because this case is relatively small, it can only be solved during scheduling. If emergency positioning/special support is needed, you can reply "support" in the Wechat official account
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org
For additional commands, e-mail: issues-help@hugegraph.apache.org


[GitHub] [incubator-hugegraph] LiJie20190102 commented on issue #2187: [Question] The number of edges I queried is inconsistent with the number of edges I imported

Posted by "LiJie20190102 (via GitHub)" <gi...@apache.org>.
LiJie20190102 commented on issue #2187:
URL: https://github.com/apache/incubator-hugegraph/issues/2187#issuecomment-1493580195

   @imbajin Hello, are you dealing with this issue? I think this issue is more important. Thank you for helping me with it


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org
For additional commands, e-mail: issues-help@hugegraph.apache.org


[GitHub] [incubator-hugegraph] imbajin commented on issue #2187: [Question] The number of edges I queried is inconsistent with the number of edges I imported

Posted by "imbajin (via GitHub)" <gi...@apache.org>.
imbajin commented on issue #2187:
URL: https://github.com/apache/incubator-hugegraph/issues/2187#issuecomment-1492131592

   Thanks a lot for the details, could u tell us how to reproduce it with the minimum data?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org
For additional commands, e-mail: issues-help@hugegraph.apache.org


[GitHub] [incubator-hugegraph] LiJie20190102 commented on issue #2187: [Question] The number of edges I queried is inconsistent with the number of edges I imported

Posted by "LiJie20190102 (via GitHub)" <gi...@apache.org>.
LiJie20190102 commented on issue #2187:
URL: https://github.com/apache/incubator-hugegraph/issues/2187#issuecomment-1495430036

   https://blog.csdn.net/penriver/article/details/115124350.   We conducted the test based on this article, and the number of edges and vertices is consistent with the article. Please help with this, thank you . @coderzc @imbajin 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org
For additional commands, e-mail: issues-help@hugegraph.apache.org


[GitHub] [incubator-hugegraph] LiJie20190102 commented on issue #2187: [Question] The number of edges I queried is inconsistent with the number of edges I imported

Posted by "LiJie20190102 (via GitHub)" <gi...@apache.org>.
LiJie20190102 commented on issue #2187:
URL: https://github.com/apache/incubator-hugegraph/issues/2187#issuecomment-1495920772

   When I use count (-1), I am unable to query the correct data as it displays as 0.
   ![image](https://user-images.githubusercontent.com/53458004/229757785-cafcca04-cb1d-4eca-9c8a-ee108f35ee5e.png)
   
   
   ![image](https://user-images.githubusercontent.com/53458004/229758708-ce683a14-3c83-4086-a83b-744d6ddd5a81.png)
   
   At the same time, when I use count(), I can find the correct data:
   ![image](https://user-images.githubusercontent.com/53458004/229797500-045d4b59-d980-4184-b472-7fe01430f7bc.png)
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@hugegraph.apache.org
For additional commands, e-mail: issues-help@hugegraph.apache.org