Split off log messages from diagnosing rippled problems

2025-11-16 09:45:51 +00:00 · 2018-08-29 17:07:06 -07:00
parent 07b62a7e56
commit a70ad15b24
4 changed files with 152 additions and 129 deletions
--- a/content/tutorials/manage-the-rippled-server/troubleshooting/diagnosing-problems.md
+++ b/content/tutorials/manage-the-rippled-server/troubleshooting/diagnosing-problems.md
@@ -38,7 +38,7 @@ For troubleshooting purposes, the most important fields are (from most commonly

 ### No Response from Server

-The following message indicates that the `rippled` executable wasn't able to connect as a client to the `rippled` server:
+The `rippled` executable returns the following message if it wasn't able to connect as a client to the `rippled` server:

 ```json
 {
@@ -64,134 +64,8 @@ You can control the verbosity of the debug log with the [log_level method][]. Th

 It is normal for a `rippled` the server to print many warning-level (`WRN`) messages during startup and a few warning-level messages from time to time later on. You can **safely ignore** most warnings in the first 5 to 15 minutes of server startup.

-The following sections describe some of the most common types of log messages and how to interpret them.
+For a more thorough explanation of various types of log messages, see [Understanding Log Messages](understanding-log-messages.html).

-### Crashes
-
-Messages in the log that mention runtime errors can indicate that the server crashed. These messages usually start with a message such as one of the following examples:
-
-```
-Throw<std::runtime_error>
-```
-
-```
-Terminating thread rippled: main: unhandled St13runtime_error
-```
-
-If your server always crashes on startup, see [Server Won't Start](server-wont-start.html) for possible cases.
-
-If your server crashes randomly during operation or as a result of particular commands, make sure you are [updated](updating-rippled.html) to the latest `rippled` version. If you are on the latest version and your server is still crashing, check the following:
-
- Is your server running out of memory? On some systems, `rippled` may be terminated by the Out Of Memory (OOM) Killer or another monitor process.
- If your server is running in a shared environment, are other users or administrators causing the machine or service to be restarted? For example, some hosted providers automatically kill any service that uses a large amount of a shared machine's resources for an extended period of time.
- Does your server meet the [minimum requirements](install-rippled.html#minimum-system-requirements) to run `rippled`? What about the [recommendations for production servers](capacity-planning.html#recommendation-1)?
-
-If none of the above apply, please report the issue to Ripple as a security-sensitive bug. If Ripple can reproduce the crash, you may be eligible for a bounty. See <https://ripple.com/bug-bounty/> for details.
-
-
-### Connection reset by peer
-
-The following log message indicates that a peer `rippled` server closed a connection:
-
-```text
-2018-Aug-28 22:55:41.738765510 Peer:WRN [012] onReadMessage: Connection reset by peer
-```
-
-Losing connections from time to time is normal for any peer-to-peer network. **Occasional messages of this kind do not indicate a problem.**
-
-A large number of these messages around the same time may indicate a problem, such as:
-
- Your internet connection to one or more specific peers was cut off
- Your server may have been overloading the peer with requests, causing it to drop your server
-
-
-### No hash for fetch pack
-
-```text
-2018-Aug-28 22:56:21.397076850 LedgerMaster:ERR No hash for fetch pack. Missing Index 7159808
-```
-
-***TODO: how serious is this?*** https://github.com/ripple/rippled/blob/8a02903fa5eda4daa10972800d2598b9542b02d2/src/ripple/app/ledger/impl/LedgerMaster.cpp#L526
-
-(If not serious, may be worth dropping down one or two severity levels.)
-
-### LoadMonitor Job
-
-```text
-2018-Aug-28 22:56:36.180827973 LoadMonitor:WRN Job: gotFetchPack run: 11566ms wait: 0ms
-2018-Aug-28 22:56:36.180970431 LoadMonitor:WRN Job: processLedgerData run: 0ms wait: 11566ms
-2018-Aug-28 22:56:36.181053831 LoadMonitor:WRN Job: AcquisitionDone run: 0ms wait: 11566ms
-2018-Aug-28 22:56:36.181110594 LoadMonitor:WRN Job: processLedgerData run: 0ms wait: 11566ms
-2018-Aug-28 22:56:36.181169931 LoadMonitor:WRN Job: AcquisitionDone run: 0ms wait: 11566ms
-```
-
-***TODO: how serious is this?***
-
-
-### View of consensus changed during open
-
-Log messages such as the following occur when a server is not in sync with the rest of the network:
-
-```text
-2018-Aug-28 22:56:22.368460130 LedgerConsensus:WRN View of consensus changed during open status=open,  mode=proposing
-2018-Aug-28 22:56:22.368468202 LedgerConsensus:WRN 96A8DF9ECF5E9D087BAE9DDDE38C197D3C1C6FB842C7BB770F8929E56CC71661 to 00B1E512EF558F2FD9A0A6C263B3D922297F26A55AEB56A009341A22895B516E
-2018-Aug-28 22:56:22.368499966 LedgerConsensus:WRN {"accepted":true,"account_hash":"89A821400087101F1BF2D2B912C6A9F2788CC715590E8FA5710F2D10BF5E3C03","close_flags":0,"close_time":588812130,"close_time_human":"2018-Aug-28 22:55:30.000000000","close_time_resolution":30,"closed":true,"hash":"96A8DF9ECF5E9D087BAE9DDDE38C197D3C1C6FB842C7BB770F8929E56CC71661","ledger_hash":"96A8DF9ECF5E9D087BAE9DDDE38C197D3C1C6FB842C7BB770F8929E56CC71661","ledger_index":"3","parent_close_time":588812070,"parent_hash":"5F5CB224644F080BC8E1CC10E126D62E9D7F9BE1C64AD0565881E99E3F64688A","seqNum":"3","totalCoins":"100000000000000000","total_coins":"100000000000000000","transaction_hash":"0000000000000000000000000000000000000000000000000000000000000000"}
-```
-
-During the first 5 to 15 minutes after the server starts up, it is normal for it to be out of sync with the rest of the network and print messages such as these. If the server writes these messages long after starting up, it could indicate a problem. Common causes include unreliable network connections and insufficient hardware specs.
-
-
-### Already validated sequence at or past
-
-Log messages such as the following indicate that a server received validations for different ledger sequences out of order.
-
-```text
-2018-Aug-28 22:55:58.316094260 Validations:WRN Val for 2137ACEFC0D137EFA1D84C2524A39032802E4B74F93C130A289CD87C9C565011 trusted/full from nHUeUNSn3zce2xQZWNghQvd9WRH6FWEnCBKYVJu2vAizMxnXegfJ signing key n9KcRZYHLU9rhGVwB9e4wEMYsxXvUfgFxtmX25pc1QPNgweqzQf5 already validated sequence at or past 12133663 src=1
-```
-
-Occasional messages of this type do not usually indicate a problem. If this type of message occurs frequently with the same sending validator, it could indicate a problem, including any of the following (roughly in order of most to least likely):
-
- The server writing the message is having network issues
- The validator described in the message is having network issues
- The validator described in the message is behaving maliciously
-
-
-### Unable to determine hash of ancestor
-
-Log messages such as the following occur when the server sees a validation message from a peer and it does not know the parent ledger version that server is building on. This is normal when a server is syncing to the network.
-
-```text
-2018-Aug-28 22:56:22.256065549 Validations:WRN Unable to determine hash of ancestor seq=3 from ledger hash=00B1E512EF558F2FD9A0A6C263B3D922297F26A55AEB56A009341A22895B516E seq=12133675
-```
-
-If this message occurs frequently outside of the first 5 to 15 minutes after starting the server, it could indicate a problem.
-
-
-### InboundLedger Want hash
-
-Log messages such as the following indicate that the server is requesting ledger data from other servers:
-
-```text
-InboundLedger:WRN Want: 5AE53B5E39E6388DBACD0959E5F5A0FCAF0E0DCBA45D9AB15120E8CDD21E019B
-```
-
-This is normal if your server is syncing, backfilling, or downloading [history shards](history-sharding.html).
-
-
-### InboundLedger 11 timeouts for ledger
-
-```text
-InboundLedger:WRN 11 timeouts for ledger 8265938
-```
-
-This indicates that your server is having trouble requesting specific ledger data from its peers. If the [ledger index](basic-data-types.html#ledger-index) is much lower than the most recent validated ledger's index as reported by the [server_info method][], this probably indicates that your server is downloading a [history shard](history-sharding.html).
-
-This is not strictly a problem, but if you want to acquire ledger history faster, you can configure `rippled` to connect to peers with full history by adding or editing the `[ips_fixed]` config stanza and restarting the server. For example, to always try to connect to one of Ripple's full-history servers:
-
-```
-[ips_fixed]
-s2.ripple.com 51235
-```


 <!--{# common link defs #}-->
--- a/content/tutorials/manage-the-rippled-server/troubleshooting/server-wont-start.md
+++ b/content/tutorials/manage-the-rippled-server/troubleshooting/server-wont-start.md
@@ -1,4 +1,4 @@
-# rippled Server Won't Start (Troubleshooting)
+# rippled Server Won't Start

 This page explains possible reasons the `rippled` server does not start successfully, and how to fix them.

--- a/content/tutorials/manage-the-rippled-server/troubleshooting/understanding-log-messages.md
+++ b/content/tutorials/manage-the-rippled-server/troubleshooting/understanding-log-messages.md
@@ -0,0 +1,139 @@
+# Understanding Log Messages
+
+The following sections describe some of the most common types of log messages that can appear in a `rippled` server's debug log and how to interpret them.
+
+This is an important step in [Diagnosing Problems](diagnosing-problems.html) with `rippled`.
+
+## Crashes
+
+Messages in the log that mention runtime errors can indicate that the server crashed. These messages usually start with a message such as one of the following examples:
+
+```
+Throw<std::runtime_error>
+```
+
+```
+Terminating thread rippled: main: unhandled St13runtime_error
+```
+
+If your server always crashes on startup, see [Server Won't Start](server-wont-start.html) for possible cases.
+
+If your server crashes randomly during operation or as a result of particular commands, make sure you are [updated](updating-rippled.html) to the latest `rippled` version. If you are on the latest version and your server is still crashing, check the following:
+
+- Is your server running out of memory? On some systems, `rippled` may be terminated by the Out Of Memory (OOM) Killer or another monitor process.
+- If your server is running in a shared environment, are other users or administrators causing the machine or service to be restarted? For example, some hosted providers automatically kill any service that uses a large amount of a shared machine's resources for an extended period of time.
+- Does your server meet the [minimum requirements](install-rippled.html#minimum-system-requirements) to run `rippled`? What about the [recommendations for production servers](capacity-planning.html#recommendation-1)?
+
+If none of the above apply, please report the issue to Ripple as a security-sensitive bug. If Ripple can reproduce the crash, you may be eligible for a bounty. See <https://ripple.com/bug-bounty/> for details.
+
+
+## Connection reset by peer
+
+The following log message indicates that a peer `rippled` server closed a connection:
+
+```text
+2018-Aug-28 22:55:41.738765510 Peer:WRN [012] onReadMessage: Connection reset by peer
+```
+
+Losing connections from time to time is normal for any peer-to-peer network. **Occasional messages of this kind do not indicate a problem.**
+
+A large number of these messages around the same time may indicate a problem, such as:
+
+- Your internet connection to one or more specific peers was cut off
+- Your server may have been overloading the peer with requests, causing it to drop your server
+
+
+## No hash for fetch pack
+
+```text
+2018-Aug-28 22:56:21.397076850 LedgerMaster:ERR No hash for fetch pack. Missing Index 7159808
+```
+
+***TODO: how serious is this?*** https://github.com/ripple/rippled/blob/8a02903fa5eda4daa10972800d2598b9542b02d2/src/ripple/app/ledger/impl/LedgerMaster.cpp#L526
+
+(If not serious, may be worth dropping down one or two severity levels.)
+
+## LoadMonitor Job
+
+```text
+2018-Aug-28 22:56:36.180827973 LoadMonitor:WRN Job: gotFetchPack run: 11566ms wait: 0ms
+2018-Aug-28 22:56:36.180970431 LoadMonitor:WRN Job: processLedgerData run: 0ms wait: 11566ms
+2018-Aug-28 22:56:36.181053831 LoadMonitor:WRN Job: AcquisitionDone run: 0ms wait: 11566ms
+2018-Aug-28 22:56:36.181110594 LoadMonitor:WRN Job: processLedgerData run: 0ms wait: 11566ms
+2018-Aug-28 22:56:36.181169931 LoadMonitor:WRN Job: AcquisitionDone run: 0ms wait: 11566ms
+```
+
+***TODO: how serious is this?***
+
+
+## View of consensus changed during open
+
+Log messages such as the following occur when a server is not in sync with the rest of the network:
+
+```text
+2018-Aug-28 22:56:22.368460130 LedgerConsensus:WRN View of consensus changed during open status=open,  mode=proposing
+2018-Aug-28 22:56:22.368468202 LedgerConsensus:WRN 96A8DF9ECF5E9D087BAE9DDDE38C197D3C1C6FB842C7BB770F8929E56CC71661 to 00B1E512EF558F2FD9A0A6C263B3D922297F26A55AEB56A009341A22895B516E
+2018-Aug-28 22:56:22.368499966 LedgerConsensus:WRN {"accepted":true,"account_hash":"89A821400087101F1BF2D2B912C6A9F2788CC715590E8FA5710F2D10BF5E3C03","close_flags":0,"close_time":588812130,"close_time_human":"2018-Aug-28 22:55:30.000000000","close_time_resolution":30,"closed":true,"hash":"96A8DF9ECF5E9D087BAE9DDDE38C197D3C1C6FB842C7BB770F8929E56CC71661","ledger_hash":"96A8DF9ECF5E9D087BAE9DDDE38C197D3C1C6FB842C7BB770F8929E56CC71661","ledger_index":"3","parent_close_time":588812070,"parent_hash":"5F5CB224644F080BC8E1CC10E126D62E9D7F9BE1C64AD0565881E99E3F64688A","seqNum":"3","totalCoins":"100000000000000000","total_coins":"100000000000000000","transaction_hash":"0000000000000000000000000000000000000000000000000000000000000000"}
+```
+
+During the first 5 to 15 minutes after the server starts up, it is normal for it to be out of sync with the rest of the network and print messages such as these. If the server writes these messages long after starting up, it could indicate a problem. Common causes include unreliable network connections and insufficient hardware specs.
+
+
+## Already validated sequence at or past
+
+Log messages such as the following indicate that a server received validations for different ledger sequences out of order.
+
+```text
+2018-Aug-28 22:55:58.316094260 Validations:WRN Val for 2137ACEFC0D137EFA1D84C2524A39032802E4B74F93C130A289CD87C9C565011 trusted/full from nHUeUNSn3zce2xQZWNghQvd9WRH6FWEnCBKYVJu2vAizMxnXegfJ signing key n9KcRZYHLU9rhGVwB9e4wEMYsxXvUfgFxtmX25pc1QPNgweqzQf5 already validated sequence at or past 12133663 src=1
+```
+
+Occasional messages of this type do not usually indicate a problem. If this type of message occurs frequently with the same sending validator, it could indicate a problem, including any of the following (roughly in order of most to least likely):
+
+- The server writing the message is having network issues
+- The validator described in the message is having network issues
+- The validator described in the message is behaving maliciously
+
+
+## Unable to determine hash of ancestor
+
+Log messages such as the following occur when the server sees a validation message from a peer and it does not know the parent ledger version that server is building on. This is normal when a server is syncing to the network.
+
+```text
+2018-Aug-28 22:56:22.256065549 Validations:WRN Unable to determine hash of ancestor seq=3 from ledger hash=00B1E512EF558F2FD9A0A6C263B3D922297F26A55AEB56A009341A22895B516E seq=12133675
+```
+
+If this message occurs frequently outside of the first 5 to 15 minutes after starting the server, it could indicate a problem.
+
+
+## InboundLedger Want hash
+
+Log messages such as the following indicate that the server is requesting ledger data from other servers:
+
+```text
+InboundLedger:WRN Want: 5AE53B5E39E6388DBACD0959E5F5A0FCAF0E0DCBA45D9AB15120E8CDD21E019B
+```
+
+This is normal if your server is syncing, backfilling, or downloading [history shards](history-sharding.html).
+
+
+## InboundLedger 11 timeouts for ledger
+
+```text
+InboundLedger:WRN 11 timeouts for ledger 8265938
+```
+
+This indicates that your server is having trouble requesting specific ledger data from its peers. If the [ledger index](basic-data-types.html#ledger-index) is much lower than the most recent validated ledger's index as reported by the [server_info method][], this probably indicates that your server is downloading a [history shard](history-sharding.html).
+
+This is not strictly a problem, but if you want to acquire ledger history faster, you can configure `rippled` to connect to peers with full history by adding or editing the `[ips_fixed]` config stanza and restarting the server. For example, to always try to connect to one of Ripple's full-history servers:
+
+```
+[ips_fixed]
+s2.ripple.com 51235
+```
+
+
+
+<!--{# common link defs #}-->
+{% include '_snippets/rippled-api-links.md' %}
+{% include '_snippets/tx-type-links.md' %}
+{% include '_snippets/rippled_versions.md' %}
--- a/dactyl-config.yml
+++ b/dactyl-config.yml
@@ -966,6 +966,16 @@ pages:
        targets:
            - local

+    -   md: tutorials/manage-the-rippled-server/troubleshooting/understanding-log-messages.md
+        html: understanding-log-messages.html
+        funnel: Docs
+        doc_type: Tutorials
+        category: Manage the rippled Server
+        subcategory: Troubleshooting rippled
+        blurb: Interpret and respond to warning and error messages in the debug log.
+        targets:
+            - local
+
    -   md: tutorials/manage-the-rippled-server/troubleshooting/server-wont-start.md
        html: server-wont-start.html
        funnel: Docs