Expand --nstimes option #421

tgreenx · 2025-02-11T17:53:03Z

Purpose

This PR proposes to expand the --nstimes option in several ways. See the "Changes" section below.

Note: this option applies only to "sent" queries. Cached queries are not counted.

Context

N/A

Changes

Extend to all queried name servers: this means that non-queried name servers will not appear, with the exception of name servers of the child or parent zone
Add classification of name servers ("child", "parent", "other"). Note that name servers shared by the child and parent zone will appear in both categories.
Add "Count" column as the number of queries sent per name server
Add "Grand total" row for query times and query count
Refactoring

How to test this PR

Run any test with the --nstimes option. Check that the formatting is done correctly.

Example output:

$ zonemaster-cli --nstimes --raw --no-ipv6 --test basic02 afnic.fr

                     Name servers         Max        Min        Avg     Stddev     Median       Total       Count
=================================  ========== ========== ========== ========== ========== =========== ===========
Child zone ----------------------
          g.ext.nic.fr/194.0.36.1      110.98      33.27      72.88      22.62      69.07     1093.16          15
      g.ext.nic.fr/2001:678:4c::1        0.00       0.00       0.00       0.00       0.00        0.00           0
           ns1.nic.fr/192.134.4.1        4.78       4.78       4.78       0.00       4.78        4.78           1
  ns1.nic.fr/2001:67c:2218:2::4:1        0.00       0.00       0.00       0.00       0.00        0.00           0
            ns2.nic.fr/192.93.0.4        4.82       4.82       4.82       0.00       4.82        4.82           1
  ns2.nic.fr/2001:660:3005:1::1:2        0.00       0.00       0.00       0.00       0.00        0.00           0
          ns3.nic.fr/192.134.0.49        4.96       4.96       4.96       0.00       4.96        4.96           1
  ns3.nic.fr/2001:660:3006:1::1:1        0.00       0.00       0.00       0.00       0.00        0.00           0
Parent zone ---------------------
               d.nic.fr/194.0.9.1      113.08      60.96      87.02      26.06      87.02      174.05           2
           d.nic.fr/2001:678:c::1        0.00       0.00       0.00       0.00       0.00        0.00           0
      f.ext.nic.fr/194.146.106.46        0.00       0.00       0.00       0.00       0.00        0.00           0
f.ext.nic.fr/2001:67c:1010:11::53        0.00       0.00       0.00       0.00       0.00        0.00           0
          g.ext.nic.fr/194.0.36.1      110.98      33.27      72.88      22.62      69.07     1093.16          15
      g.ext.nic.fr/2001:678:4c::1        0.00       0.00       0.00       0.00       0.00        0.00           0
Other ---------------------------
    a.root-servers.net/198.41.0.4      113.63     113.63     113.63       0.00     113.63      113.63           1
  m.root-servers.net/202.12.27.33      110.43       8.39      43.07      21.62      39.96      602.93          14
=================================  ========== ========== ========== ========== ========== =========== ===========
                      Grand total                                                             1998.33          35

mattias-p

This change makes the --nstimes option go outside of timing territory and into the wider statistics territory, I guess. I think it begs the question if there are other per-nameserver stats that we would be interested in, and if so we should consider having an --nsstats option. But IMHO that's out of scope for this PR.

marc-vanderwal

Yes, for those statistics it makes perfect sense to have the sample size. Could we also have a count of the number of queries answered? (Or conversely, the number of “lost” packets?)

tgreenx · 2025-02-13T09:11:15Z

Could we also have a count of the number of queries answered? (Or conversely, the number of “lost” packets?)

We could, but not with the current implementation of Zonemaster::Engine::Nameserver.

tgreenx · 2025-02-25T18:45:56Z

@mattias-p @marc-vanderwal Sorry, I've further extended this PR with some new changes, could you make a fresh new review ? Thanks.

lib/Zonemaster/CLI.pm

mattias-p · 2025-03-11T18:08:22Z

lib/Zonemaster/CLI.pm

+                $total_queries_count += scalar @{ $ns->times } unless $nss_already_processed{$ns};
+                $total_queries_times += ( 1000 * $ns->sum_time ) unless $nss_already_processed{$ns};
+
+                return $total_queries_count, $total_queries_times, %nss_already_processed;


Isn't it better to move this summation out from print_nstimes and into its own loop?

matsduf · 2025-03-12T14:28:43Z

What are the use cases for using --nstimes?

Note: this option applies only to "sent" queries. Cached queries are not counted.

Is there any technical reason to exclude those? Otherwise it could be reasonable to see the queries to cache too. Will there be any difference if global or local cache is used? If so, it can be confusing when using global cache.

mattias-p · 2025-03-12T15:53:06Z

Note: this option applies only to "sent" queries. Cached queries are not counted.

Is there any technical reason to exclude those? Otherwise it could be reasonable to see the queries to cache too.

AFAICT what is actually interesting is the number of distinct requests. That number should to be the same whether or not recorded data is used, and whether or not intermediate results within Engine are cached. I believe the number of queries that are actually sent, the number of queries that would be sent if we didn't cache, and the sum of those are of little interest.

script/zonemaster-cli

matsduf · 2025-03-12T16:17:36Z

AFAICT what is actually interesting is the number of distinct requests. That number should to be the same whether or not recorded data is used, and whether or not intermediate results within Engine are cached. I believe the number of queries that are actually sent, the number of queries that would be sent if we didn't cache, and the sum of those are of little interest.

@mattias-p, what use case do consider when writing this? Do you see other use cases of the --ns-times option?

mattias-p · 2025-03-13T10:26:11Z

AFAICT what is actually interesting is the number of distinct requests. That number should to be the same whether or not recorded data is used, and whether or not intermediate results within Engine are cached. I believe the number of queries that are actually sent, the number of queries that would be sent if we didn't cache, and the sum of those are of little interest.

@mattias-p, what use case do consider when writing this? Do you see other use cases of the --ns-times option?

I'm not really thinking about use cases. I'm thinking about how to make meaningful measurements. A measurement is more meaningful when it is variable in a single dimension and isn't influenced by multiple different things. So I meant to suggest that we could define the count so that it isn't affected by whether caching options and implementation details.

matsduf · 2025-03-14T16:21:19Z

I think it is reasonable to have some use cases in mind when defining what figures to extract and present. Maybe it could be useful to distinguish between queries that are responded from cache and those that require sending a message externally.

tgreenx · 2025-06-04T17:02:25Z

@matsduf @mattias-p @marc-vanderwal please re-review this PR.

matsduf

This looks fine as function, but I think there is a lack of documentation. I am not sure that the best place is to add documentation into the scripts. Rather I think a markdown document in the document tree would be better, but a reference from here.

Please share your thoughts of documentation and I am ready to approve.

mattias-p

I comments for a bunch for remarks and questions and whatnot. And here's another one that I didn't know where to put as a comment:

The section mapping is defined once for the JSON output and once for table output. And the grand totals are calculated as part of the table output logic. IMHO it would be better to set up the data to be shown just once and do it before we branch to the individual output formats.

lib/Zonemaster/CLI.pm

mattias-p · 2025-06-05T07:57:36Z

lib/Zonemaster/CLI.pm

+            }
+
+            printf "%${max}s %s\n", '=' x $max, ' ========== ========== ========== ========== ========== =========== ===========';
+            printf "%${max}s %67.2f %11s\n", __( 'Grand total' ), $total_queries_times, $total_queries_count;


Shouldn't the grand total be included in the JSON output also?

I wondered about that too, but decided against it for now. It could be done more easily once I do the refactoring you suggested.

tgreenx

This looks fine as function, but I think there is a lack of documentation. I am not sure that the best place is to add documentation into the scripts. Rather I think a markdown document in the document tree would be better, but a reference from here.

Please share your thoughts of documentation and I am ready to approve.

So far I think all the options are described that way. I could attempt to extend the documentation of this option in a future PR, although I don't immediately see what is missing. I think the current option description and its output are rather self-explanatory, considering that I've added explicit headers to the output.

The section mapping is defined once for the JSON output and once for table output. And the grand totals are calculated as part of the table output logic. IMHO it would be better to set up the data to be shown just once and do it before we branch to the individual output formats.

Yes you're right, if you don't mind though I'd prefer to have it done in a future PR (I can open an issue to track it) so that this one can be merged in time for v2025.1. Is that fine with you?

lib/Zonemaster/CLI.pm

tgreenx · 2025-06-05T15:52:06Z

lib/Zonemaster/CLI.pm

+            }
+
+            printf "%${max}s %s\n", '=' x $max, ' ========== ========== ========== ========== ========== =========== ===========';
+            printf "%${max}s %67.2f %11s\n", __( 'Grand total' ), $total_queries_times, $total_queries_count;


I wondered about that too, but decided against it for now. It could be done more easily once I do the refactoring you suggested.

matsduf · 2025-06-05T21:00:48Z

This looks fine as function, but I think there is a lack of documentation. I am not sure that the best place is to add documentation into the scripts. Rather I think a markdown document in the document tree would be better, but a reference from here.
Please share your thoughts of documentation and I am ready to approve.

So far I think all the options are described that way. I could attempt to extend the documentation of this option in a future PR, although I don't immediately see what is missing. I think the current option description and its output are rather self-explanatory, considering that I've added explicit headers to the output.

Please look at it from a "foreign" eye, and then the output is far from self-explanatory.

marc-vanderwal

It’s fine, but some column headings in the statistics table aren’t translated. I took the liberty to suggest some little style changes as well.

marc-vanderwal · 2025-06-06T09:08:48Z

lib/Zonemaster/CLI.pm

+            my $header = __( 'Name servers' );
+            my $max = max map { length( "$_" ) } ( ( @child_nss, @parent_nss, @all_responded_nss ), $header );
+            printf "\n%${max}s %s\n", $header, '        Max        Min        Avg     Stddev     Median       Total       Count';
+            printf "%${max}s %s\n", '=' x $max, ' ========== ========== ========== ========== ========== =========== ===========';


This table’s columns aren’t fully localized and that’s a problem.

Also, it's better to write printf "%*s", $width, $string instead of printf "%${width}s", $string.

Suggested change

my $header = __( 'Name servers' );

my $max = max map { length( "$_" ) } ( ( @child_nss, @parent_nss, @all_responded_nss ), $header );

printf "\n%${max}s %s\n", $header, ' Max Min Avg Stddev Median Total Count';

printf "%${max}s %s\n", '=' x $max, ' ========== ========== ========== ========== ========== =========== ===========';

my $max = max map { length( "$_" ) } ( ( @child_nss, @parent_nss, @all_responded_nss ), __( 'Name servers' ) );

my @columns = (

{ label => __( 'Name servers' ), width => $max },

{ label => __( 'Max' ), width => 11 },

{ label => __( 'Min' ), width => 10 },

{ label => __( 'Avg' ), width => 10 },

{ label => __( 'Stddev' ), width => 10 },

{ label => __( 'Median' ), width => 10 },

{ label => __( 'Total' ), width => 11 },

{ label => __( 'Count' ), width => 11 },

);

# Table header

print "\n";

say join " ", map { sprintf "%*s", $_->{width}, $_->{label} } @columns;

say join " ", map { "=" x $_->{width} } @columns;

The width of each the column is wide enough for the English header name, but in translation the header name might get wider. I guess the solution is to have adaptive column width for all comlumns.

Or does the second suggestion take care of that?

marc-vanderwal · 2025-06-06T09:10:42Z

lib/Zonemaster/CLI.pm

+                printf "%${max}s ",  $ns->string;
+                printf "%11.2f ",    1000 * $ns->max_time;
+                printf "%10.2f ",    1000 * $ns->min_time;
+                printf "%10.2f ",    1000 * $ns->average_time;
+                printf "%10.2f ",    1000 * $ns->stddev_time;
+                printf "%10.2f ",    1000 * $ns->median_time;
+                printf "%11.2f ",    1000 * $ns->sum_time;
+                printf "%11d\n",     scalar @{ $ns->times };


Thanks to introducing the @columns variable above in my previous suggestion, we can have a single source of truth for the column widths.

And there’s another trick: by using %*2$s instead of %*s as a format string, the width field can be put second instead of first in the call to printf.

Suggested change

printf "%${max}s ", $ns->string;

printf "%11.2f ", 1000 * $ns->max_time;

printf "%10.2f ", 1000 * $ns->min_time;

printf "%10.2f ", 1000 * $ns->average_time;

printf "%10.2f ", 1000 * $ns->stddev_time;

printf "%10.2f ", 1000 * $ns->median_time;

printf "%11.2f ", 1000 * $ns->sum_time;

printf "%11d\n", scalar @{ $ns->times };

printf '%*2$s ', $ns->string, $columns[0]->{width};

printf '%*2$.2f ', 1000 * $ns->max_time, $columns[1]->{width};

printf '%*2$.2f ', 1000 * $ns->min_time, $columns[2]->{width};

printf '%*2$.2f ', 1000 * $ns->average_time, $columns[3]->{width};

printf '%*2$.2f ', 1000 * $ns->stddev_time, $columns[4]->{width};

printf '%*2$.2f ', 1000 * $ns->median_time, $columns[5]->{width};

printf '%*2$.2f ', 1000 * $ns->sum_time, $columns[6]->{width};

printf "%*2\$d\n", scalar @{ $ns->times }, $columns[7]->{width};

marc-vanderwal · 2025-06-06T09:12:46Z

lib/Zonemaster/CLI.pm

+
+            foreach my $section_order ( sort keys %section_mapping ) {
+                foreach my $section_header ( keys % { $section_mapping{$section_order} } ) {
+                    printf "%s %s\n", $section_header, '-' x ( ( $max - length $section_header ) - 1 );


Using printf to do such a thing is discouraged:

Suggested change

printf "%s %s\n", $section_header, '-' x ( ( $max - length $section_header ) - 1 );

say $section_header, ' ', '-' x ( ( $max - length $section_header ) - 1 );

marc-vanderwal · 2025-06-06T09:13:28Z

lib/Zonemaster/CLI.pm

+            printf "%${max}s %s\n", '=' x $max, ' ========== ========== ========== ========== ========== =========== ===========';
+            printf "%${max}s %67.2f %11s\n", __( 'Grand total' ), $total_queries_times, $total_queries_count;


Again, using a single source of truth for the column widths:

Suggested change

printf "%${max}s %s\n", '=' x $max, ' ========== ========== ========== ========== ========== =========== ===========';

printf "%${max}s %67.2f %11s\n", __( 'Grand total' ), $total_queries_times, $total_queries_count;

# Totals line

say join " ", map { "=" x $_->{width} } @columns;

printf "%*s %67.2f %11s\n", $max, __( 'Grand total' ), $total_queries_times, $total_queries_count;

matsduf · 2025-06-06T20:10:54Z

lib/Zonemaster/CLI.pm

@@ -1,15 +1,14 @@
 # Brief help module to define the exception we use for early exits.
 package Zonemaster::Engine::Exception::NormalExit;
-use 5.014002;
+use v5.26;


Does that mean that we increase the lowest Perl version of Zonemaster from v5.16 to v5.26?

matsduf · 2025-06-10T13:52:31Z

This can be merged, but three issues to be created:

Create detailed user documentation and link to that from the script in v2025.2.
Refactoring of the table printing code to better support translations and increase maintainability. -> Refactor table-printing code in zonemaster-cli #439
Refactoring of section mapping of data for both outputs. Grand total is to be included in the JSON output. -> Improvements to --nstimes #440

matsduf

Conflict to be resolved.

marc-vanderwal · 2025-06-10T13:58:09Z

This can be merged, but three issues to be created: […]
* Refactoring of the table printing code to better support translations and increase maintainability.

I created #439 to track that.

- Extend to all queried name servers: this means that non-queried name servers objects will not appear, with the exception of name servers of the child or parent zone - Add classification of name servers ("child", "parent", "other"). Note that name servers shared by the child and parent zone will appear in both categories. - Add number of queries sent per name server - Add grand total row for query times and query count - Refactoring

- Bump Perl version to v5.26 - Use 'my sub { }' syntax - Update unit test

tgreenx · 2025-06-10T14:08:02Z

[ ... ]
* Refactoring of section mapping of data for both outputs. Grand total is to be included in the JSON output.

I created #440 for this.

marc-vanderwal · 2025-06-11T06:58:15Z

Successfully release tested this PR on Rocky Linux 9.

Using the command line in the test procedure, the output I obtained looks similar to the example, albeit with different numbers. The formatting of the table looks correct, though currently, none of the strings are translated.

tgreenx added the V-Patch Versioning: The change gives an update of patch in version. label Feb 11, 2025

tgreenx added this to the v2025.1 milestone Feb 11, 2025

tgreenx requested review from mattias-p, matsduf, tolvmannen, marc-vanderwal and MichaelTimbert February 11, 2025 17:53

mattias-p previously approved these changes Feb 12, 2025

View reviewed changes

marc-vanderwal previously approved these changes Feb 13, 2025

View reviewed changes

tgreenx dismissed stale reviews from marc-vanderwal and mattias-p via 62ee25e February 25, 2025 17:59

tgreenx force-pushed the update-nstimes branch from 8807350 to 62ee25e Compare February 25, 2025 17:59

tgreenx changed the title ~~Expand --nstimes option to also output the total number of sent queries per name server~~ Expand --nstimes option Feb 25, 2025

tgreenx force-pushed the update-nstimes branch from 62ee25e to a3a46df Compare February 25, 2025 18:09

tgreenx marked this pull request as draft February 25, 2025 18:12

tgreenx force-pushed the update-nstimes branch from a3a46df to fd5075c Compare February 25, 2025 18:27

tgreenx requested review from mattias-p and marc-vanderwal February 25, 2025 18:45

tgreenx marked this pull request as ready for review February 25, 2025 18:45

tgreenx added V-Minor Versioning: The change gives an update of minor in version. and removed V-Patch Versioning: The change gives an update of patch in version. labels Feb 25, 2025

mattias-p reviewed Mar 11, 2025

View reviewed changes

matsduf reviewed Mar 12, 2025

View reviewed changes

script/zonemaster-cli Outdated Show resolved Hide resolved

tgreenx requested a review from marc-vanderwal June 3, 2025 16:20

tgreenx force-pushed the update-nstimes branch from 73af23b to acf65f5 Compare June 3, 2025 16:22

matsduf reviewed Jun 4, 2025

View reviewed changes

mattias-p reviewed Jun 5, 2025

View reviewed changes

tgreenx commented Jun 5, 2025

View reviewed changes

marc-vanderwal reviewed Jun 6, 2025

View reviewed changes

matsduf reviewed Jun 6, 2025

View reviewed changes

matsduf mentioned this pull request Jun 9, 2025

Sets minimum Perl version to v5.26.0 (Zonemaster-LDNS) zonemaster/zonemaster-ldns#228

Merged

matsduf previously approved these changes Jun 10, 2025

View reviewed changes

tgreenx added 5 commits June 10, 2025 15:58

Update after review, and refactoring

7372e29

Small refactoring

64c4770

Address review comments

171d8c5

- Bump Perl version to v5.26 - Use 'my sub { }' syntax - Update unit test

Incorporate review comments

646cc55

tgreenx dismissed matsduf’s stale review via 646cc55 June 10, 2025 13:58

tgreenx force-pushed the update-nstimes branch from 93c9b94 to 646cc55 Compare June 10, 2025 13:58

tgreenx requested review from mattias-p, marc-vanderwal and matsduf June 10, 2025 13:59

tgreenx mentioned this pull request Jun 10, 2025

Improvements to --nstimes #440

Open

matsduf approved these changes Jun 10, 2025

View reviewed changes

tgreenx merged commit e3c9168 into zonemaster:develop Jun 10, 2025
3 checks passed

tgreenx deleted the update-nstimes branch June 10, 2025 15:28

marc-vanderwal added the S-ReleaseTested Status: The PR has been successfully tested in release testing label Jun 11, 2025

-            my $header = __( 'Name servers' );
-            my $max = max map { length( "$_" ) } ( ( @child_nss, @parent_nss, @all_responded_nss ), $header );
-            printf "\n%${max}s %s\n", $header, '        Max        Min        Avg     Stddev     Median       Total       Count';
-            printf "%${max}s %s\n", '=' x $max, ' ========== ========== ========== ========== ========== =========== ===========';
+            my $max = max map { length( "$_" ) } ( ( @child_nss, @parent_nss, @all_responded_nss ), __( 'Name servers' ) );
+            my @columns = (
+                { label => __( 'Name servers' ), width => $max },
+                { label => __( 'Max' ),          width => 11 },
+                { label => __( 'Min' ),          width => 10 },
+                { label => __( 'Avg' ),          width => 10 },
+                { label => __( 'Stddev' ),       width => 10 },
+                { label => __( 'Median' ),       width => 10 },
+                { label => __( 'Total' ),        width => 11 },
+                { label => __( 'Count' ),        width => 11 },
+            );
+            # Table header
+            print "\n";
+            say join " ", map { sprintf "%*s", $_->{width}, $_->{label} } @columns;
+            say join " ", map { "=" x $_->{width} } @columns;

	printf "%s %s\n", $section_header, '-' x ( ( $max - length $section_header ) - 1 );
	say $section_header, ' ', '-' x ( ( $max - length $section_header ) - 1 );

		printf "%${max}s %s\n", '=' x $max, ' ========== ========== ========== ========== ========== =========== ===========';
		printf "%${max}s %67.2f %11s\n", __( 'Grand total' ), $total_queries_times, $total_queries_count;

Expand --nstimes option #421

Expand --nstimes option #421

Uh oh!

Conversation

tgreenx commented Feb 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Context

Changes

How to test this PR

Uh oh!

mattias-p left a comment

Choose a reason for hiding this comment

Uh oh!

marc-vanderwal left a comment

Choose a reason for hiding this comment

Uh oh!

tgreenx commented Feb 13, 2025

Uh oh!

tgreenx commented Feb 25, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

matsduf commented Mar 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mattias-p commented Mar 12, 2025

Uh oh!

Uh oh!

matsduf commented Mar 12, 2025

Uh oh!

mattias-p commented Mar 13, 2025

Uh oh!

matsduf commented Mar 14, 2025

Uh oh!

tgreenx commented Jun 4, 2025

Uh oh!

matsduf left a comment

Choose a reason for hiding this comment

Uh oh!

mattias-p left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tgreenx left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

matsduf commented Jun 5, 2025

Uh oh!

marc-vanderwal left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

matsduf Jun 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tgreenx commented Feb 11, 2025 •

edited

Loading

matsduf commented Mar 12, 2025 •

edited

Loading

tgreenx left a comment •

edited

Loading

matsduf Jun 9, 2025 •

edited

Loading

matsduf commented Jun 10, 2025 •

edited by tgreenx

Loading