Skip to content

Can unknown traffic be excluded from the report? #2260

Open
@cdrx

Description

@cdrx

I'm analysing some web traffic, but trying to limit the report to just traffic that is likely genuine user activity (i.e. not a bot, sensible looking user agent, etc).

Using --ignore-crawlers gets me most of the way there, which is great.

If I run with --unknowns-log, I can see from the file that there is a lot of long tail junk activity I'm not interested in (log4j attacks, curl, weird bots etc).

Is it possible to skip / filter out all "unknown" traffic?

Activity

allinurl

allinurl commented on Jan 9, 2022

@allinurl
Owner

Thanks for suggesting this. There's no option now to ignore those. Are you looking to ignore them from being counted completely or simply not showing that data?

cdrx

cdrx commented on Jan 10, 2022

@cdrx
Author

I'm not sure I understand the difference between not counting or not showing the data.

For me, ideally, I would want the unknowns to be either not imported at all, or excluded from the "visits" metric, on the "unique visitors per day" panel.

I guess what I'm looking for is "unique likely-human visitors per day" (as best we can tell, from the logs)

0bi-w6n-K3nobi

0bi-w6n-K3nobi commented on Jan 11, 2022

@0bi-w6n-K3nobi
Contributor

Hum... it is seem so complicated.

In this same way that this request for you may seem "unknown", those may be incorrectly labeled.
Yeah... I known... A lot of web traffic is just trash. But I would take care about this.

Some times, I had DDoS attack or a extraordinary bandwidth consumption from this "unknown" sources.
I advise you to also be aware of this traffic, so as not to have any unpleasant surprises.
Or keep another separate report for that.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Metadata

Assignees

No one assigned

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

      Development

      No branches or pull requests

        Participants

        @cdrx@allinurl@0bi-w6n-K3nobi

        Issue actions

          Can unknown traffic be excluded from the report? · Issue #2260 · allinurl/goaccess