There are customer use cases where sharing information via internet or uploading data somewhere is not acceptable for security reasons: this avoid the usage of some tool like the most famous Google Analytics, and prevent developers from understanding how the web application is used by the customers. I don't want to reinvent the wheel and re-implement a copy of Google Analytics, but getting inspired from it, the goal is to reuse information that we already have to extrapolate an analysis of the WebApp customer usage.

I started this project with the aim of learning a programming language where I am not so comfortable yet (python). The purpose of this Hack Week project is to bring this basic tool at a minimal stable and usable state with the purpose of analyze the usage of a WebApp in scenarios where the WebApp is used in an internal network only (offline, disconnected from the internet).

Starting from the current status of the tool at this commit, I'd like to improve it more:

  • fix the patterns finder [DONE]
    • data in UI are badly presented and grouped - [DONE]
    • the algorithm generates a pair of from-to URLs pattern ignoring they comes from a different ip/user, and the data results reflects a non-real pattern actually. This needs to be fixed. [DONE]
  • add filters for the patterns section [DONE]
  • let the table columns to be ordered
  • go through the python backend algorithms and improve [DONE]
  • provide a feature to compare and diff from a given list of URLs (a struts-config.xml for instance) which are the most used and which are never hit [DONE]

Long run roadmap:

  • let the engine keep the history of what has already been read and what not (by date and time? by log file?)
  • provide a simple optional javascript to send an AJAX request with some information (user, date and time, URL, etc) to a configured endpoint
    • this could replace the logic of reading and parsing tomcat logs
    • store this information in the database
    • run the python code against the database instead of tomcat log files

Looking for hackers with the skills:

log python tomcat analyzer analysis statistics web yarn reactjs

This project is part of:

Hack Week 17

Activity

  • over 5 years ago: dleidi started this project.
  • over 5 years ago: LuNeves liked this project.
  • over 5 years ago: dmaiocchi liked this project.
  • almost 6 years ago: dleidi added keyword "log" to this project.
  • almost 6 years ago: dleidi added keyword "python" to this project.
  • almost 6 years ago: dleidi added keyword "tomcat" to this project.
  • almost 6 years ago: dleidi added keyword "analyzer" to this project.
  • almost 6 years ago: dleidi added keyword "analysis" to this project.
  • almost 6 years ago: dleidi added keyword "statistics" to this project.
  • almost 6 years ago: dleidi added keyword "web" to this project.
  • almost 6 years ago: dleidi added keyword "yarn" to this project.
  • almost 6 years ago: dleidi added keyword "reactjs" to this project.
  • almost 6 years ago: dleidi originated this project.

  • Comments

    Be the first to comment!

    Similar Projects

    WebUI for your data by avicenzi

    [comment]: # (Please use the project descriptio...


    A command line image collector tool for my gallery website by AZhou

    [comment]: # (Please use the project descriptio...