Adobe open-sources its machine learning Stringlifier tool to help developers sanitize logs and detect exposed credentials
Its a season of open-sourcing. Google and Microsoft have done it and Adobe has made sure it is not left behind. The PDF maker has released the open-source version of its Stringlifier tool designed to identify randomly generated strings in any plain text.
The Stringlifier tool is developed by Adobe internally and is written in Python. Stringlifier leverages machine learning to differentiate random character sequences from normal text sequences and helps developers identify exposed datasets. Adobe says that Stringlifier could help developers to analyze security and application logs for errors. The Stringlifier could also help developers to find if their datasets have any exposed credentials.
You can access the source code for Stringlifier on Adobe’s GitHub repository here. Adobe has also made the “pip” (Python package installer) installation package public. The pip comes with a pre-trained model to help developers understand how it works.
Stringlifier can scan through hashes, API keys, randomly generated passwords, or random strings of code, logs, or configuration files to detect exposed datasets. Adobe has already open-sourced another machine learning utility called Tripod.
“We hope you find Stringlifier useful. The entire source code is available in Adobe’s GitHub repository. You can also find all of our other open-source projects from across Adobe’s security teams in that repository. We look forward to getting feedback and contributions are always welcome,” Adobe says in the blog post.