Posts

  • Boruta-Shap

    A Tree based feature selection tool which combines both the Boruta feature selection algorithm with shapley values.

    BorutaShap is a wrapper feature selection method which combines both the Boruta feature selection algorithm with shapley values. This combination has proven to out perform the original Permutation Importance method in both speed, and the quality of the feature subset produced. Not only does this algorithm provide a better subset of features, but it can also simultaneously provide the most accurate and consistent global feature rankings which can be used for model inference too. Unlike the orginal R package, which limits the user to a Random Forest model, BorutaShap allows the user to choose any Tree Based learner as the base model in the feature selection process.

    Despite BorutaShap’s runtime improvements the SHAP TreeExplainer scales linearly with the number of observations making it’s use cumbersome for large datasets. To combat this, BorutaShap includes a sampling procedure which uses the smallest possible subsample of the data availble at each iteration of the algorithm. It finds this sample by comparing the distributions produced by an isolation forest of the sample and the data using ks-test. From experiments, this procedure can reduce the run time up to 80% while still creating a valid approximation of the entire data set. Even with these improvments the user still might want a faster solution so BorutaShap has included an option to use the mean decrease in gini impurity. This importance measure is independent of the size dataset as it uses the tree’s structure to compute a global feature ranking making it much faster than SHAP at larger datasets. Although this metric returns somewhat comparable feature subsets, it is not a reliable measure of global feature importance in spite of it’s wide spread use. Thus, I would recommend to using the SHAP metric whenever possible.

    Tags: #python

  • sulley

    A pure-python fully automated and unattended fuzzing framework.

    If you’re looking for an actively maintained fork of Sulley, take a look at BooFuzz.

    Tags: #python

  • dir-compare

    Node JS directory compare

    npm install dir-compare
    

    Tags: #typescript

  • ghidra-delinker-extension

    Ghidra extension for exporting relocatable object files

    This Ghidra extension enables exporting parts of a program as object files. These object files have valid metadata (symbols, relocation tables…) and as such can be reused directly by a toolchain for further processing.

    Use-cases include:

    • Advanced binary patching, by leveraging the linker to mend both original and modified parts together instead of doing this work by hand ;
    • Software ports, by isolating system-independent code from a program and replacing the rest ;
    • Converting programs or object files from one file format to another ;
    • Creating libraries, by extracting parts of a program and reusing them in another context ;
    • Decompilation projects, by splitting a program into multiple object files and reimplementing these Ship of Theseus-style ;

    Matrix of supported instruction set architectures and object files:

      x86 MIPS
    COFF
    ELF

    Tags: #java • ghidra • ghidra-extension

  • luma.gl

    High-performance Toolkit for WebGL-based Data Visualization

    luma.gl is a GPU toolkit for the Web focused primarily on data visualization use cases. luma.gl aims to provide support for GPU programmers that need to work directly with shaders and want a low abstraction API that remains conceptually close to the WebGPU and WebGL APIs. Some features of luma.gl include:

    • A robust GLSL shader module system.
    • A convenient object-oriented API wrapping most WebGL objects
    • Higher-level engine constructs to manage the animation loop, drawing and resource management

    Unlike other common WebGL APIs, the developer can choose to use the parts of luma.gl that support their use case and leave the others behind.

    While generic enough to be used for general 3D rendering, luma.gl’s mandate is primarily to support GPU needs of data visualization frameworks in the vis.gl suite, such as:

    • kepler.gl a powerful open source geospatial analysis tool for large-scale data sets
    • deck.gl a WebGL-powered framework for visual exploratory data analysis of large data sets
    • streetscape.gl A visualization toolkit for autonomy and robotics data encoded in the XVIZ protocol

    Tags: #typescript • webgl • data-visualization

  • accelerated-mobile-pages

    Automatically add Accelerated Mobile Pages (AMP Project) functionality on your WordPress.

    Contributors: mohammed_kaludi, ahmedkaludi, ampforwp
    Tags: AMP, accelerated mobile pages, mobile, amp project, google amp, amp wp, google, plugin, SEO
    Donate link: https://www.paypal.me/Kaludi/25
    Requires at least: 3.0
    Tested up to: 6.8
    Stable tag: 1.1.8 License: GPLv2 or later
    License URI: http://www.gnu.org/licenses/gpl-2.0.html

    AMP for WP is the most recommended AMP plugin by the community. Automatically add Accelerated Mobile Pages (Google AMP Project) functionality on your WordPress site.

    Tags: #php • wordpress • wordpress-plugin

  • redmica

    The future Redmine you can get today — yet another distribution of a flexible project management software named Redmine

    RedMica is yet another distribution of Redmine, a great open source project management software.

    The main aim of RedMica is to quickly deliver new Redmine features that have already been in the repository but have not yet been officially released. We, Far End Technologies Corporation, are using RedMica in our Redmine hosting service My Redmine.

    Tags: #ruby • project-management • issue-tracking

  • Crawler-Detect

    🕷 CrawlerDetect is a PHP class for detecting bots/crawlers/spiders via the user agent

    CrawlerDetect is a PHP class for detecting bots/crawlers/spiders via the user agent and http_from header. Currently able to detect 1,000’s of bots/spiders/crawlers.

    Tags: #php • user-agent • crawler

  • medusa

    Mobile Edge-Dynamic Unified Security Analysis

    A modular automation framework and script repository for runtime testing and investigating Android and iOS apps, designed for penetration testers and malware analysts. It lets you plug in modules to bypass protections (like SSL pinning), inspect network/WebView activity, trace API calls, examine memory/crypto, and monitor malware behavior — all from a single coordinated script. It runs on FRIDA under the hood and ships with 90+ reusable modules so tests scale and repeat easily.

    Tags: #javascript • malware • dynamic-analysis

  • rdoc

    RDoc produces HTML and online documentation for Ruby projects.

    Tags: #ruby • documentation-tool • hacktoberfest

subscribe via RSS