Posts

  • JFastText

    Java interface for fastText

    JFastText is a Java wrapper for Facebook’s fastText, a library for efficient learning of word embeddings and fast sentence classification. The JNI interface is built using javacpp.

    The library provides full fastText’s command line interface. It also provides the API for loading trained model from file to do label prediction in memory. Model training and quantization are supported via the command line interface.

    JFastText is ideal for building fast text classifiers in Java.

    Tags: #java • machine-learning • nlp

  • clauneck

    A tool for scraping emails, social media accounts, and much more information from websites using Google Search Results.

    The script will write the results in a CSV file. If it cannot find any one of the information on a website, it will label it as null. For unknown errors happening in-between (connection errors, encoding errors, etc.) the fields will be filled with as error.

    Website Information Type of Information
    serpapi.com contact@serpapi.com Email
    serpapi.com serpapicom Instagram
    serpapi.com serpapicom Facebook
    serpapi.com serp_api Twitter
    serpapi.com null Tiktok
    serpapi.com channel/UCUgIHlYBOD3yA3yDIRhg_mg Youtube
    serpapi.com serpapi Github
    serpapi.com serpapi Medium

    Tags: #ruby • automation • command-line

  • eclipse-collections

    Eclipse Collections is a collections framework for Java with optimized data structures and a rich, functional and fluent API.

    Eclipse Collections is a comprehensive collections library for Java. The library enables productivity and performance by delivering an expressive and efficient set of APIs and types. The iteration protocol was inspired by the Smalltalk collection framework, and the collections are compatible with the Java Collection Framework types.

    Eclipse Collections is compatible with Java 8+. Eclipse Collections is a part of the OpenJDK Quality Outreach program, and it is validated for different versions of the OpenJDK.

    Tags: #java • eclipse-collections • java-collections

  • timeout

    Timeout provides a way to auto-terminate a potentially long-running operation if it hasn’t finished in a fixed amount of time.

    Timeout provides a way to auto-terminate a potentially long-running operation if it hasn’t finished in a fixed amount of time.

    Previous versions didn’t use a module for namespacing, however

    Tags: #ruby

  • mongock

    Lightweight Java based migration tool

    The official documentation is [here][documentation_link], where you can find everything rom version 3.

    Tags: #java • mongock • mongodb

  • sync-dotenv

    Keep your .env in sync with .env.example

    Projects often rely on environmental variables stored in a .env file to run… and because these variables sometimes contain sensitive data, we never add them to source control. Instead, these variables are added e.g. to a .env.example file so it’s easy to get the project running for other developers. However, it’s very easy to forget to update this file when a variable is added/updated in .env (during development). This can make it difficult for devs to get the project running (locally) because they rely on .env.example file to setup their environment (with their own configs).

    Enter sync-dotenv 🔥

    Tags: #typescript • nodejs • dotenv

  • python-user-agents

    A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings.

    • ua-parser >= 0.10.0 is required. Thanks @jnozsc!
    • Added get_device(), get_os() and get_browser() instance methods to UserAgent. Thanks @rodrigondec!

    Tags: #python • user-agent-detection • browser-detection

  • MerlinWP

    Better WordPress Theme Onboarding

    Merlin WP aims to address the tedious and exhausting WordPress theme setup and onboarding process. It makes installing a new WordPress theme, recommended WordPress plugins, Customizer settings, widgets, and demo content an exciting and gratifying user experience. Read more…

    Tags: #php • wordpress • themes

  • git-imerge

    Incremental merge for git

    Perform a merge between two branches incrementally. If conflicts are encountered, figure out exactly which pairs of commits conflict, and present the user with one pairwise conflict at a time for resolution.

    git-imerge has two primary design goals:

    • Reduce the pain of resolving merge conflicts to its unavoidable minimum, by finding and presenting the smallest possible conflicts: those between the changes introduced by one commit from each branch.

    • Allow a merge to be saved, tested, interrupted, published, and collaborated on while it is in progress.

    I think that it is easiest to understand the concept of incremental merging visually, and therefore I recommend the video of my git-imerge presentation from the GitMerge 2013 conference (20 min) as a good place to start. The full slides for that talk are available in this repository under doc/presentations/GitMerge-2013. At the same conference, I was interviewed about git-imerge by Thomas Ferris Nicolaisen for his GitMinutes Podcast #12.

    To learn how to use the git-imerge tool itself, I suggest the blog article git-imerge: A Practical Introduction and also typing git-imerge --help and git-imerge SUBCOMMAND --help. If you want more information, the theory and benefits of incremental merging are described in minute detail in a series of blog articles [1], as are the benefits of retaining history when doing a rebase [2].

    Multiple incremental merges can be in progress at the same time. Each incremental merge has a name, and its progress is recorded in the Git repository as references under refs/imerge/NAME. The current state of an incremental merge can be visualized using the diagram command.

    An incremental merge can be interrupted and resumed arbitrarily, or even pushed to a server to allow somebody else to work on it.

    git-imerge comes with a Bash completion script, completions/git-imerge, which is installed automatically when installing git-imerge.

    Tags: #python • git • merge-conflicts

  • Kore

    Kore is a simple and easy-to-use Kodi remote.

    GitHub repository for the [Kore][1] Android app.

    Kore is the official remote for Kodi, and aims to be a simple and easy to use remote.

    Tags: #java • kore • kodi

subscribe via RSS