Announcing memflow 0.2.0

Today, we are proud to release the first stable version of memflow 0.2! 3 years in the making, this is certainly a monumental release. In this post, we will go through the key changes to the fastest and most flexible physical memory introspection and forensics framework to date.

Key changes

0. memflowup

Not a library change, but an ecosystem change! We now have a Rust-written memflowup utility that makes it much easier to manage your memflow installation. Key features:

  • Download binary builds (optional).
  • Split between stable and dev versions.
  • Custom install scripts, for more complicated plugins.
    • Used by memflow-kvm for its DKMS install.
    • The entry point for these is the install.rhai script at the root of the package's repo.

You can get started with memflowup by running the following:

> curl --proto '=https' --tlsv1.2 -sSf https://sh.memflow.io | sh
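
Once memflowup is set up, plugins are installed through it as well. As a purely illustrative sketch (the install subcommand and package names shown here are assumptions - consult memflowup's own help output for the exact syntax), installing the plugins used later in this post might look like:

> memflowup install memflow-win32 memflow-qemu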

1. OS layers and modularity

With the advent of the 0.2 series, we have abstracted most of memflow-win32's functionality behind a shared set of traits. These traits allow the user to interact with the operating system in a unified manner. In addition, we have made the OS a plugin, just as Connectors were in 0.1! And finally, we do indeed have multiple OS backends available right now:

  • memflow-win32, for Windows analysis, given physical memory access.
  • memflow-native, for syscall based interaction with the running operating system.
  • WIP: memflow-linux
    • Don't expect much anytime soon, because the challenge of cross-version, zero-knowledge Linux support is a tricky one.

With this, OS-independent code that works with memflow-win32 should also work on the local OS. Here's an example of such code:

use memflow::prelude::v1::*;

// We don't care what type of process we get, so long as it's a process
fn module_address(process: &mut impl Process, module: &str) -> Result<Address> {
    let module = process.module_by_name(module)?;
    Ok(module.base)
}
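
For illustration, the function above could be driven through memflow-native like this - a minimal sketch continuing the snippet above, which assumes the OS plugin is installed under the name "native" and uses placeholder process/module names:

fn main() -> Result<()> {
    let inventory = Inventory::scan();

    // memflow-native interacts with the OS we are currently running on
    let os = inventory.create_os("native", None, None)?;

    // "my_process" and "my_module" are placeholder names for illustration
    let mut process = os.into_process_by_name("my_process")?;
    let base = module_address(&mut process, "my_module")?;

    println!("module base: {:?}", base);
    Ok(())
}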

In addition, the modularization of operating systems allows for greater portability of connectors. For instance, we have now split memflow-qemu-procfs into memflow-qemu, which (optionally) accepts an OS layer. This way, you can not only analyze QEMU VMs running on your own computer, but also open them up in a nested way on a machine that is already being analyzed through DMA, as seen in this chart:

memflow-kvm
     ↓
memflow-win32
     ↓
memflow-qemu
     ↓
memflow-win32
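
In code, such a nested chain is built by feeding each layer into the next one. The following is a minimal sketch, assuming the plugins are installed under the names "kvm", "win32" and "qemu", and omitting connector/OS arguments (such as which QEMU VM to attach to):

use memflow::prelude::v1::*;

fn main() -> Result<()> {
    let inventory = Inventory::scan();

    // Physical memory of the outer (KVM) machine
    let kvm = inventory.create_connector("kvm", None, None)?;

    // Parse the Windows installation running inside it
    let outer_os = inventory.create_os("win32", Some(kvm), None)?;

    // memflow-qemu optionally accepts an OS layer, so it can locate
    // a QEMU process inside the already-analyzed machine
    let qemu = inventory.create_connector("qemu", Some(outer_os), None)?;

    // Finally, parse the nested guest
    let _nested_os = inventory.create_os("win32", Some(qemu), None)?;

    Ok(())
}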

2. Stable ABI

In 0.1, the Connectors were turned into plugins through the use of Rust trait objects. This was an okay solution at the time; however, we knew that it was not a safe one - changes in Rust versions could change the layout of those trait objects, leading to crashes or other misbehavior whenever the plugin's rustc version did not match the one used to build the user's code. While the layout has remained stable most of the time, the tides started to shift a few years ago, as more effort was put into trait objects on the compiler front.

For 0.2, we knew we could not keep the status quo, so we built cglue. The crate allows for simple and flexible ABI-safe code generation, suited to the needs of memflow. Throughout the (very long) beta period, we received zero crash reports stemming from ABI instability, while 0.1 did have such cases. We can therefore conclude that it was a good investment that has already made memflow more stable.
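
To give a flavor of what cglue does, here is a minimal sketch based on cglue's documented usage (not memflow-specific): a single attribute turns a regular Rust trait into one that can be wrapped into an ABI-safe, FFI-safe object.

use cglue::prelude::v1::*;

// One annotation makes the trait usable as an ABI-safe object
#[cglue_trait]
pub trait InfoPrinter {
    fn print_info(&self);
}

struct Info {
    value: usize,
}

impl InfoPrinter for Info {
    fn print_info(&self) {
        println!("Info struct: {}", self.value);
    }
}

fn main() {
    let mut info = Info { value: 5 };
    // The resulting object is opaque and safe to pass across plugin boundaries
    let obj = trait_obj!(&mut info as InfoPrinter);
    obj.print_info();
}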

In the 0.2.0-betaX series you may have encountered "invalid ABI" errors. Fear not: for the stable release we commit to not breaking the ABI across the entirety of the 0.2 series, so this problem should be a thing of the past for most users.

3. Memory-constrained vtop

memflow 0.2 introduces the most scalable virtual address translation backend, period. The backend is able to walk the entire page tree in milliseconds, targeting any modern memory architecture (x86 and ARM are supported out of the box, with sufficient building blocks for RISC-V). In addition, compared to 0.1, the new backend uses fixed-size buffers, meaning RAM usage will no longer blow up on large translation ranges.
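
Address translation is exposed through the VirtualTranslate trait. Here is a minimal sketch, assuming the process backend implements VirtualTranslate (which physical-memory-backed OS layers such as memflow-win32 do):

use memflow::prelude::v1::*;

// Translate a single virtual address within a process to the
// physical address backing it
fn translate(
    process: &mut (impl Process + VirtualTranslate),
    addr: Address,
) -> Result<PhysicalAddress> {
    process.virt_to_phys(addr)
}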

4. 64-bit and 128-bit address spaces, on all architectures

We now support analyzing 64-bit operating systems on 32-bit machines. In addition, if there were a theoretical 128-bit architecture, we would support that as well. However, it's more of a PoC and we do not expect this to be needed in the foreseeable future.

The support can be toggled through the 64_bit_mem (default) and 128_bit_mem features. Do note that these feature toggles change memflow's ABI, so it should not be possible to mix plugins built with mismatched memory-width features.
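
In practical terms, the selected feature determines how wide memflow's target-address integer is. A minimal sketch, under the assumption that memflow::types::umem is the type widened by these features:

use memflow::types::umem;

fn main() {
    // With the default 64_bit_mem feature this prints 64,
    // with 128_bit_mem it prints 128
    println!("target address width: {} bits", core::mem::size_of::<umem>() * 8);
}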

5. Shared MemoryView

In 0.1, we had a split between physical and virtual memory. The reason for the split is caching - we wish to minimize latency by caching read-only memory in high-latency scenarios. However, to tell the cache what mode the memory is in (readable/writeable/executable), you must add metadata to each request. Meanwhile, this metadata may only be filled in by the virtual address translation backend.

If a user submits an I/O operation, they can't possibly know whether the request is going to a read-only or a writeable page, so they just submit UNKNOWN page flags. This is complicated, therefore we have narrowed the gap between virtual and physical memory access through the use of the MemoryView trait. This trait not only removes the need for the user to explicitly submit the page flags, but also brings all the I/O helpers that previously existed only in virtual memory contexts. To use MemoryView on physical memory, just use the phys_view function:

use memflow::prelude::v1::*;

fn main() -> Result<()> {
    let inventory = Inventory::scan();
    let mut conn = inventory.create_connector("dummy", None, None)?;

    // Create a physical memory view
    let mut view = conn.phys_view();

    // Read from phys addr 0
    let value: u64 = view.read(0.into())?;

    Ok(())
}

6. C and C++ are now first-class citizens

The FFI is now automatically generated using cbindgen and cglue-bindgen. It may initially seem like a downgrade; however, this way we can ensure that the entirety of memflow's plugin-focused API surface can be both accessed and implemented by foreign languages, such as C and C++.

The key to using the new FFI is reading the Rust documentation and examples, and then finding the function equivalents in the headers. There are a few quirks here and there, but after understanding them, using the FFI should not be hard. For inspiration, see the following:

Side projects

memflow is only as useful as the projects utilizing it. To get started with the new version faster, you may want to have a look at some of them. Here's the list of first-party releases:

  • reflow - execute code on top of virtual memory.
  • scanflow - basic CheatEngine features in a command line interface.
  • cloudflow (WIP) - flexible filesystem based way to interact with memflow.
  • memflow-py - Python bindings for memflow (courtesy of emesare).

Reflection

0.2 took way longer than we originally anticipated. This is mostly due to changing living conditions and the fact that both ko1N and I are only working on the project in a hobbyist capacity. In addition, we pushed for perfection on both the documentation and implementation fronts - a feat infeasible at the current point. We do believe memflow is the framework that is going to bring the most empowerment to users; however, there is still a way to go.

Next up - Async Metamorphosis

Next, we will work towards integrating mfio into memflow, which will enable higher scalability and simplicity. The key change is going to be the transition from a synchronous to an asynchronous API. There are still a lot of open questions regarding this, such as FFI handling, how much the individual pieces of memflow's code will have to change, and how multithreading needs to be handled. However, we are confident those questions are not impossible to solve. Once the metamorphosis is done, we can consider the structure of memflow complete. What comes afterwards is rapid feature development. It will definitely be an exciting time to be alive. So let's just get there, shall we?

- h33p