Deserialization
Last updated
Last updated
Learn & practice AWS Hacking:HackTricks Training AWS Red Team Expert (ARTE) Learn & practice GCP Hacking: HackTricks Training GCP Red Team Expert (GRTE)
Serialization is understood as the method of converting an object into a format that can be preserved, with the intent of either storing the object or transmitting it as part of a communication process. This technique is commonly employed to ensure that the object can be recreated at a later time, maintaining its structure and state.
Deserialization, conversely, is the process that counteracts serialization. It involves taking data that has been structured in a specific format and reconstructing it back into an object.
Deserialization can be dangerous because it potentially allows attackers to manipulate the serialized data to execute harmful code or cause unexpected behavior in the application during the object reconstruction process.
In PHP, specific magic methods are utilized during the serialization and deserialization processes:
__sleep
: Invoked when an object is being serialized. This method should return an array of the names of all properties of the object that should be serialized. It's commonly used to commit pending data or perform similar cleanup tasks.
__wakeup
: Called when an object is being deserialized. It's used to reestablish any database connections that may have been lost during serialization and perform other reinitialization tasks.
__unserialize
: This method is called instead of __wakeup
(if it exists) when an object is being deserialized. It gives more control over the deserialization process compared to __wakeup
.
__destruct
: This method is called when an object is about to be destroyed or when the script ends. It's typically used for cleanup tasks, like closing file handles or database connections.
__toString
: This method allows an object to be treated as a string. It can be used for reading a file or other tasks based on the function calls within it, effectively providing a textual representation of the object.
If you look to the results you can see that the functions __wakeup
and __destruct
are called when the object is deserialized. Note that in several tutorials you will find that the __toString
function is called when trying yo print some attribute, but apparently that's not happening anymore.
The method __unserialize(array $data)
is called instead of __wakeup()
if it is implemented in the class. It allows you to unserialize the object by providing the serialized data as an array. You can use this method to unserialize properties and perform any necessary tasks upon deserialization.
You can read an explained PHP example here: https://www.notsosecure.com/remote-code-execution-via-php-unserialize/, here https://www.exploit-db.com/docs/english/44756-deserialization-vulnerability.pdf or here https://securitycafe.ro/2015/01/05/understanding-php-object-injection/
You could abuse the PHP autoload functionality to load arbitrary php files and more:
PHP - Deserialization + Autoload ClassesIf for some reason you want to serialize a value as a reference to another value serialized you can:
PHPGGC can help you generating payloads to abuse PHP deserializations.
Note than in several cases you won't be able to find a way to abuse a deserialization in the source code of the application but you may be able to abuse the code of external PHP extensions.
So, if you can, check the phpinfo()
of the server and search on the internet (an even on the gadgets of PHPGGC) some possible gadget you could abuse.
If you have found a LFI that is just reading the file and not executing the php code inside of it, for example using functions like file_get_contents(), fopen(), file() or file_exists(), md5_file(), filemtime() or filesize(). You can try to abuse a deserialization occurring when reading a file using the phar protocol. For more information read the following post:
phar:// deserializationWhen the object gets unpickle, the function __reduce__ will be executed. When exploited, server could return an error.
Before checking the bypass technique, try using print(base64.b64encode(pickle.dumps(P(),2)))
to generate an object that is compatible with python2 if you're running python3.
For more information about escaping from pickle jails check:
Bypass Python sandboxesThe following page present the technique to abuse an unsafe deserialization in yamls python libraries and finishes with a tool that can be used to generate RCE deserialization payload for Pickle, PyYAML, jsonpickle and ruamel.yaml:
Python Yaml DeserializationJS doesn't have "magic" functions like PHP or Python that are going to be executed just for creating an object. But it has some functions that are frequently used even without directly calling them such as toString
, valueOf
, toJSON
.
If abusing a deserialization you can compromise these functions to execute other code (potentially abusing prototype pollutions) you could execute arbitrary code when they are called.
Another "magic" way to call a function without calling it directly is by compromising an object that is returned by an async function (promise). Because, if you transform that return object in another promise with a property called "then" of type function, it will be executed just because it's returned by another promise. Follow this link for more info.
__proto__
and prototype
pollutionIf you want to learn about this technique take a look to the following tutorial:
NodeJS - __proto__ & prototype PollutionThis library allows to serialise functions. Example:
The serialised object will looks like:
You can see in the example that when a function is serialized the _$$ND_FUNC$$_
flag is appended to the serialized object.
Inside the file node-serialize/lib/serialize.js
you can find the same flag and how the code is using it.
As you may see in the last chunk of code, if the flag is found eval
is used to deserialize the function, so basically user input if being used inside the eval
function.
However, just serialising a function won't execute it as it would be necessary that some part of the code is calling y.rce
in our example and that's highly unlikable.
Anyway, you could just modify the serialised object adding some parenthesis in order to auto execute the serialized function when the object is deserialized.
In the next chunk of code notice the last parenthesis and how the unserialize
function will automatically execute the code:
As it was previously indicated, this library will get the code after_$$ND_FUNC$$_
and will execute it using eval
. Therefore, in order to auto-execute code you can delete the function creation part and the last parenthesis and just execute a JS oneliner like in the following example:
You can find here further information about how to exploit this vulnerability.
A noteworthy aspect of funcster is the inaccessibility of standard built-in objects; they fall outside the accessible scope. This restriction prevents the execution of code that attempts to invoke methods on built-in objects, leading to exceptions such as "ReferenceError: console is not defined"
when commands like console.log()
or require(something)
are used.
Despite this limitation, restoration of full access to the global context, including all standard built-in objects, is possible through a specific approach. By leveraging the global context directly, one can bypass this restriction. For instance, access can be re-established using the following snippet:
For more information read this source.
The serialize-javascript package is designed exclusively for serialization purposes, lacking any built-in deserialization capabilities. Users are responsible for implementing their own method for deserialization. A direct use of eval
is suggested by the official example for deserializing serialized data:
If this function is used to deserialize objects you can easily exploit it:
For more information read this source.
In the following pages you can find information about how to abuse this library to execute arbitrary commands:
In Java, deserialization callbacks are executed during the process of deserialization. This execution can be exploited by attackers who craft malicious payloads that trigger these callbacks, leading to potential execution of harmful actions.
To identify potential serialization vulnerabilities in the codebase search for:
Classes that implement the Serializable
interface.
Usage of java.io.ObjectInputStream
, readObject
, readUnshare
functions.
Pay extra attention to:
XMLDecoder
utilized with parameters defined by external users.
XStream
's fromXML
method, especially if the XStream version is less than or equal to 1.46, as it is susceptible to serialization issues.
ObjectInputStream
coupled with the readObject
method.
Implementation of methods such as readObject
, readObjectNodData
, readResolve
, or readExternal
.
ObjectInputStream.readUnshared
.
General use of Serializable
.
For black box testing, look for specific signatures or "Magic Bytes" that denote java serialized objects (originating from ObjectInputStream
):
Hexadecimal pattern: AC ED 00 05
.
Base64 pattern: rO0
.
HTTP response headers with Content-type
set to application/x-java-serialized-object
.
Hexadecimal pattern indicating prior compression: 1F 8B 08 00
.
Base64 pattern indicating prior compression: H4sIA
.
Web files with the .faces
extension and the faces.ViewState
parameter. Discovering these patterns in a web application should prompt an examination as detailed in the post about Java JSF ViewState Deserialization.
If you want to learn about how does a Java Deserialized exploit work you should take a look to Basic Java Deserialization, Java DNS Deserialization, and CommonsCollection1 Payload.
You can check if there is installed any application with known vulnerabilities.
You could try to check all the libraries known to be vulnerable and that Ysoserial can provide an exploit for. Or you could check the libraries indicated on Java-Deserialization-Cheat-Sheet. You could also use gadgetinspector to search for possible gadget chains that can be exploited. When running gadgetinspector (after building it) don't care about the tons of warnings/errors that it's going through and let it finish. It will write all the findings under gadgetinspector/gadget-results/gadget-chains-year-month-day-hore-min.txt. Please, notice that gadgetinspector won't create an exploit and it may indicate false positives.
Using the Burp extension gadgetprobe you can identify which libraries are available (and even the versions). With this information it could be easier to choose a payload to exploit the vulnerability.
Read this to learn more about GadgetProbe.
GadgetProbe is focused on ObjectInputStream
deserializations.
Using Burp extension Java Deserialization Scanner you can identify vulnerable libraries exploitable with ysoserial and exploit them.
Read this to learn more about Java Deserialization Scanner.
Java Deserialization Scanner is focused on ObjectInputStream
deserializations.
You can also use Freddy to detect deserializations vulnerabilities in Burp. This plugin will detect not only ObjectInputStream
related vulnerabilities but also vulns from Json an Yml deserialization libraries. In active mode, it will try to confirm them using sleep or DNS payloads.
You can find more information about Freddy here.
Serialization Test
Not all is about checking if any vulnerable library is used by the server. Sometimes you could be able to change the data inside the serialized object and bypass some checks (maybe grant you admin privileges inside a webapp). If you find a java serialized object being sent to a web application, you can use SerializationDumper to print in a more human readable format the serialization object that is sent. Knowing which data are you sending would be easier to modify it and bypass some checks.
The main tool to exploit Java deserializations is ysoserial (download here). You can also consider using ysoseral-modified which will allow you to use complex commands (with pipes for example).
Note that this tool is focused on exploiting ObjectInputStream
.
I would start using the "URLDNS" payload before a RCE payload to test if the injection is possible. Anyway, note that maybe the "URLDNS" payload is not working but other RCE payload is.
When creating a payload for java.lang.Runtime.exec() you cannot use special characters like ">" or "|" to redirect the output of an execution, "$()" to execute commands or even pass arguments to a command separated by spaces (you can do echo -n "hello world"
but you can't do python2 -c 'print "Hello world"'
). In order to encode correctly the payload you could use this webpage.
Feel free to use the next script to create all the possible code execution payloads for Windows and Linux and then test them on the vulnerable web page:
You can use https://github.com/pwntester/SerialKillerBypassGadgetCollection along with ysoserial to create more exploits. More information about this tool in the slides of the talk where the tool was presented: https://es.slideshare.net/codewhitesec/java-deserialization-vulnerabilities-the-forgotten-bug-class?next_slideshow=1
marshalsec can be used to generate payloads to exploit different Json and Yml serialization libraries in Java.
In order to compile the project I needed to add this dependencies to pom.xml
:
Install maven, and compile the project:
Read more about this Java JSON library: https://www.alphabot.com/security/blog/2020/java/Fastjson-exceptional-deserialization-vulnerabilities.html
If you want to test some ysoserial payloads you can run this webapp: https://github.com/hvqzao/java-deserialize-webapp
Java uses a lot serialization for various purposes like:
HTTP requests: Serialization is widely employed in the management of parameters, ViewState, cookies, etc.
RMI (Remote Method Invocation): The Java RMI protocol, which relies entirely on serialization, is a cornerstone for remote communication in Java applications.
RMI over HTTP: This method is commonly used by Java-based thick client web applications, utilizing serialization for all object communications.
JMX (Java Management Extensions): JMX utilizes serialization for transmitting objects over the network.
Custom Protocols: In Java, the standard practice involves the transmission of raw Java objects, which will be demonstrated in upcoming exploit examples.
A class that implements Serializable
can implement as transient
any object inside the class that shouldn't be serializable. For example:
In scenarios where certain objects must implement the Serializable
interface due to class hierarchy, there's a risk of unintentional deserialization. To prevent this, ensure these objects are non-deserializable by defining a final
readObject()
method that consistently throws an exception, as shown below:
Customizing java.io.ObjectInputStream
is a practical approach for securing deserialization processes. This method is suitable when:
The deserialization code is under your control.
The classes expected for deserialization are known.
Override the resolveClass()
method to limit deserialization to allowed classes only. This prevents deserialization of any class except those explicitly permitted, such as in the following example that restricts deserialization to the Bicycle
class only:
Using a Java Agent for Security Enhancement offers a fallback solution when code modification isn't possible. This method applies mainly for blacklisting harmful classes, using a JVM parameter:
It provides a way to secure deserialization dynamically, ideal for environments where immediate code changes are impractical.
Check and example in rO0 by Contrast Security
Implementing Serialization Filters: Java 9 introduced serialization filters via the ObjectInputFilter
interface, providing a powerful mechanism for specifying criteria that serialized objects must meet before being deserialized. These filters can be applied globally or per stream, offering a granular control over the deserialization process.
To utilize serialization filters, you can set a global filter that applies to all deserialization operations or configure it dynamically for specific streams. For example:
Leveraging External Libraries for Enhanced Security: Libraries such as NotSoSerial, jdeserialize, and Kryo offer advanced features for controlling and monitoring Java deserialization. These libraries can provide additional layers of security, such as whitelisting or blacklisting classes, analyzing serialized objects before deserialization, and implementing custom serialization strategies.
NotSoSerial intercepts deserialization processes to prevent execution of untrusted code.
jdeserialize allows for the analysis of serialized Java objects without deserializing them, helping identify potentially malicious content.
Kryo is an alternative serialization framework that emphasizes speed and efficiency, offering configurable serialization strategies that can enhance security.
Deserialization and ysoserial talk: http://frohoff.github.io/appseccali-marshalling-pickles/
Talk about gadgetinspector: https://www.youtube.com/watch?v=wPbW6zQ52w8 and slides: https://i.blackhat.com/us-18/Thu-August-9/us-18-Haken-Automated-Discovery-of-Deserialization-Gadget-Chains.pdf
Java and .Net JSON deserialization paper: https://www.blackhat.com/docs/us-17/thursday/us-17-Munoz-Friday-The-13th-JSON-Attacks-wp.pdf, talk: https://www.youtube.com/watch?v=oUAeWhW5b8c and slides: https://www.blackhat.com/docs/us-17/thursday/us-17-Munoz-Friday-The-13th-Json-Attacks.pdf
Deserialziations CVEs: https://paper.seebug.org/123/
Find whats is JNDI Injection, how to abuse it via RMI, CORBA & LDAP and how to exploit log4shell (and example of this vuln) in the following page:
JNDI - Java Naming and Directory Interface & Log4ShellThe Java Message Service (JMS) API is a Java message-oriented middleware API for sending messages between two or more clients. It is an implementation to handle the producer–consumer problem. JMS is a part of the Java Platform, Enterprise Edition (Java EE), and was defined by a specification developed at Sun Microsystems, but which has since been guided by the Java Community Process. It is a messaging standard that allows application components based on Java EE to create, send, receive, and read messages. It allows the communication between different components of a distributed application to be loosely coupled, reliable, and asynchronous. (From Wikipedia).
There are several products using this middleware to send messages:
So, basically there are a bunch of services using JMS on a dangerous way. Therefore, if you have enough privileges to send messages to this services (usually you will need valid credentials) you could be able to send malicious objects serialized that will be deserialized by the consumer/subscriber. This means that in this exploitation all the clients that are going to use that message will get infected.
You should remember that even if a service is vulnerable (because it's insecurely deserializing user input) you still need to find valid gadgets to exploit the vulnerability.
The tool JMET was created to connect and attack this services sending several malicious objects serialized using known gadgets. These exploits will work if the service is still vulnerable and if any of the used gadgets is inside the vulnerable application.
JMET talk: https://www.youtube.com/watch?v=0h8DWiOWGGA
In the context of .Net, deserialization exploits operate in a manner akin to those found in Java, where gadgets are exploited to run specific code during the deserialization of an object.
The source code should be inspected for occurrences of:
TypeNameHandling
JavaScriptTypeResolver
The focus should be on serializers that permit the type to be determined by a variable under user control.
The search should target the Base64 encoded string AAEAAAD///// or any similar pattern that might undergo deserialization on the server-side, granting control over the type to be deserialized. This could include, but is not limited to, JSON or XML structures featuring TypeObject
or $type
.
In this case you can use the tool ysoserial.net in order to create the deserialization exploits. Once downloaded the git repository you should compile the tool using Visual Studio for example.
If you want to learn about how does ysoserial.net creates it's exploit you can check this page where is explained the ObjectDataProvider gadget + ExpandedWrapper + Json.Net formatter.
The main options of ysoserial.net are: --gadget
, --formatter
, --output
and --plugin
.
--gadget
used to indicate the gadget to abuse (indicate the class/function that will be abused during deserialization to execute commands).
--formatter
, used to indicated the method to serialized the exploit (you need to know which library is using the back-end to deserialize the payload and use the same to serialize it)
--output
used to indicate if you want the exploit in raw or base64 encoded. Note that ysoserial.net will encode the payload using UTF-16LE (encoding used by default on Windows) so if you get the raw and just encode it from a linux console you might have some encoding compatibility problems that will prevent the exploit from working properly (in HTB JSON box the payload worked in both UTF-16LE and ASCII but this doesn't mean it will always work).
--plugin
ysoserial.net supports plugins to craft exploits for specific frameworks like ViewState
--minify
will provide a smaller payload (if possible)
--raf -f Json.Net -c "anything"
This will indicate all the gadgets that can be used with a provided formatter (Json.Net
in this case)
--sf xml
you can indicate a gadget (-g
)and ysoserial.net will search for formatters containing "xml" (case insensitive)
ysoserial examples to create exploits:
ysoserial.net has also a very interesting parameter that helps to understand better how every exploit works: --test
If you indicates this parameter ysoserial.net will try the exploit locally, so you can test if your payload will work correctly.
This parameter is helpful because if you review the code you will find chucks of code like the following one (from ObjectDataProviderGenerator.cs):
This means that in order to test the exploit the code will call serializersHelper.JsonNet_deserialize
In the previous code is vulnerable to the exploit created. So if you find something similar in a .Net application it means that probably that application is vulnerable too.
Therefore the --test
parameter allows us to understand which chunks of code are vulnerable to the desrialization exploit that ysoserial.net can create.
Take a look to this POST about how to try to exploit the __ViewState parameter of .Net to execute arbitrary code. If you already know the secrets used by the victim machine, read this post to know to execute code.
To mitigate the risks associated with deserialization in .Net:
Avoid allowing data streams to define their object types. Utilize DataContractSerializer
or XmlSerializer
when possible.
For JSON.Net
, set TypeNameHandling
to None
: %%%TypeNameHandling = TypeNameHandling.None%%%
Avoid using JavaScriptSerializer
with a JavaScriptTypeResolver
.
Limit the types that can be deserialized, understanding the inherent risks with .Net types, such as System.IO.FileInfo
, which can modify server files' properties, potentially leading to denial of service attacks.
Be cautious with types having risky properties, like System.ComponentModel.DataAnnotations.ValidationException
with its Value
property, which can be exploited.
Securely control type instantiation to prevent attackers from influencing the deserialization process, rendering even DataContractSerializer
or XmlSerializer
vulnerable.
Implement white list controls using a custom SerializationBinder
for BinaryFormatter
and JSON.Net
.
Stay informed about known insecure deserialization gadgets within .Net and ensure deserializers do not instantiate such types.
Isolate potentially risky code from code with internet access to avoid exposing known gadgets, such as System.Windows.Data.ObjectDataProvider
in WPF applications, to untrusted data sources.
Java and .Net JSON deserialization paper: https://www.blackhat.com/docs/us-17/thursday/us-17-Munoz-Friday-The-13th-JSON-Attacks-wp.pdf, talk: https://www.youtube.com/watch?v=oUAeWhW5b8c and slides: https://www.blackhat.com/docs/us-17/thursday/us-17-Munoz-Friday-The-13th-Json-Attacks.pdf
In Ruby, serialization is facilitated by two methods within the marshal library. The first method, known as dump, is used to transform an object into a byte stream. This process is referred to as serialization. Conversely, the second method, load, is employed to revert a byte stream back into an object, a process known as deserialization.
For securing serialized objects, Ruby employs HMAC (Hash-Based Message Authentication Code), ensuring the integrity and authenticity of the data. The key utilized for this purpose is stored in one of several possible locations:
config/environment.rb
config/initializers/secret_token.rb
config/secrets.yml
/proc/self/environ
Ruby 2.X generic deserialization to RCE gadget chain (more info in https://www.elttam.com/blog/ruby-deserialization/):
Other RCE chain to exploit Ruby On Rails: https://codeclimate.com/blog/rails-remote-code-execution-vulnerability-explained/
As explained in this vulnerability report, if some user unsanitized input reaches the .send()
method of a ruby object, this method allows to invoke any other method of the object with any parameters.
For example, calling eval and then ruby code as second parameter will allow to execute arbitrary code:
Moreover, if only one parameter of .send()
is controlled by an attacker, as mentioned in the previous writeup, it's possible to call any method of the object that doesn't need arguments or whose arguments have default values.
For this, it's possible to enumerate all the methods of the object to find some interesting methods that fulfil those requirements.
This technique was taken from this blog post.
There are other Ruby libraries that can be used to serialize objects and therefore that could be abused to gain RCE during an insecure deserialization. The following table shows some of these libraries and the method they called of the loaded library whenever it's unserialized (function to abuse to get RCE basically):
Library | Input data | Kick-off method inside class |
Marshal (Ruby) | Binary |
|
Oj | JSON |
|
Ox | XML |
|
Psych (Ruby) | YAML |
|
JSON (Ruby) | JSON |
|
Basic example:
In the case of trying to abuse Oj, it was possible to find a gadget class that inside its hash
function will call to_s
, which will call spec, which will call fetch_path which was possible to make it fetch a random URL, giving a great detector of these kind of unsanitized deserialization vulnerabilities.
Moreover, it was found that with the previous technique a folder is also created in the system, which is a requirement to abuse another gadget in order to transform this into a complete RCE with something like:
Check for more details in the original post.
Learn & practice AWS Hacking:HackTricks Training AWS Red Team Expert (ARTE) Learn & practice GCP Hacking: HackTricks Training GCP Red Team Expert (GRTE)