Daneel: Type inference for Dalvik bytecode
In the last blog post about Daneel I mentioned one particular caveat of Dalvik bytecode, namely the existence of untyped instructions, which has a huge impact on how we transform bytecode. I want to take a similar approach as last time and look at one specific example to illustrate those implications. So let us take a look at the following Java method.
public float untyped(float[] array, boolean flag) { if (flag) { float delta = 0.5f; return array[7] + delta; } else { return 0.2f; } }
The above is a straightforward snippet and most of you probably know how the generated Java bytecode will look like. So let’s jump right to the Dalvik bytecode and discuss that in detail.
UntypedSample.untyped:([FZ)F: [regs=5, ins=3, outs=0] 0000: if-eqz v4, 0009 0002: const/high16 v0, #0x3f000000 0004: const/4 v1, #0x7 0005: aget v1, v3, v1 0007: add-float/2addr v0, v1 0008: return v0 0009: const v0, #0x3e4ccccd 000c: goto 0008
Keep in mind that Daneel doesn’t like to remember things, so he wants to look through the code just once from top to bottom and emit Java bytecode while doing so. He gets really puzzled at certain points in the code.
- Label 2: What is the type of register
v0
? - Label 4: What is the type of register
v1
? - Label 9: Register
v0
again? What’s the type at this point?
You, as a reader, do have the answer because you know and understand the semantic of the underlying Java code, but Daneel doesn’t, so he tries to infer the types. Let’s look through the code in the same way Daneel does.
At method entry he knows about the types of method parameters. Dalvik passes parameters in the last registers (in this case in v3
and v4
). Also we have a register (in this case v2
) holding a this
reference. So we start out with the following register types at method entry.
UntypedSample.untyped:([FZ)F: [regs=5, ins=3, outs=0] uninit uninit object [float bool
The array to the right represents the inferred register types at each point in the instruction stream as determined by the abstract interpreter. Note that we also have to keep track of the dimension count and the element type for array references. Now let’s look at the first block of instructions.
0002: const/high16 v0, #0x3f000000 u32 uninit object [float bool 0004: const/4 v1, #0x7 u32 u32 object [float bool 0005: aget v1, v3, v1 u32 float object [float bool 0007: add-float/2addr v0, v1 float float object [float bool
Each line shows the register type after the instruction has been processed. At each line Daneel learns something new about the register types.
- Label 2: I don’t know the type of
v0
, only that it holds an untyped 32-bit value. - Label 4: Same applies for
v1
here, it’s an untyped 32-bit value as well. - Label 5: Now I know
v1
is used as an array index, it must have been an integer value. Also the array reference in registerv3
is accessed, so I know the result is a float value. The result is stored inv1
, overwriting it’s previous content. - Label 7: Now I know
v0
is used in a floating-point addition, it must have been a float value.
Keep in mind that at each line, Daneel emits appropriate Java bytecode. So whenever he learns the concrete type of a register, he might need to retroactively patch previously emitted instructions, because some of his assumptions about the type were broken.
Finally we look at the second block of instructions reached through the conditional branch as part of the if
-statement.
0009: const v0, #0x3e4ccccd u32 uninit object [float bool 000c: goto 0008 float uninit object [float bool
When reaching this block we basically have the same information as at method entry. Again Daneel learns in the process.
- Label 9: I don’t know the type of
v0
, only that it holds an untyped 32-bit value. - Label 12: Now I know that
v0
has to be a float value because the unconditional branch targets the join-point at label 8. And I already looked at that code and know that we expect a float value in that register at that point.
This illustrates why our abstract interpreter also has to remember and merge register type information at each join-point. It’s important to keep in mind that Daneel follows the instruction stream from top to bottom, as opposed to the control-flow of the code.
Now imagine scrambling up the code so that instruction stream and control-flow are vastly different from each other, together with a few exception handlers and an optimal register re-usage as produced by some SSA representation. That’s where Daneel still keeps choking at the moment. But we can handle most of the code produced by the dx
tool already and will hunt down all those nasty bugs triggered by obfuscated code as well.
Disclaimer: The abstract interpreter and the method rewriter were mostly written by Rémi Forax, with this post I take no credit for it’s implementation whatsoever, I just want to explain how it works.
Keep in mind that Daneel
Keep in mind that Daneel doesn’t like to remember things, so he wants to look through the code just once from top to bottom and emit Java bytecode while doing so.
Extremely good post. Really
Extremely good post. Really thank you! I will read on…
Wonderful information. Keep
Wonderful information. Keep up the great work.
I found Hubwit as a
I found Hubwit as a transparent s ite, a social hub which is a conglomerate of Buyers and Sellers who are ready to offer online digital consultancy at decent cost.
One of the benefits of type
One of the benefits of type inference is that it can help improve the performance of Dalvik bytecode. Type inference can help the Dalvik VM better understand the types of data being used by a program and optimize its execution accordingly. In some cases, type inference can also help to eliminate entire classes of errors that would otherwise be present in bytecode.
Keep sharing the helpful
Keep sharing the helpful information.
Thanks for the information.
Thanks for the information. Keep sharing.
Helpful information. Thanks
Helpful information. Thanks for sharing.
Thank you; I've recently been
Thank you; I've recently been looking for information on this issue.
Great website. Very
Great website. Very informative and helpful. https://www.concretecontractorsfortworth.com/stamped-concrete-fort-worth...
great post home slice! I wish
great post home slice! I wish i could write code like this!
See this first:
See this first: https://www.tileandgroutcleaningprosgoodyear.com
Dalvik bytecode is notoriously difficult to work with, due to its lack of type information. However, recent research has shown that it is possible to perform type inference on Dalvik bytecode, which can help make working with it much easier. Type inference can be used to automatically detect the types of variables and methods, as well as to automatically generate type-safe code. This can make working with Dalvik bytecode much easier and more efficient, as well as help to improve the safety and security of Android applications. Type inference is an important tool for developers who want to work with Dalvik bytecode, and it is something that all Android developers should be aware of.
great post! as always your
great post! as always your content is superior to the other blogs on this subject! Cheers!
Bytecode help, thank you for
Bytecode help, thank you for sharing, really will help https://northamptonplasteringpros.com
Excellent insights into
Excellent insights into Dalvik bytecode. Thank you for your efforts in sharing this type of information; I greatly appreciate it. It will help me improve my knowledge about transforming bytecodes.
Hye. I think this is one of
Hye. I think this is one of the best sources. Thanks
Įvairios patrauklios
Įvairios patrauklios pakuotės, kurios patiks visiems. Užsisakykite kokybiškas pakuotes. <a href="https://iprint.lt/pakuote.html">Pakuote</a>
Now we agree here, I'd add a
Now we agree here, I'd add a few more points to support it.
Bonita Springs Tow Truck have
Bonita Springs Tow Truck have been in the towing industry for many years, committed to the towing of accident vehicles and mechanical failures of any nature.
https://bonitaspringstowtruck.com/
Will note that we also have
Will note that we also have to keep track of the dimension count and the element type for array references.
Excellent information about
Excellent information about Dalvik bytecode. Thank you for taking the time to share this knowledge with me; I really appreciate it. It will help me better my understanding of bytecode transformation.
This was very helpful for one
This was very helpful for one of our mechanics. Many thanks!
Nice post! This is actually
Nice post! This is actually made my day.
https://www.parratreeremoval.com.au
Vienos geriausiu virtuves
Vienos geriausiu virtuves kedziu jusu regione. Uzsisakykite kokybisku virtuves kedziu dabar ir megaukites patogumu.
Just wondering what it is.
Just wondering what it is.
Toothaches are common and can
Toothaches are common and can range in intensity from minor to severe. The pain can be so severe that it drives you crazy! -
This is new to me
This is new to me
Good read! An article like
Good read! An article like this is worth reading https://www.raytowntowtruck.com/
Really useful source of
Really useful source of information. Thanks for sharing
Here at Bel Air Tow Truck, we
Here at Bel Air Tow Truck, we strive to provide a great towing experience, and we do not settle for less. We offer a variety of services that will allow you and your company, or family to rest assure they are getting the best rate as possible.
https://belairtowtruck.com/
Thanks for this great
Thanks for this great information you've shared. https://www.sugarcreektowtruck.com/
This is very refreshing!
This is very refreshing! hope to read another kind of article like this.
https://www.northshorecommercialsolar.com.au
Thanks for sharing this
Thanks for sharing this one https://www.kansascitymotowingservice.com/
I am glad to read the
I am glad to read the information you shared https://www.kansascitymotowtruck.com/
I like this
I like this one https://www.independencetowingservice.com/
Northern Virginia Equipment
Northern Virginia Equipment Transport is a local service that you can trust to get your large valuable assets to where they need to be.
http://novaequipmenttransport.com
How nice, great
How nice, great post! https://www.bluespringstowtruck.com/
Really interesting article, I
Really interesting article, I hope you produce more of it. Thank you for sharing. https://www.treeserviceschelmsford.co.uk
I agree with you the content
I agree with you the content was very informative. Thanks for sharing.[URL=https://www.treeserviceshighwycombe.co.uk/]Tree Surgeon High Wycombe[/URL]
Love this thread, thank you
Love this thread, thank you for sharing https://greensborotowtruck.com/
Thanks for sharing this great
Thanks for sharing this great stuff here https://fayettevilletowtruck.com/
Glad to check this
Glad to check this site https://durhamtowtruck.com/
Thanks for this great post
Thanks for this great post you shared http://www.charlotteareatowingservice.com/
Today, Culpeper Tow Truck is
Today, Culpeper Tow Truck is one of the top towing companies in the state and can provide a wide range of towing and roadside assistance services.
http://culpepertowtruck.com/
Delray Beach Towing Service
Delray Beach Towing Service is the premier towing company serving Delray Beach, FL and the surrounding region.
https://delraybeachtowingservice.com
Good job. Very informative
Good job. Very informative article. https://ketteringtowingservice.com/
Awesome site to check
Awesome site to check in. https://daytontowingservice.com/
Thank you for taking the time
Thank you for taking the time to publish this information https://athenstowtruck.com/
Thank you for sharing. This
Thank you for sharing. This content has a detailed information https://towinglexington.com/
I am appreciating it very
I am appreciating it very much! http://www.towingcolumbiasc.com/